Selected Publications
2025
-
ChengXiang Zhai, Information Retrieval for Artificial General Intelligence: A New Perspective of Information Retrieval Research, Proceedings of
ACM SIGIR 2025, 2025
2024
-
ChengXiang Zhai. 2024. Large Language Models and Future of Information Retrieval: Opportunities and Challenges. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR '24). Association for Computing Machinery, New York, NY, USA, 481–490. https://doi.org/10.1145/3626772.3657848
2021
- Safa Messaoud, Ismini Lourentzou, Assma Boughoula, Mona Zehni, Zhizhen Zhao, Chengxiang Zhai and Alexander Schwing.
DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization , In Proceedgings of SIGIR 2021. To appear.
- Saar Kuzi, ChengXiang Zhai. A Study of Distributed Representations for Figures of Research Articles. In Proceedings of ECIR 2021. To appear.
- Dominic Seyler, Wei Liu, Xiaofeng Wang and Chengxiang Zhai. Towards Dark Jargon Interpretation in Underground Forums. In Proceedings of ECIR 2021. To appear.
2020
-
Yiren Wang, ChengXiang Zhai, Hany Hassan Awadalla. Multi-task Learning for Multilingual Neural Machine Translation.
In 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP-20)
- S. Zhao, M. Jiang, B. Qin, T. Liu, C. Zhai and F. Wang, Structural and Textual Information Fusion for Symptom and Disease Representation Learning, in IEEE Transactions on Knowledge and Data Engineering, doi: 10.1109/TKDE.2020.3039469.
- Shubhra (Santu) K. Karmaker, Parikshit Sondhi, and ChengXiang Zhai. 2020. Empirical Analysis of Impact of Query-Specific Customization of nDCG: A Case-Study with Learning-to-Rank Methods. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM '20). Association for Computing Machinery, New York, NY, USA, 3281–3284. DOI:https://doi.org/10.1145/3340531.3417454
- Michael Jeffrey Volk, Ismini Lourentzou, Shekhar Mishra, Lam Tung Vo, Chengxiang Zhai, and Huimin Zhao. Biosystems Design by Machine Learning. ACS Synthetic Biology (2020).
- Dominic Seyler, ChengXiang Zhai:
A Study of Methods for the Generation of Domain-Aware Word Embeddings. SIGIR 2020: 1609-1612
-
Saar Kuzi, ChengXiang Zhai, Yin Tian, Haichuan Tang:
FigExplorer: A System for Retrieval and Exploration of Figures from Collections of Research Articles. SIGIR 2020: 2133-2136, Demo description.
- Yiren Wang, Lijun Wu, Yingce Xia, Tao Qin, ChengXiang Zhai, Tie-Yan Liu.
Transductive Ensemble Learning for Neural Machine Translation. AAAI 2020: 6291-6298
- Alex Morales, Kanika Narang, Hari Sundaram, Chengxiang Zhai:
CrowdQM: Learning Aspect-Level User Reliability and Comment Trustworthiness in Discussion Forums. PAKDD (1) 2020: 592-605
- Bhavya, Assma Boughoula, Aaron Green, ChengXiang Zhai:
Collective Development of Large Scale Data Science Products via Modularized Assignments: An Experience Report. SIGCSE 2020: 1200-1206
2019
-
Xiao, Yuxin, Zecheng Zhang, Carl Yang, and Chengxiang Zhai. Non-local Attention Learning on Large Heterogeneous Information Networks. In 2019 IEEE International Conference on Big Data (Big Data), pp. 978-987. IEEE, 2019.
- Sahiti Labhishetty, Bhavya, Kevin Pei, Assma Boughoula, and Chengxiang Zhai.
Web of Slides: Automatic Linking of Lecture Slides to Facilitate Navigation.
In Proceedings of the Sixth (2019) ACM Conference on Learning @ Scale (L@S' 19), Article 54, 14.
- Shubhra Kanti Karmaker Santu, Kalyan Veeramachaneni, ChengXiang Zhai,
TILM: Neural Language Models with Evolving Topical Infuence,
In Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL), 2019, pp. 778-788.
- Yiren Wang, Yingce Xia, Fei Tian, Fei Gao, Tao Qin, ChengXiang Zhai, Tie-Yan Liu,
Neural Machine Translation with Soft Prototype,
In Proceedings of NeurIPS 2019 (NeurIPS'19 ), 2019, pp. 6313-6322.
- Zhenya Huang, Qi Liu, Chengxiang Zhai, Yu Yin, Enhong Chen, Weibo Gao, Guoping
Hu,
Exploring Multi-Objective Exercise Recommendations in Online Education Systems,
In Proceedings of ACM CIKM 2019 (CIKM'19), 2019, pp.1261-1270.
- Saar Kuzi, Sahiti Labhishetty, Shubhra Kanti Karmaker Santu, Prasad Pradip Joshi,
ChengXiang Zhai,
Analysis of Adaptive Training for Learning to Rank in Information
Retrieval,
In Proceedings of ACM CIKM 2019 (CIKM'19), 2019, pp.2325-2328.
- Saar Kuzi, Abhishek Narwekar, Anusri Pampari, ChengXiang Zhai,
Help Me Search: Leveraging User-System Collaboration for Query Construction to Improve Accuracy for Difficult Queries, In Proceedings of ACM SIGIR 2019 (SIGIR'19), pp. 1221-1224,
2019.
- Saar Kuzi, William Cope, Duncan Ferguson, Chase Geigle and Chengxiang Zhai. Automatic
Assessment of Complex Assignments using Topic Models, In Proceedings of 2019
ACM Learning at Scale (L@S'19), Article 13, 10 pages. 2019,
DOI: https://doi.org/10.1145/3330430.3333615.
- Chase Geigle, Himel Dev, Hari Sundaram, ChengXiang Zhai,
A Generative Model for Discovering
Action-Based Roles and Community Role Compositions on Community Question
Answering Platforms, In Proceedings of the 13th International AAAI Conference on
Web and Social Media (ICWSM'19), 2019, 181-192
- Ismini Lourentzou, Kabir Manghnani, ChengXiang Zhai, Adapting Sequence to Sequence
models for Text Normalization in Social Media, In Proceedings of the 13th International
AAAI Conference on Web and Social Media (ICWSM'19), 2019, pp. 335-345.
- Anjan Goswami, Prasant Mohapatra, Chengxiang Zhai.
Quantifying and Visualizing the Demand and Supply Gap from E-commerce Search Data using Topic Models, In
Proceedings of WWW (Companion Volume) 2019 (WWW'19), 2019, pp. 348-353.
- Yiren Wang, Yingce Xia, Tianyu He, Fei Tian, Tao Qin, ChengXiang Zhai, Tie-Yan Liu.
Multi-Agent Dual Learning, In Proceedings of the Seventh International Conference on
Learning Representations (ICLR'19), 2019.
- Saar Kuzi, ChengXiang Zhai. Figure Retrieval from Collections of Research Articles,
In Proceedings of 2019 European Conference on Information Retrieval (ECIR'19), 2019,
696-710.
- YirenWang, Fei Tian, Di He, Tao Qin, ChengXiang Zhai, Tie-Yan Liu. Non-Autoregressive
Machine Translation with Auxiliary Regularization, In Proceedings of the 33rd AAAI
Conference on Articial Intelligence (AAAI'19) , 2019.
- Brown, Nicole; Mendenhall, Ruby; Black, Michael; Moer, Mark ; Flynn, Karen ; McKee,
Malaika ; Zerai, Assata ; Lourentzou, Ismini ; Zhai, ChengXiang. (2019).
In Search of Zora/When Metadata Isn't Enough: Rescuing the Experiences of Black Women Through
Statistical Modeling. Journal of Library Metadata, 1-22. 10.1080/19386389.2019.1652967.
2018
- Shubhra Kanti Karmaker Santu, Chase Geigle, Duncan Ferguson, William Cope, Mary
Kalantzis, Duane Searsmith, Chengxiang Zhai.
SOFSAT: Towards a Setlike Operator-based Framework for Semantic Analysis of Text,
ACM SIGKDD Explorations Newsletter , 20(2), pp. 21-30, 2018.
- S. Lohmann, BX White, Z. Zuo, MS. Chan, A. Morales, B. Li, C. Zhai, D. Albarracn.
HIV messaging on Twitter: an analysis of current practice and data-driven recommendations,
AIDS, 2018 Nov 28;32(18):2799-2805. doi: 10.1097/QAD.0000000000002018.
- Shan Jiang, ChengXiang Zhai, Qiaozhu Mei.
Exploiting Knowledge Graph to Improve Text-based Prediction,
Proceedings of 2018 IEEE Conference on BigData , 2018. pdf
- Alex Morales, Nupoor Gandhi, Man-pui Sally Chan, Sophie Lohmann, Travis Sanchez, Lyle Ungar, Dolores Albaracin, and Chengxiang Zhai.
Multi-Attribute Topic Feature Construction for Social Media-based Prediction,
Proceedings of 2018 IEEE Conference on BigData , 2018.
-
Shubhra Kanti Karmaker Santu, Vincent Bindschaedler, Chengxiang Zhai and Carl A. Gunter.
NRF: a Naive Probabilistic Re-identification Framework,
Proceedings of the 17th Workshop on Privacy in the Electronic Society, 2018 (held in conjunction with ACM CCS 2018). pdf
- Chase Geigle, Qiaozhu Mei, ChengXiang Zhai.
Feature Engineering for Text Data.
In G. Dong and H. Liu (editors), Feature Engineering for Machine Learning and Data Analytics, CRC Press, forthcoming, 2018.
- Shubhra Kanti Karmaker Santu, Liangda Li, Yi Chang, ChengXiang Zhai,
JIM: Joint Influence Modeling for Collective Search Behavior,
Proceedings of ACM CIKM 2018,
- Xueqing Liu, Yue Leng, Wei Yang, Chengxiang Zhai, Tao Xie,
Mining Android app descriptions for permission requirements recommendation,
Proceedings of the International Requirements Engineering Conference, 2018.
- Xueqing Liu, Yue Leng, Wei Yang, Wenyu Wang, Chengxiang Zhai and Tao Xie.
A Large-scale Empirical Study on Android Runtime-Permission Rationales,
Proceedings of the IEEE Symposium on Visual Languages and Human-Centric Computing, 2018.
-
Man-pui Sally Chan, Sophie Lohmann, Alex Morales, Chengxiang Zhai, Lyle Ungar, David R Holtgrave, Dolores Albarracín,
An Online Risk Index for the Cross-Sectional Prediction of New HIV Chlamydia, and Gonorrhea Diagnoses Across US Counties and Across Years,
AIDS and Behavior, 2018 Jul;22(7):2322-2333. doi: 10.1007/s10461-018-2046-0.
-
Chase Geigle, Ismini Lourentzou, Hari Sundaram, ChengXiang Zhai,
CLaDS: a cloud-based virtual lab for the delivery of scalable hands-on assignments for practical data science education,
Proceedings of ACM ITiCSE 2018, pp. 176-181, 2018. pdf
- Yixing Fan, Jiafeng Guo, Yanyan Lan, Jun Xu, Chengxiang Zhai, Xueqi Cheng,
Modeling Diverse Relevance Patterns in Ad-hoc Retrieval,
Proceedings of ACM SIGIR 2018, pp. 375-384, 2018.
- Enrique Amigó, Hui Fang, Stefano Mizzaro, ChengXiang Zhai.
Are we on the Right Track?: An Examination of Information Retrieval Methodologies.
Proceedings of ACM SIGIR 2018, pp. 997-1000, 2018.
- Parikshit Sondhi, Mohit Sharma, Pranam Kolari, ChengXiang Zhai.
A Taxonomy of Queries for E-commerce Search
Proceedings of ACM SIGIR 2018, pp. 1245-1248, 2018. PDF
2017
- Chase Geigle, ChengXiang Zhai.
Modeling MOOC Student Behavior With Two-Layer Hidden Markov Models,
Journal of Educational Data Mining, 9(1): 1-24 (2017)
- Peng Bao, Chengxiang Zhai.
Dynamic credit allocation in scientific literature,
Scientometrics, 112(1): 595-606 (2017).
- Alex Morales, Chengxiang Zhai,
Identifying Humor in Reviews using Background Text Sources,
Proceedings of EMNLP 2017 , 492-501, 2017.
- Yinan Zhang, Xueqing Liu, ChengXiang Zhai,
Information Retrieval Evaluation as Search Simulation: A General Formal Framework for IR Evaluation,
Proceedings of ACM ICTIR 2017 , 193-200, 2017. PDF
- Yiren Wang, Dominic Seyler, Shubhra Kanti Karmaker Santu, ChengXiang Zhai,
A Study of Feature Construction for Text-based Forecasting of Time Series Variables,
Proceedings of ACM CIKM 2017 , 2347-2350, 2017.
- Mingjie Qian, Jyotishman Pathak, Naveen L. Pereira, Chengxiang Zhai.
Temporal reflected logistic regression for probabilistic heart failure survival score prediction,
Proceedings of IEEE BIBM 2017 , pp. 410-416, 2017.
-
Edward W. Huang, Sheng Wang, Doris Jung-Lin Lee, Runshun Zhang, Baoyan Liu, Xuezhong Zhou, ChengXiang Zhai.
Framing Electronic Medical Records as Polylingual Documents in Query Expansion,
Proceedings of American Medical Informatics Association 2017 Annual Symposium , 2017.
- Ismini Lourentzou, Alex Morales, ChengXiang Zhai,
Text-based geolocation prediction of social media users with neural networks,
Proceedings of IEEE BigData 2017, 696-705, 2017.
- Rongda Zhu, Lingxiao Wang, Chengxiang Zhai, Quanquan Gu.
High-Dimensional Variance-Reduced Stochastic Gradient Expectation-Maximization Algorithm,
Proceedings of ICML 2017 , pp. 4180-4188, 2017.
- Peilin Yang, Mianwei Zhou, Yi Chang, Chengxiang Zhai, Hui Fang.
Towards Privacy-Preserving Evaluation for Information Retrieval Models Over Industry Data Sets,
Proceedings of AIRS 2017 , 210-221, 2017.
- Shubhra Kanti Karmaker Santu, Parikshit Sondhi, ChengXiang Zhai.
On Application of Learning to Rank for E-Commerce Search,
Proceedings of ACM SIGIR 2017 , pp. 475-484, 2017.
- Edward W. Huang, Sheng Wang, Bingxue Li, Ran Zhang, Baoyan Liu, Runshun Zhang, Jie Liu, Xuezhong Zhou, Hongsheng Lin, ChengXiang Zhai.
HEMnet: Integration of Electronic Medical Records with Molecular Interaction Networks and Domain Knowledge for Survival Analysis,
Proceedings of ACM BCB 2017 , pp. 378-387, 2017.
-
Shubhra Kanti Karmaker Santu, Liangda Li, Dae Hoon Park, Chengxiang Zhai, Yi Chang,
Modeling the Influence of Popular Trending Events on User Search Behavior,
Proceedings of WWW 2017 , pp. 535-544, 2017.
-
Xueqing Liu, Chengxiang Zhai, Wei Han, Onur Gungor,
Numerical Range Facets Partition: Evaluation Metric and Methods,
Proceedings of WWW 2017. pp.662-671, 2017.
- Sendong Zhao, Meng Jiang, Quan Yuan, Bing Qin, Ting Liu, ChengXiang Zhai.
ContextCare: Incorporating Contextual Information Networks to Representation Learning on Medical Forum Data,
Proceedings of IJCAI 2017 , pp. 3497-3503, 2017.
-
Xiaolong Wang, Jingjing Wang, Chengxiang Zhai.
Dual-Clustering Maximum Entropy with Application to Classification and Word Embedding.
In Proceedings of the 31st AAAI conference on Artificial Intelligence, 2017, pp. 3323-3329, 2017.
-
Chase Geigle and Chengxiang Zhai,
Modeling MOOC Student Behavior With Two-Layer Hidden Markov Models.
Proceedings of ACM Learning at Scale 2017, pp. 205-208, 2017.
- Assma Boughoula, Chase Geigle, ChengXiang Zhai.
A Probabilistic Approach for Discovering Difficult Course Topics Using Clickstream Data , Proceedings of ACM Learning at Scale 2017 , pp. 303-306, 2017.
-
Sendong Zhao, Quan Wang, Sean Massung, Ting Liu, and ChengXiang Zhai.
Constructing and Embedding Abstract Event Causality Networks from Text Snippets,
Proceedings of WSDM 2017, pp. 335-344, 2017.
2016
- Sheng Wang, Edward Huang, Runshun Zhang, Xiaoping Zhang, Baoyan Liu, Xuezhong Zhou, ChengXiang Zhai.
A Conditional Probabilistic Model for Joint Analysis of Symptoms, Diagnoses, and Herbs in Traditional Chinese Medicine Patient Records.
Proceedings of IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pp. 411-418, 2016.
-
ChengXiang Zhai:
Towards a game-theoretic framework for text data retrieval.
IEEE Data Eng. Bull. 39(3): 51-62 (2016).
- Xiaolong Wang, Jingjing Wang, Jie Luo, Chengxiang Zhai, Yi Chang.
Blind Men and The Elephant Thurstonian Pairwise Preference for Ranking in Crowdsourcing,
Proceedings of ICDM 2016
- Rongda Zhu, Aston Zhang, Jian Peng, and Chengxiang Zhai,
Exploiting Temporal Divergence of Topic Distributions for Event Detection,
Proceedings of IEEE BigData 2016
- Edward W Huang, Sheng Wang, Runshun Zhang, Baoyan Liu, Xuezhong Zhou, ChengXiang Zhai.
PaReCat: Patient Record Subcategorization for Precision Traditional Chinese Medicine.
Proceedings of ACM BCB 2016, Oct. 2016.
- Shubhra Kanti Karmaker Santu, Parikshit Sondhi and ChengXiang Zhai.
Generative Feature Language Models for Mining Implicit Features from Customer Reviews.
Proceedings of ACM CIKM 2016
-
Dae Hoon Park, Yi Fang, Mengwen Liu, ChengXiang Zhai:
Mobile App Retrieval for Social Media Users via Inference of Implicit Intent in Social Media Text.
Proceedings of ACM CIKM 2016. pp. 959-968.
-
Shengwen Peng, Ronghui You, Hongning Wang, Chengxiang Zhai, Hiroshi Mamitsuka, Shanfeng Zhu:
DeepMeSH: deep semantic representation for improving large-scale MeSH indexing.
Bioinformatics 32(12): 70-79 (2016)
- Martin Leginus, ChengXiang Zhai, Peter Dolog:
Personalized generation of word clouds from tweets.
JASIST 67(5): 1021-1032 (2016)
-
Chase Geigle, ChengXiang Zhai:
Scaling up Online Question Answering via Similar Question Retrieval.
ACM Learning at Scale 2016: pp. 257-260, 2016.
-
Chase Geigle, ChengXiang Zhai, Duncan C. Ferguson:
An Exploration of Automated Grading of Complex Assignments.
ACM Learning at Scale 2016: pp. 351-360, 2016.
-
Yinan Zhang, ChengXiang Zhai:
A Sequential Decision Formulation of the Interface Card Model for Interactive IR.
Proceedings of ACM SIGIR 2016: pp.85-94, 2016.
-
Shan Jiang, Yuening Hu, Changsung Kang, Tim Daly Jr., Dawei Yin, Yi Chang, ChengXiang Zhai:
Learning Query and Document Relevance from a Web-scale Click Graph.
Proceedings of ACM SIGIR 2016: pp. 185-194, 2016.
- Sean Massung, ChengXiang Zhai,
Non-Native Text Analysis: A Survey,
Natural Language Engineering, Volume 22, Issue 2
March 2016, pp. 163-186.
2015
-
Thomas Zhang, Jason H. D. Cho, Chengxiang Zhai:
Understanding User Intents in Online Health Forums.
IEEE J. Biomedical and Health Informatics 19(4): 1392-1398 (2015)
-
Martin Leginus, ChengXiang Zhai, Peter Dolog:
Beomap: Ad Hoc Topic Maps for Enhanced Exploration of Social Media Data.
ICWE 2015: pp. 200-218, 2015.
-
Huizhong Duan, ChengXiang Zhai:
Mining Coordinated Intent Representation for Entity Search and Recommendation.
CIKM 2015: pp. 333-342, 2015.
-
Ismini Lourentzou, Graham Dyer, Abhishek Sharma, ChengXiang Zhai:
Hotspots of news articles: Joint mining of news text & social media to discover controversial points in news.
Big Data 2015: pp. 2948-2950, 2015.
- Jason H. D. Cho, Yanen Li, Roxana Girju, Chengxiang Zhai:
Recommending forum posts to designated experts.
Big Data 2015: pp. 659-666, 2015.
-
Sean Massung, ChengXiang Zhai:
SyntacticDiff: Operator-based transformation for comparative text mining.
Big Data 2015: pp. 571-580, 2015.
- Ke Liu, Shengwen Peng, Junqiu Wu, ChengXiang Zhai, Hiroshi Mamitsuka, Shanfeng Zhu:
MeSHLabeler: improving the accuracy of large-scale MeSH indexing by integrating diverse evidence.
Bioinformatics 31(12): 339-347 (2015)
-
Sheng Wang, Hyunghoon Cho, ChengXiang Zhai, Bonnie Berger, Jian Peng:
Exploiting ontology graph for predicting sparsely annotated gene function.
Bioinformatics 31(12): 357-364 (2015)
-
Yuanhua Lv, ChengXiang Zhai:
Negative query generation: bridging the gap between query likelihood retrieval models and relevance.
Inf. Retr. Journal 18(4): 359-378 (2015)
-
Kavita Ganesan, ChengXiang Zhai:
OpinoFetch: a practical and efficient approach to collecting opinions on arbitrary entities.
Inf. Retr. Journal 18(6): 530-558 (2015)
-
V. G. Vinod Vydiswaran, ChengXiang Zhai, Dan Roth, Peter Pirolli:
Overcoming bias to learn about controversial topics.
JASIST 66(8): 1655-1672 (2015)
- Hussein Hazimeh, ChengXiang Zhai,
Axiomatic Analysis of Smoothing Methods in Language Models for Pseudo-Relevance Feedback,
Proceedings of ACM SIGIR ICTIR 2015, pp. 141-150, 2015.
- Yinan Zhang, ChengXiang Zhai,
Information Retrieval as Card Playing: A Formal Model for Optimizing Interactive Retrieval Interface,
Proceedings of ACM SIGIR 2015, pp. 685-694, 2015.
- Dae Hoon Park, Hyun Duk Kim, ChengXiang Zhai, Lifan Guo,
Retrieval of Relevant Opinion Sentences for New Products,
Proceedings of ACM SIGIR 2015, pp. 393-402, 2015.
- Dae Hoon Park, Mengwen Liu, ChengXiang Zhai,
A Study of Retrieval Models for Mobile App Retrieval,
Proceedings of ACM SIGIR 2015, pp. 533-542, 2015.
- Mingjie Qian, ChengXiang Zhai,
Joint Adaptive Loss and L2/L0-norm Minimization for Unsupervised Feature Selection,
Proceedings of IJCNN 2015 , pp. 1-8, 2015.
- DaeHoon Park, ChengXiang Zhai, Lifan Guo,
SpecLDA: Modeling Product Reviews and Specifications to Generate Augmented Specifications,
Proceedings of 2015 SIAM International Conference on Data Mining (SDM'15), pp. 837-845, 2015.
2014
- Mingjie Qian, ChengXiang Zhai,
Unsupervised Feature Selection for Multi-View Clustering on Text-Image Web News Data,
Proceedings of ACM CIKM 2014, 2014.
- Parikshit Sondhi, ChengXiang Zhai,
Mining Semi-Structured Online Knowledge Bases to Answer Natural Language Questions on Community QA Websites,
Proceedings of ACM CIKM 2014, 2014.
- Shan Jiang, ChengXiang Zhai,
Random Walks on Adjacency Graphs for Mining Lexical Relations from Big Text Data,
Proceedings of IEEE BigData 2014, 2014. pdf
- Hyun Duk Cho, Jason H. D. Cho, Parikshit Sondhi, Chengxiang Zhai, and Bruce R. Schatz.
Resolving healthcare forum posts via similar thread retrieval.
Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (BCB '14), 2014.
- Thomas Zhang, Jason H. D. Cho, and Chengxiang Zhai.
Understanding user intents in online health forums,
Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (BCB '14), 2014.
- Sheng Wang, Yanen Li, Duncan Ferguson, and Chengxiang Zhai,
SideEffectPTM: An Unsupervised Topic Model to Mine Adverse Drug Reactions from Health Forums,
Proceedings of the 5th ACM Conference on Bioinformatics, Computational Biology, and Health Informatics (BCB '14), 2014.
- Yanen Li, Anlei Dong, Hongning Wang, Hongbo Deng, Yi Chang, ChengXiang Zhai,
A Two-dimensional Click Model for Query Auto-completion,
Proceedings of ACM SIGIR 2014 , 2014, pp. 455-464.
- Hui Fang, Hao Wu, Peilin Yang, ChengXiang Zhai,
VIRLab: A Web-based Virtual Lab for Learning and Studying Information Retrieval Models,
Proceedings of ACM SIGIR 2014, demo paper, 2014, pp. 1249-1250.
- Yanen Li, ChengXiang Zhai, Ye Chen,
Exploiting rich user information for one-class collaborative filtering,
Knowledge Information Systems, 38(2): 277-301 (2014).
- Hongning Wang, ChengXiang Zhai, Feng Liang, Anlei Dong, Yi Chang,
User Modeling in Search Logs via A Nonparametric Bayesian Approach,
Proceedings of WSDM 2014 , 2014.
2013
- Huizhong Duan, ChengXiang Zhai, Jinxing Cheng, Abhishek Gattani,
Supporting Keyword Search in Product
Database: A Probabilistic Approach,
Proceedings of VLDB 2014, PVLDB 6(14): 1786-1797 (2013).
- Huizhong Duan, ChengXiang Zhai, Jinxing Cheng, Abhishek Gattani,
A probabilistic mixture model for mining and analyzing product search log,
Proceedings of ACM CIKM 2013, pp. 2179-2188, 2013.
- Yanen Li, Bo-June Paul Hsu, ChengXiang Zhai,
Unsupervised identification of synonymous query intent templates for attribute intents,
Proceedings of ACM CIKM 2013, pp. 2029-2038, 2013.
- Chi Wang, Xiao Yu, Yanen Li, Chengxiang Zhai, Jiawei Han,
Content coverage maximization on word networks for hierarchical topic summarization,
Proceedings of ACM CIKM 2013, pp. 249-258, 2013.
- Yanen Li, Bo-June Paul Hsu, ChengXiang Zhai, Kuansan Wang,
Mining entity attribute synonyms via compact clustering,,
Proceedings of ACM CIKM 2013, pp. 867-872, 2013.
- Hyun Duk Kim, Malu Castellanos, Meichun Hsu, ChengXiang Zhai, Thomas A. Rietz, Daniel Diermeier,
Mining causal topics in text data: iterative topic modeling with time series feedback,
Proceedings of ACM CIKM 2013, pp. 885-890, 2013.
- Hyun Duk Kim, Malu Castellanos, Meichun Hsu, ChengXiang Zhai, Umeshwar Dayal, Riddhiman Ghosh,
Compact explanatory opinion summarization,
Proceedings of ACM CIKM 2013, pp. 1697-1702, 2013.
- Kumaresh Pattabiraman, Parikshit Sondhi, ChengXiang Zhai,
Exploiting Forum Thread Structures to Improve Thread Clustering,
Proceedings of ICTIR 2013, 2013.
- Hyun Duk Kim, Danila Nikitin, ChengXiang Zhai, Malu Castellanos, Meichun Hsu,
Information Retrieval with Time Series Query,
Proceedings of ICTIR 2013, 2013.
- Maryam Karimzadehgan, ChengXiang Zhai, Miles Efron,
Statistical Translation Language Model for Twitter Search,
Proceedings of ICTIR 2013, 2013.
- Xiaolong Wang, ChengXiang Zhai, Dan Roth,
Understanding Evolution of Research Themes: A Probabilistic Generative Model for Citations,
Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'13), pp. 1115-1123, 2013. pdf
- Mingjie Qian, ChengXiang Zhai.
Robust Unsupervised Feature Selection,
Proceedings of the 23rd International Joint Conference on Artificial Intelligence ( IJCAI'13), 2013, pp. 1621-1627, 2013.
- Hyun Duk Kim, Malu Castellanos, Meichun Hsu, ChengXiang Zhai, Umeshwar Dayal, Riddhiman Ghosh,
Ranking explanatory sentences for opinion summarization,
Proceedings of ACM SIGIR 2013 , pp. 1069-1072, 2013.
- Hongning Wang, ChengXiang Zhai, Anlei Dong, Yi Chang,
Content-Aware Click Modeling,
Proceedings of the World Wide Conference 2013 ( WWW'13),
pp. 1365-1376, 2013.
- Azadeh Shakery, ChengXiang Zhai.
Leveraging comparable corpora for cross-lingual information retrieval in resource-lean language pairs ,
Information Retrieval, 16(1), 1-29, 2013.
- Maryam Karimzadehgan, ChengXiang Zhai.
A Learning Approach to Optimizing Exploration-Exploitation Tradeoff in Relevance Feedback,
Information Retrieval , 16(3), 307-330, 2013.
2012
- Huizhong Duan, Emre Kiciman, ChengXiang Zhai,
Click Patterns: An Empirical Representation of Complex Query Intents ,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), . pdf
- Yue Lu, Hongning Wang, ChengXiang Zhai, Dan Roth,
Unsupervised Discovery of Opposing Opinion Networks From Forum Discussions,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), pp. 1642-1646. pdf
- Bin Tan, Yuanhua Lv, ChengXiang Zhai,
Mining long-lasting exploratory user interests from search history,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), pp. 1477-1481, 2012. pdf
- Yuanhua Lv, ChengXiang Zhai,
Query Likelihood with Negative Query Generation,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), pp. 1799-1803, 2012. pdf
- V.G.Vinod Vydiswaran, ChengXiang Zhai, Dan Roth, and Peter Pirolli,
BiasTrust: Teaching biased users about controversial topic,
Proceedings of the 21st ACM International Conference on Information and Knowledge Management (CIKM'12), pp. 1905-1909, 2012.
- Huizhong Duan, Yanen Li, ChengXiang Zhai and Dan Roth,
A Discriminative Model for Query Spelling Correction with Latent Structural SVM,
Proceedings of EMNLP-CoNLL 2012 (EMNLP'12), pages 1511-1521, 2012.
- Yanen Li, Huizhong Duan, ChengXiang Zhai,
A Generalized Hidden Markov Model with Discriminative Training for
Query Spelling Correction , Proceedings of ACM SIGIR 2012 (SIGIR'12), pages 611-620, 2012. pdf
- Parikshit Sondhi, Jimeng Sun, Hanghang Tong, ChengXiang Zhai,
SympGraph: A Mining Framework of Clinical Notes through Symptom Relation Graphs, Proceedings
of KDD 2012 (KDD'12), pages 1167-1175, 2012.
- Parikshit Sondhi, Jimeng Sun, ChengXiang Zhai, Robert Sorrentino and Martin S. Kohn,
Leveraging Medical Thesauri and Physician Feedback for Improving Medical Literature Retrieval for Case Queries,
Journal of American Medical Informatics Association (JAMIA), 19(5): 851-858 (2012).
- Kavita Ganesan, Chengxiang Zhai and Evelyne Viegas,
Micropinion Generation: An Unsupervised Approach to Generating Ultra-Concise Summaries of Opinions,
Proceedings of the World Wide Conference 2012 ( WWW'12), pages 869-878, 2012. (acceptance rate 12%) pdf
- Alex Kotov, ChengXiang Zhai,
Tapping into Knowledge Base for Concept Feedback: Leveraging ConceptNet to Improve Search Results for Difficult Queries,
Proceedings of the 5th ACM International Conference on Web Search and Data Mining (WSDM'12), pages 403-412, 2012. (acceptance rate 21%)
-
Shima Gerani, ChengXiang Zhai, Fabio Crestani,
Score Transformation in Linear Combination for Multi-Criteria Relevance Ranking ,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 256-267, 2012. (acceptance rate 21%)
-
Parikshit Sondhi, V.G.Vinod Vydiswaran, ChengXiang Zhai,
Reliability Prediction of Webpages in the Medical Domain,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 219-231, 2012.(acceptance rate 21%) pdf
-
Maryam Karimzadehgan, Chengxiang Zhai,
Axiomatic Analysis of Translation Language Model For Information Retrieval ,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 268-280, 2012. (acceptance rate 21%)
-
Yuanhua Lv, Chengxiang Zhai,
A Log-logistic Model-based Interpretation of TF Normalization of BM25,
Proceedings of the 34th European Conference on Information Retrieval (ECIR'12), pages 244-255, 2012. (acceptance rate 21%)
-
Kavita Ganesan, ChengXiang Zhai, Opinion-based Entity Ranking, Information Retrieval, 15(2): 116-150 (2012) pdf
2011
-
Duo Zhang, ChengXiang Zhai, Jiawei Han,
MiTexCube: MicroTextCluster Cube for Online Analysis of Text Cells,
Proceedings of NASA Conference on Intelligent Data Understanding 2011, CIDU 2011: 204-218, 2011.
- Alexander Kotov, ChengXiang Zhai,
An Exploration of the Potential Effectiveness of Interactive Sense Feedback for Difficult Queries,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), pages 163-172, 2011.
-
Yuanhua Lv, ChengXiang Zhai,
Lower Bounding Term Frequency Normalization,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), pages 7-16, 2011. ( Best Student Paper Award) pdf
- Huizhong Duan, Rui Li, ChengXiang Zhai,
Automatic Query Reformulation with Syntactic Operators to Alleviate Search Difficulty,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), poster paper, pages 2037-2040, 2011.
-
Yuanhua Lv, ChengXiang Zhai,
Adaptive Term Frequency Normalization for BM25,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), poster paper, pages 1985-1988, 2011.
-
Maryam Karimzadehgan, ChengXiang Zhai,
Improving Retrieval Accuracy of Difficult Queries through Generalizing Negative Document Language Models,
Proceedings of the 20th ACM International Conference on Information and Knowledge Management (CIKM'11), pages 27-36, 2011.
- Hongning Wang, Yue Lu, ChengXiang Zhai,
Latent Aspect Rating Analysis without Aspect Keyword Supervision,
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), 2011, pages 618-626. ( 17.5% acceptance)
- V.G.Vinod Vydiswaran, ChengXiang Zhai, Dan Roth,
Content-driven Trust Propagation Framework ,
Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'11), 2011, pages 974-982. ( 17.5% acceptance)
-
Hongning Wang, Chi Wang, ChengXiang Zhai, Jiawei Han,
Learning Online Discussion Structures by Conditional Random Fields,
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'11 ), 2011, pages 435-444. ( 20% acceptance)
pdf
- Yanen Li, Bo-June Hsu, ChengXiang Zhai, Kuansan Wang,
Unsupervised Query Segmentation Using Clickthrough for Information Retrieval,
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'11 ), 2011, pages 285-294. ( 20% acceptance)
- Yuanhua Lv, ChengXiang Zhai, Wan Chen,
A Boosting Approach to Improving Pseudo-Relevance Feedback,
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'11 ), 2011, pages 165-174. ( 20% acceptance) pdf
- Hongning Wang, Duo Zhang, ChengXiang Zhai, Structural Topic Model for Latent Topical Structure Analysis,
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies (ACL HTL'11), pp.
1526-1535, 2011.
pdf
- Yue Lu, Malu Castellanos, Umeshwar Dayal, ChengXiang Zhai, Automatic Construction of a Context-Aware Sentiment
Lexicon: An Optimization Approach,
Proceedings of the World Wide Conference 2011 ( WWW'11), pages 347-356. pdf
- Zhijun Yin, Liangliang Cao, Jiawei Han, Chengxiang Zhai, and Thomas Huang, Geographical Topic Discovery and Comparison,
Proceedings of the World Wide Conference 2011 ( WWW'11), pages 247-256. pdf
-
Huizhong Duan and Chengxiang Zhai, Exploiting Thread Structure to Improve Smoothing of Language Models for Forum Post Retrieval, Proceedings of the 33rd European Conference on Information Retrieval (ECIR'11), pp. 350-361, 2011. pdf
- Alex Kotov, ChengXiang Zhai, Richard Sproat, Mining Named Entities with Temporally Correlated Bursts from Multilin
gual Web News Streams, Proceedings of WSDM 2011, pp. 237-246, 2011.
- Hui Fang, Tao Tao, ChengXiang Zhai, Diagnostic Evaluation of Information Retrieval
Models, ACM Transactions on Information Systems (ACM TOIS), 29(2), pp. 1-42, April 2011, pdf
- Yue Lu, Qiaozhu Mei, ChengXiang Zhai.
Investigating Task Performance of Probabilistic Topic Models - An Empirical Study of PLSA and LDA,
Information Retrieval, vol. 14, no. 2, April, 2011.
2010
- Yanen Li, Jia Hu, ChengXiang Zhai, Ye Chen.
Improving One-Class Collaborative Filtering by Incorporating Rich User Information,
Proceedings of the 19th ACM International Conference on Information and Knowledge Management (CIKM'10), pages 959-968, 2010. ( 13.4% acceptance) pdf
-
Michael J. Paul, ChengXiang Zhai and Roxana Girju.
Summarizing Contrastive Viewpoints In Opinionated Text,
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing (EMNLP'10), pages 65-75, 2010. ( 25% acceptance) pdf
- Kavita Ganesan, ChengXiang Zhai, Jiawei Han.
Opinosis: A Graph Based Approach to Abstractive Summarization of Highly Redundant Opinions, Proceedings of COLING 2010, pages 340-348. pdf
-
Yue Lu, Huizhong Duan, Hongning Wang and ChengXiang Zhai.
Exploiting Structured Ontology to Organize Scattered Online Opinions,
Proceedings of COLING 2010, pages 734-742. pdf
-
Parikshit Sondhi, Manish Gupta, ChengXiang Zhai and Julia Hockenmaier.
Shallow Information Extraction from Medical Forum Data,
Proceedings of COLING 2010, pages 1158-1166. pdf
- Hongning Wang, Yue Lu, ChengXiang Zhai.
Latent Aspect Rating Analysis on Review Text Data: A Rating Regression Approach, Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'10), pages 115-124, 2010. pdf
- Xin He, Yanen Li, Radhika Khetani, Barry Sanders, Yue Lu, Xu Ling, ChengXiang Zhai, Bruce Schatz.
BSQA: integrated text mining using entity relation semantics extracted from biological literature of insects, Nucleic Acids Research . download
- Xin He, Moushumi Sen Sarma, Xu Ling, Brant Chee, ChengXiang Zhai, Bruce Schatz.
Identifying overrepresented concepts in gene lists from
literature: a statistical approach based on Poisson mixture
model,
BMC Bioinformatics 2010, 11:272 (20 May 2010). download
- Duo Zhang, Qiaozhu Mei, ChengXiang Zhai.
Cross-Lingual Latent Topic Extraction,
Proceedings of the 48th Annual Meeting of the Association for
Computational Linguistics ( ACL'10), pages 1128-1137, 2010. pdf
- Maryam Karimzadehgan, ChengXiang Zhai,
Estimation of Statistical Translation Models Based on Mutual Information for Ad Hoc Information Retrieval ,
Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'10 ), pages 323-330, 2010.
( 16.7% acceptance) pdf
- Yuanhua Lv, ChengXiang Zhai, Positional Relevance Model for Pseudo-Relevance Feedback ,
Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'10 ), pages 579-586, 2010.
( 16.7% acceptance) pdf
- Alexander Kotov, ChengXiang Zhai, Towards Natural Question-Guided Search,
Proceedings of the World Wide Conference 2010 ( WWW'10), pages 541-550. pdf
- Hyun Duk Kim, ChengXiang Zhai, Jiawei Han, Aggregation of Multiple Judgments for
Evaluating Ordered Lists,
Proceedings of the 32nd European Conference on Information Retrieval (ECIR'10), pages 166-178, 2010. (22% acceptance) pdf
2009
- Xuanhui Wang, Bin Tan, Azadeh Shakery, ChengXiang Zhai, Beyond Hyperlinks: Organizing Information Footprints in Search Logs to Support Effective Browsing,
Proceedings of the 18th ACM International Conference on Information and Knowledge Management ( CIKM'09), pages 1237-1246, 2009.
( full paper, 14.5% acceptance) pdf
-
Hyun Duk Kim, ChengXiang Zhai, Generating Comparative Summaries of Contradictory Opinions in Text,
Proceedings of the 18th ACM International Conference on Information and Knowledge Management ( CIKM'09), pages 385-394, 2009.
( full paper, 14.5% acceptance) pdf
- Yuanhua Lv, ChengXiang Zhai, Adaptive Relevance Feedback in Information Retrieval,
Proceedings of the 18th ACM International Conference on Information and Knowledge Management ( CIKM'09), pages 255-264, 2009.
( full paper, 14.5% acceptance) pdf
- Yuanhua Lv, ChengXiang Zhai, Positonal Language Models for Information Retrieval,
Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'09 ), pages 299-306, 2009.
( 16% acceptance) pdf
- Younhee Ko, ChengXiang Zhai, Sandra Rodriguez-Zas, Inference of Gene Pathways using Mixture Bayesian Networks,
BMC Systems Biology, 3:54, 2009, doi:10.1186/1752-0509-3-54. pdf.
- Duo Zhang, ChengXiang Zhai, Jiawei Han, Topic Cube: Topic Modeling for OLAP on Multidimensional Text Databases,
Proceedings of 2009 SIAM International Conference on Data Mining (SDM'09), pages 1123-1134, 2009. ( 16% acceptance)
pdf
- Yue Lu, ChengXiang Zhai, Neel Sundaresan, Rated Aspect Summarization of Short Comments,
Proceedings of the World Wide Conference 2009 ( WWW'09), pages 131-140.
( 12% acceptance) pdf
- Yue Lu, Hui Fang, ChengXiang Zhai, An Empirical Study of Gene Synonym
Query Expansion in Biomedical Information Retrieval, Information Retrieval, Volume 12, Number 1, Feb. 2009, Pages 51-68.
link
2008
- ChengXiang Zhai, Statistical Language Models for Information Retrieval: A Critical Review, Foundations and Trends in Information Retrieval, Vol. 2, No. 3 (2008), pages 137-215, doi:10.1561/1500000008. pdf
- ChengXiang Zhai, Statistical Language Models for Information Retrieval (Synthesis Lectures Series on Human Language Technologies), Morgan & Claypool Publishers, 2008. PDF, Amazon page
- Bo Jin, Brian Muller, ChengXiang Zhai, Xinghua Lu, Multi-label literature classification based on the Gene Ontology graph,
BMC Bioinformatics, 2008, 9:525, doi:10.1186/1471-2105-9-525.
- Maryam Karimzadehgan, ChengXiang Zhai, Geneva Belford, Multi-Aspect Expertise Matching
for Review Assignment,
Proceedings of the 17th ACM International Conference on Information and Knowledge Management ( CIKM'08), pages 1113-1122.
(17% acceptance)
- Xuanhui Wang, ChengXiang Zhai, Mining term association patterns from search logs for effective query reformulation,
Proceedings of the 17th ACM International Conference on Information and Knowledge Management ( CIKM'08), pages 479-488.
(17% acceptance)
-
Deng Cai, Qiaozhu Mei, Jiawei Han, ChengXiang Zhai,
Modeling Hidden Topics on Document Manifold ,
Proceedings of the 17th ACM International Conference on Information and Knowledge Management ( CIKM'08), pages 911-920.
(17% acceptance)
- Xu Ling, Qiaozhu Mei, ChengXiang Zhai, Bruce R. Schatz, Mining multi-faceted overviews of arbitrary topics in a text collection,
Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'08), pages 497-505, 2008.
( 20% acceptance)
- Qiaozhu Mei, Duo Zhang, ChengXiang Zhai.
Smoothing Language Models with Document and Word Graphs ,
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'08 ), pages 611-618.
( 17% acceptance)
- Xuanhui Wang, Hui Fang, ChengXiang Zhai.
A study of methods for negative relevance feedback ,
Proceedings of the 31st Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'08 ), pages 219-226.
( 17% acceptance)
- Qiaozhu Mei, ChengXiang Zhai.
Generating Impact-Based Summaries for Scientific Literature ,
Proceedings of the 46th Annual Meeting of the Association for
Computational Linguistics: Human Language Technologies ( ACL-08:HLT), pages 816-824. (25% acceptance)
- Yue Lu, ChengXiang Zhai.
Opinion Integration Through Semi-supervised Topic
Modeling,
Proceedings of the World Wide Conference 2008 ( WWW'08), pages 121-130. ( 12% acceptance) pdf.
- Qiaozhu Mei, Deng Cai, Duo Zhang, ChengXiang Zhai.
Topic Modeling with Network Regularization,
Proceedings of the World Wide Conference 2008 ( WWW'08), pages 101-110. (12% acceptance) pdf.
- Azadeh Shakery, ChengXiang Zhai.
Smoothing Document Language Models with Probabilistic
Term Count Propagation, Information Retrieval,
11(2), 2008, pages 139-164.
- Xuanhui Wang, Tao Tao, Jian-Tao Sun, Azadeh Shakery, and ChengXiang Zhai, DirichletRank:
Solving the Zero-One Gap Problem of PageRank, ACM Transactions on Information Systems,
26(2), 2008, Article No. 10.
2007
-
- Qiaozhu Mei, Dong Xin, Hong Cheng, Jiawei Han, and ChengXiang Zhai, Semantic Annotation of
Frequent Patterns, ACM Transactions on Knowledge Discovery from Data,
1(3), Dec. 2007, Article No. 11.
- Jing Jiang, ChengXiang Zhai, A Two-Stage Approach to Domain Adaptation for Statistical Classifiers ,
Proceedings of the 16th ACM International Conference on Information and Knowledge Management ( CIKM'07), pages 401-410.
( full paper, 17% acceptance)
- Xuanhui Wang, Hui Fang, ChengXiang Zhai, Improve Retrieval Accuracy for Difficult Queries using Negative Feedback ,
Proceedings of the 16th ACM International Conference on Information and Knowledge Management ( CIKM'07), pages 991-994.
( short paper, 26% acceptance)
- Shui-Lung Chuang, Kevin Chen-Chuan Chuang, and ChengXiang Zhai,
Context-Aware Wrapping: Synchronized Data Extraction,
Proceedings of the 33rd Very Large Data Bases Conference (VLDB'07),pages 699-710. (17.5% acceptance)
- Xuehua Shen, Bin Tan, and ChengXiang Zhai, Privacy Protection in Personalized Search,
ACM SIGIR Forum , 41(1), pages 4-17. pdf
- Qiaozhu Mei, Xuehua Shen, and ChengXiang Zhai, Automatic Labeling of Multinomial Topic Models ,
Proceedings of the 2007 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07 ), pages 490-499. ( 19% acceptance ) pdf
- Xuanhui Wang, ChengXiang Zhai, Xiao Hu, and Richard Sproat, Mining Correlated Bursty Topic Patterns from Coordinated Text Streams ,
Proceedings of the 2007 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'07 ), pages 784-793. (19% acceptance rate) pdf
- Xuanhui Wang, ChengXiang Zhai, Learn from Web Search Logs to
Organize Search Results,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 87-94. ( 18% acceptance) pdf
- Bin Tan, Atulya Velivelli, Hui Fang, ChengXiang Zhai,
Term Feedback for Information Retrieval with Language Models,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 263-270. ( 18% acceptance) pdf
- Qiaozhu Mei, Hui Fang, ChengXiang Zhai,
A Study of Poisson Query Generation Model for Information Retrieval,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 319-326. ( 18% acceptance) pdf
- Tao Tao, ChengXiang Zhai, An Exploration of Proximity Measures in Information Retrieval,
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and
Development in Information Retrieval ( SIGIR'07 ), pages 295-302. ( 18% acceptance) pdf
- Jing Jiang and ChengXiang Zhai, An Empirical Study of
Tokenization Strategies for Biomedical Information Retrieval, Information Retrieval,
10(4-5), Oct. 2007, pp. 341-363. pdf
- Jing Jiang and ChengXiang Zhai, Instance Weighting for Domain Adaptation in NLP,
Proceedings of ACL 2007, pages 264-271. pdf
- Qiaozhu Mei, Xu Ling, Matthew Wondra, Hang Su, ChengXiang Zhai, Topic Sentiment Mixture: Modeling Facets and Opinions in Weblogs, Proceedings of the World Wide Conference 2007 ( WWW'07), pages 171-180. pdf
- Hui Fang, ChengXiang Zhai, Probabilistic Models for Expert Finding , Proceedings of
the 29th European Conference on Information Retrieval (ECIR'07), pages 418-430. ( 19% acceptance) pdf
- Jing Jiang, ChengXiang Zhai,
A Systematic Exploration of The Feature Space for Relation Extraction
,
Proceedings of Human Language Technologies: The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT 2007), pages 113-120. ( 24% acceptance) pdf
- Xu Ling, Jing Jiang, Xin He, Qiaozhu Mei, ChengXiang Zhai, Bruce Schatz,
Generating Semi-Structured Gene Summaries from Biomedical Literature,
Information Processing and Management, 43(6), Nov. 2007, pp. 1777-1791.
pdf
2006
- Saurabh Sinha, Xu Ling, Charles W. Whitfield, ChengXiang Zhai, and Gene E. Robinson,
Genome scan for cis-regulatory DNA motifs associated with social behavior in honey bees ,
Proceedings of National Academy of Sciences of the United States of America (PNAS) ,
103(44), Oct. 2006, pages 16352-16357. URL
- Jing Jiang and ChengXiang Zhai,
Extraction of coherent relevant passages
using hidden Markov models, ACM Transactions on Information
Systems, 24(3), July 2006, pages 295-319. URL
- Azadeh Shakery and ChengXiang Zhai,
A probabilistic relevance propagation model for hypertext retrieval,
In Proceedings of the 15th ACM International Conference on Information and Knowledge Management ( CIKM'06), pages 550-558. ( 15% acceptance) pdf
- Rong Jin, Luo Si, and ChengXiang Zhai,
A study of mixture models for collaborative filtering, Information Retrieval,
9(3), Jun. 2006, pages 357-382. URL
- Bin Tan, Xuehua Shen, ChengXiang Zhai,
Mining long-term search history to improve search
accuracy ,
Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'06 ), pages 718-723. (poster paper, 23% acceptance) pdf
- Qiaozhu Mei, ChengXiang Zhai,
A Mixture Model for Contextual Text Mining,
Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'06 ), pages 649-655. (poster paper, 23% acceptance) pdf
- Qiaozhu Mei, Dong Xin, Hong Cheng, Jiawei Han, ChengXiang Zhai,
Generating Semantic Annotations for Frequent Patterns
with Context Analysis ,
Proceedings of the 2006 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'06 ), pages 337-346. Best Student Paper Award Runner-Up.
(full paper, 11% acceptance) pdf
- Tao Tao, Su-Youn Yoon, Andrew Fister, Richard Sproat and ChengXiang Zhai,
Unsupervised Named Entity Transliteration Using Temporal and Phonetic Correlation ,
Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing (EMNLP 2006), pages 250-257. ( 31% acceptance) pdf
- Richard Sproat, Tao Tao and ChengXiang Zhai,
Named Entity Transliteration with Comparable Corpora,
Proceedings of COLING-ACL 2006, pages 73-80. ( 23% acceptance) pdf
- Xuanhui Wang, Jian-Tao Sun, Zheng Chen, ChengXiang Zhai,
Latent Semantic Analysis for Multiple-Type Interrelated Data Objects
Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'06 ), pages 236-243. ( 19% acceptance) pdf
- Hui Fang, ChengXiang Zhai,
Semantic Term Matching in Axiomatic Approaches to Information Retrieval
Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'06 ), pages 115-122. ( 19% acceptance) pdf
- Tao Tao, ChengXiang Zhai,
Regularized Estimation of Mixture Models for Robust Pseudo-Relevance Feedback
Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'06 ), pages 162-169. ( 19% acceptance) pdf
- Jing Jiang, ChengXiang Zhai,
Exploiting Domain Structure for Named Entity Recognition.
Proceedings of HLT/NAACL 2006, pages 74-81. ( 25% acceptance) pdf, ppt
- Tao Tao, Xuanhui Wang, Qiaozhu Mei, ChengXiang Zhai,
Language Model Information Retrieval with Document Expansion.
Proceedings of HLT/NAACL 2006, pages 407-414. ( 25% acceptance) pdf
- Qiaozhu Mei, Chao Liu, Hang Su, and ChengXiang Zhai,
A Probabilistic Approach to Spatiotemporal Theme Pattern Mining on Weblogs.
Proceedings of the
World Wide Web Conference 2006 ( WWW'06), pages 533-542. (11% acceptance) pdf
- Xu Ling, Jing Jiang, Xin He, Qiaozhu Mei, ChengXiang Zhai, and Bruce Schatz,
Automatically Generating Gene Summaries from Biomedical Literature . In Proceedings of
Pacific Symposium on Biocomputing 2006 (PSB'06), pages 40-51.
pdf
- ChengXiang Zhai and John Lafferty,
A risk minimization framework for information retrieval ,
Information Processing and Management ( IP &M ), 42(1), Jan. 2006. pages 31-55.
URL
2005
- Xuehua Shen, Bin Tan, and ChengXiang Zhai, Implicit User Modeling for Personalized Search ,
In Proceedings of the 14th ACM International Conference on Information and Knowledge Management ( CIKM'05), pages 824-831.
pdf ( 18% acceptance)
- Qiaozhu Mei, ChengXiang Zhai, Discovering Evolutionary Theme Patterns from Text -- An Exploration of Temporal Text Mining,
Proceedings of the 2005 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining , (KDD'05 ), pages 198-207, 2005. pdf
(full paper, 12% acceptance)
- Tao Tao, ChengXiang Zhai, Mining Comparable Bilingual Text Corpora for Cross-Language Information Integration ,
Proceedings of the 2005 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD'05 ), pages 691-696, 2005. pdf ( poster paper, 22% acceptance)
- Hui Fang, ChengXiang Zhai, An Exploration of Axiomatic Approach to Information Retrieval ,
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'05 ), 480-487, 2005.
pdf ( 19% acceptance)
- Xuehua Shen, ChengXiang Zhai, Active Feedback in Ad Hoc Information Retrieval,
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'05), 59-66, 2005.
pdf ( 19% acceptance )
- Xuehua Shen, Bin Tan, ChengXiang Zhai, Context-Sensitive Information Retrieval with Implicit Feedback,
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'05), 43-50, 2005.
pdf ( 19% acceptance )
2004
-
Tao Tao, ChengXiang Zhai, Xinghua Lu, and Hui Fang, A study of statistical methods for function prediction of protein motifs , Applied Bioinformatics, Volume 3, No. 2-3, pages 115-124. (BLM 03 paper: ps, pdf)
-
Xinghua Lu, Chengxiang Zhai , Vanathi Gopalakrishnan, and Bruce G Buchanan,
Automatic annotation of protein motif function with Gene Ontology terms, BMC Bioinformatics 2004, 5:122. (url) (Impact factor=5.42, as of 2006)
- Hui Fang, Tao Tao, ChengXiang Zhai, A formal study of information retrieval heuristics,
Proceedings of the 27th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'04), pages 49-56, 2004. Best Paper Award. pdf ( 22% acceptance )
- ChengXiang Zhai, Atulya Velivelli, Bei Yu, A cross-collection mixture model for comparative text mining, Proceedings of ACM KDD 2004 ( KDD'04 ), pages 743-748, 2004. pdf, ppt ( poster paper, 25% acceptance )
- Tao Tao, ChengXiang Zhai, A Mixture Clustering Model for Pseudo Feedback in Information Retrieval ,
Proceedings of the 2004 Meeting of the International Federation of Classification Societies ( IFCS'04), pages 541-552. Invited Paper. pdf
- ChengXiang Zhai, John Lafferty, A study of smoothing methods for language models applied to information retrieval , ACM Transactions on Information Systems ( ACM TOIS ), Vol. 22, No. 2, April 2004, pages 179-214. ( ps)
2003
-
Hwanjo Yu, ChengXiang Zhai, and Jiawei Han,
Text Classification from Positive and Unlabeled Documents , Proceedings of ACM CIKM 2003 (CIKM'03), pages 232-239, 2003. pdf ( 15% acceptance )
- Jin Rong, Luo Si, ChengXiang Zhai, and Jamie Callan,
Collaborative Filtering with Decoupled Models for Preferences and Ratings ,
Proceedings of ACM CIKM 2003 (CIKM'03 ), pages 301-316, 2003. ps, pdf ( 15% acceptance)
- ChengXiang Zhai, William W. Cohen, and John Lafferty, Beyond Independent Relevance: Methods and Evaluation Metrics for Subtopic Retrieval ,
Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'03 ), pages 10-17, 2003.
ps, pdf ( 17% acceptance )
- Rong Jin, Luo Si, and ChengXiang Zhai, Preference-based Graphic Models for Collaborative Filtering, In Proceedings of UAI 2003 (UAI'03 ), pages 329-336, 2003. ps, pdf ( 25% acceptance )
- John Lafferty and Chengxiang Zhai, Probabilistic relevance models based on document and query generation , In Language Modeling and Information Retrieval, Kluwer International Series on Information Retrieval, Vol. 13, 2003. ps,
pdf
2002
- ChengXiang Zhai, Risk Minimization and Language Modeling in Information Retrieval, Ph.D. thesis, Carnegie Mellon University, 2002. (summary).
- ChengXiang Zhai and John Lafferty, Two-Stage Language Models for Information Retrieval ,
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'02), pages 49-56, 2002.
ps, pdf ( 20% acceptance )
- Rong Jin, Alex G. Hauptmann, and ChengXiang Zhai, Title Language
Model for Information Retrieval,
Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval ( SIGIR'02 ), pages 42-48, 2002.
ps,
pdf ( 20% acceptance )
2001
- Chengxiang Zhai and John Lafferty, Model-based feedback in the language modeling approach to information retrieval , Proceedings of the Tenth ACM International Conference on Information and Knowledge Management (CIKM'01), pages 403-410, 2001. ps,
pdf ( 25% acceptance)
- Chengxiang Zhai and John Lafferty, A study of smoothing methods for
language models applied to ad hoc information retrieval,
Proceedings of the 24th Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval (SIGIR'01 ), pages 334-342, 2001. ps, pdf
( 23% acceptance )
- John Lafferty and Chengxiang Zhai, Document language models, query models, and risk minimization for information
retrieval ,
Proceedings of the 24th Annual International ACM SIGIR
Conference on Research and Development in Information Retrieval (SIGIR'01 ), pages 111-119, 2001. ps,
pdf ( 23% acceptance )