| 2003 | A system for new event detection. Thorsten Brants, Francine Chen |
| 2003 | A comparative study on content-based music genre classification. Tao Li, Mitsunori Ogihara, Qi Li |
| 2003 | A comparison of various approaches for using probabilistic dependencies in language modeling. Peter Bruza, Dawei Song |
| 2003 | A frequency-based and a poisson-based definition of the probability of being informative. Thomas Rölleke |
| 2003 | A light weight PDA-friendly collection fusion technique. Jeffery Antoniuk, Mario A. Nascimento |
| 2003 | A maximal figure-of-merit learning approach to text categorization. Sheng Gao, Wen Wu, Chin-Hui Lee, Tat-Seng Chua |
| 2003 | A personalised information retrieval tool. Innes Martin, Joemon M. Jose |
| 2003 | A repetition based measure for verification of text collections and for text categorization. Dmitry V. Khmelev, William John Teahan |
| 2003 | A scalability analysis of classifiers in text categorization. Yiming Yang, Jian Zhang, Bryan Kisiel |
| 2003 | A unified model for metasearch and the efficient evaluation of retrieval systems via the hedge algorithm. Javed A. Aslam, Virgiliu Pavlu, Robert Savell |
| 2003 | An architecture for peer-to-peer information retrieval. Iraklis A. Klampanos, Joemon M. Jose |
| 2003 | An empirical study on retrieval models for different document genres: patents and newspaper articles. Makoto Iwayama, Atsushi Fujii, Noriko Kando, Yuzo Marukawa |
| 2003 | An information-theoretic measure for document similarity. Javed A. Aslam, Meredith Frost |
| 2003 | An investigation of broad coverage automatic pronoun resolution for information retrieval. Richard J. Edens, Helen L. Gaylard, Gareth J. F. Jones, Adenike M. Lam-Adesina |
| 2003 | Analysis of anchor text for web search. Nadav Eiron, Kevin S. McCurley |
| 2003 | Assessing the effectiveness of pen-based input queries. Stephen Levin, Paul D. Clough, Mark Sanderson |
| 2003 | Automatic image annotation and retrieval using cross-media relevance models. Jiwoon Jeon, Victor Lavrenko, R. Manmatha |
| 2003 | Automatic ranking of retrieval systems in imperfect environments. Rabia Nuray, Fazli Can |
| 2003 | Automatic transliteration for Japanese-to-English text retrieval. Yan Qu, Gregory Grefenstette, David A. Evans |
| 2003 | Average gain ratio: a simple retrieval performance measure for evaluation with multiple relevance levels. Tetsuya Sakai |
| 2003 | Bayesian extension to the language model for ad hoc information retrieval. Hugo Zaragoza, Djoerd Hiemstra, Michael E. Tipping |
| 2003 | Beyond independent relevance: methods and evaluation metrics for subtopic retrieval. ChengXiang Zhai, William W. Cohen, John D. Lafferty |
| 2003 | Building a filtering test collection for TREC 2002. Ian Soboroff, Stephen E. Robertson |
| 2003 | Building a web thesaurus from web link structure. Zheng Chen, Shengping Liu, Liu Wenyin, Geguang Pu, Wei-Ying Ma |
| 2003 | Building and applying a concept hierarchy representation of a user profile. Nikolaos Nanas, Victoria S. Uren, Anne N. De Roeck |
| 2003 | Classification of source code archives. Robert Krovetz, Secil Ugurel, C. Lee Giles |
| 2003 | Collaborative filtering via gaussian probabilistic latent semantic analysis. Thomas Hofmann |
| 2003 | Combining document representations for known-item search. Paul Ogilvie, James P. Callan |
| 2003 | DefScriber: a hybrid system for definitional QA. Sasha Blair-Goldensohn, Kathleen R. McKeown, Andrew Hazen Schlaikjer |
| 2003 | Discovering and structuring information flow among bioinformatics resources. Joan C. Bartlett, Elaine G. Toms |
| 2003 | Document clustering based on non-negative matrix factorization. Wei Xu, Xin Liu, Yihong Gong |
| 2003 | Document retrieval from user-selected web sites. Ulrich Bohnacker, Ingrid Renz |
| 2003 | Document-self expansion for text categorization. Yuen-Hsien Tseng, Da-Wei Juang |
| 2003 | Domain-independent text segmentation using anisotropic diffusion and dynamic programming. Xiang Ji, Hongyuan Zha |
| 2003 | Empirical development of an exponential probabilistic model for text retrieval: using textual analysis to build a better model. Jaime Teevan, David R. Karger |
| 2003 | Enhancing cross-language information retrieval by an automatic acquisition of bilingual terminology from comparable corpora. Fatiha Sadat, Masatoshi Yoshikawa, Shunsuke Uemura |
| 2003 | Error analysis of difficult TREC topics. Xiao Hu, Sindhura Bandhakavi, ChengXiang Zhai |
| 2003 | Evaluating different methods of estimating retrieval quality for resource selection. Henrik Nottelmann, Norbert Fuhr |
| 2003 | Evaluating retrieval performance for Japanese question answering: what are best passages? Tetsuya Sakai, Tomoharu Kokubu |
| 2003 | Experimental result analysis for a generative probabilistic image retrieval model. Thijs Westerveld, Arjen P. de Vries |
| 2003 | Exploiting query history for document ranking in interactive information retrieval. Xuehua Shen, ChengXiang Zhai |
| 2003 | Fractal summarization: summarization based on fractal theory. Christopher C. Yang, Fu Lee Wang |
| 2003 | Fuzzy translation of cross-lingual spelling variants. Ari Pirkola, Jarmo Toivonen, Heikki Keskustalo, Kari Visala, Kalervo Järvelin |
| 2003 | Generating hierarchical summaries for web searches. Dawn J. Lawrie, W. Bruce Croft |
| 2003 | HAT: a hardware assisted TOP-DOC inverted index component. S. Kagan Agun, Ophir Frieder |
| 2003 | Head/modifier pairs for everyone. Cornelis H. A. Koster |
| 2003 | Image classification using hybrid neural networks. Chih-Fong Tsai, Kenneth McGarry, John Tait |
| 2003 | Implicit link analysis for small web search. Gui-Rong Xue, Hua-Jun Zeng, Zheng Chen, Wei-Ying Ma, HongJiang Zhang, Chao-Jun Lu |
| 2003 | Incorporating query term dependencies in language models for document retrieval. Munirathnam Srikanth, Rohini K. Srihari |
| 2003 | Investigating the relationship between language model perplexity and IR precision-recall measures. Leif Azzopardi, Mark A. Girolami, Keith van Rijsbergen |
| 2003 | Keynote Address - exploring, modeling, and using the web graph. Andrei Z. Broder |
| 2003 | Latent concepts and the number orthogonal factors in latent semantic analysis. Georges Dupret |
| 2003 | MIND: resource selection and data fusion in multimedia distributed digital libraries. Stefano Berretti, James P. Callan, Henrik Nottelmann, Xiao Mang Shou, Shengli Wu |
| 2003 | Modeling annotated data. David M. Blei, Michael I. Jordan |
| 2003 | Music modeling with random fields. Victor Lavrenko, Jeremy Pickens |
| 2003 | On an equivalence between PLSI and LDA. Mark A. Girolami, Ata Kabán |
| 2003 | On the effectiveness of evaluating retrieval systems in the absence of relevance judgments. Javed A. Aslam, Robert Savell |
| 2003 | Optimizing term vectors for efficient and robust filtering. David A. Evans, Jeffrey Bennett, David A. Hull |
| 2003 | Passage retrieval vs. document retrieval for factoid question answering. Charles L. A. Clarke, Egidio L. Terra |
| 2003 | Popular music retrieval by detecting mood. Yazhong Feng, Yueting Zhuang, Yunhe Pan |
| 2003 | Probabilistic structured query methods. Kareem Darwish, Douglas W. Oard |
| 2003 | Probabilistic term variant generator for biomedical terms. Yoshimasa Tsuruoka, Jun'ichi Tsujii |
| 2003 | Quantitative evaluation of passage retrieval algorithms for question answering. Stefanie Tellex, Boris Katz, Jimmy Lin, Aaron Fernandes, Gregory Marton |
| 2003 | Query length in interactive information retrieval. Nicholas J. Belkin, Diane Kelly, Giyeong Kim, Ja-Young Kim, Hyuk-Jin Lee, Gheorghe Muresan, Muh-Chyun (Morris) Tang, Xiaojun Yuan, Colleen Cool |
| 2003 | Query type classification for web document retrieval. In-Ho Kang, Gil-Chang Kim |
| 2003 | Query word deletion prediction. Rosie Jones, Daniel C. Fain |
| 2003 | Querying XML using structures and keywords in timber. Cong Yu, H. V. Jagadish, Dragomir R. Radev |
| 2003 | Question classification using support vector machines. Dell Zhang, Wee Sun Lee |
| 2003 | Re-examining the potential effectiveness of interactive query expansion. Ian Ruthven |
| 2003 | ReCoM: reinforcement clustering of multi-type interrelated data objects. Jidong Wang, Hua-Jun Zeng, Zheng Chen, Hongjun Lu, Li Tao, Wei-Ying Ma |
| 2003 | Relevant document distribution estimation method for resource selection. Luo Si, James P. Callan |
| 2003 | Resource selection and data fusion in multimedia distributed digital libraries. James P. Callan, Fabio Crestani, Henrik Nottelmann, Pietro Pala, Xiao Mang Shou |
| 2003 | Retrieval and novelty detection at the sentence level. James Allan, Courtney Wade, Alvaro Bolivar |
| 2003 | Robustness of regularized linear classification methods in text categorization. Jian Zhang, Yiming Yang |
| 2003 | Rule-based word clustering for text classification. Hui Han, Eren Manavoglu, C. Lee Giles, Hongyuan Zha |
| 2003 | SE-LEGO: creating metasearch engines on demand. Zonghuan Wu, Vijay V. Raghavan, Chun Du, Komanduru Sai C, Weiyi Meng, Hai He, Clement T. Yu |
| 2003 | SETS: search enhanced by topic segmentation. Mayank Bawa, Gurmeet Singh Manku, Prabhakar Raghavan |
| 2003 | SIGIR 2003: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 28 - August 1, 2003, Toronto, Canada. Charles L. A. Clarke, Gordon V. Cormack, Jamie Callan, David Hawking, Alan F. Smeaton |
| 2003 | Salton Award Lecture - Information retrieval and computer science: an evolving relationship. W. Bruce Croft |
| 2003 | Search strategies in content-based image retrieval. Sharon McDonald, John Tait |
| 2003 | Searchers' criteria for assessing web pages. Anastasios Tombros, Ian Ruthven, Joemon M. Jose |
| 2003 | Searching XML documents via XML fragments. David Carmel, Yoëlle S. Maarek, Matan Mandelbrod, Yosi Mass, Aya Soffer |
| 2003 | Single n-gram stemming. James Mayfield, Paul McNamee |
| 2003 | Speech-based and video-supported indexing of multimedia broadcast news. Yoshihiko Hayashi, Katsutoshi Ohtsuki, Katsuji Bessho, Osamu Mizuno, Yoshihiro Matsuo, Shoichi Matsunaga, Minoru Hayashi, Takaaki Hasegawa, Naruhiro Ikeda |
| 2003 | Statistical visual feature indexes in video retrieval. Xiangming Mu, Gary Marchionini |
| 2003 | Stemming in the language modeling framework. James Allan, Giridhar Kumaran |
| 2003 | Structured use of external knowledge for event-based open domain question answering. Hui Yang, Tat-Seng Chua, Shuguang Wang, Chun-Keat Koh |
| 2003 | Stuff I've seen: a system for personal information retrieval and re-use. Susan T. Dumais, Edward Cutrell, Jonathan J. Cadiz, Gavin Jancke, Raman Sarin, Daniel C. Robbins |
| 2003 | Summary evaluation and text categorization. Khurshid Ahmad, Bogdan Vrusias, Paulo C. F. de Oliveira |
| 2003 | Syntactic features in question answering. Xiaoyan Li |
| 2003 | Table extraction using conditional random fields. David Pinto, Andrew McCallum, Xing Wei, W. Bruce Croft |
| 2003 | Text categorization by boosting automatically extracted concepts. Lijuan Cai, Thomas Hofmann |
| 2003 | The TREC-like evaluation of music IR systems. J. Stephen Downie |
| 2003 | Topic distillation using hierarchy concept tree. Ikkyu Choi, Minkoo Kim |
| 2003 | Topic hierarchy generation via linear discriminant projection. Tao Li, Shenghuo Zhu, Mitsunori Ogihara |
| 2003 | Toward a unification of text and link analysis. Brian D. Davison |
| 2003 | Transliteration of proper names in cross-language applications. Paola Virga, Sanjeev Khudanpur |
| 2003 | User-assisted query translation for interactive CLIR. Daqing He, Jianqiang Wang, Douglas W. Oard, Michael Nossal |
| 2003 | User-trainable video annotation using multimodal cues. Ching-Yung Lin, Milind R. Naphade, Apostol Natsev, Chalapathy Neti, John R. Smith, Belle L. Tseng, Harriet J. Nock, W. H. Adams |
| 2003 | Using asymmetric distributions to improve text classifier probability estimates. Paul N. Bennett |
| 2003 | Using manually-built web directories for automatic evaluation of known-item retrieval. Steven M. Beitzel, Eric C. Jensen, Abdur Chowdhury, David A. Grossman, Ophir Frieder |
| 2003 | Using terminological feedback for web search refinement: a log-based study. Peter G. Anick |
| 2003 | When query expansion fails. Bodo Billerbeck, Justin Zobel |
| 2003 | Word sense disambiguation in information retrieval revisited. Christopher Stokoe, Michael P. Oakes, John Tait |
| 2003 | XML retrieval: what to retrieve? Jaap Kamps, Maarten Marx, Maarten de Rijke, Börkur Sigurbjörnsson |
| 2003 | eArchivarius: accessing collections of electronic mail. Anton Leuski, Douglas W. Oard, Rahul Bhagat |
| 2003 | eBizSearch: a niche search engine for e-business. C. Lee Giles, Yves Petinot, Pradeep B. Teregowda, Hui Han, Steve Lawrence, Arvind Rangaswamy, Nirmal Pal |