| 2005 | A Bayesian network classifier with inverse tree structure for voxelwise magnetic resonance image analysis. Rong Chen, Edward Herskovits |
| 2005 | A distributed learning framework for heterogeneous data sources. Srujana Merugu, Joydeep Ghosh |
| 2005 | A fast kernel-based multilevel algorithm for graph clustering. Inderjit S. Dhillon, Yuqiang Guan, Brian Kulis |
| 2005 | A general model for clustering binary data. Tao Li |
| 2005 | A generalized framework for mining spatio-temporal patterns in scientific data. Hui Yang, Srinivasan Parthasarathy, Sameep Mehta |
| 2005 | A hit-miss model for duplicate detection in the WHO drug safety database. G. Niklas Norén, Roland Orre, Andrew Bate |
| 2005 | A hybrid unsupervised approach for document clustering. Mihai Surdeanu, Jordi Turmo, Alicia Ageno |
| 2005 | A maximum entropy web recommendation system: combining collaborative and content features. Xin Jin, Yanzan Zhou, Bamshad Mobasher |
| 2005 | A multinomial clustering model for fast simulation of computer architecture designs. Kaushal Sanghai, Ting Su, Jennifer G. Dy, David R. Kaeli |
| 2005 | A multiple tree algorithm for the efficient association of asteroid observations. Jeremy Kubica, Andrew W. Moore, Andrew J. Connolly, Robert Jedicke |
| 2005 | A new scheme on privacy-preserving data classification. Nan Zhang, Shengquan Wang, Wei Zhao |
| 2005 | Adversarial learning. Daniel Lowd, Christopher Meek |
| 2005 | An approach to spacecraft anomaly detection problem using kernel feature space. Ryohei Fujimaki, Takehisa Yairi, Kazuo Machida |
| 2005 | An integrated framework on mining logs files for computing system management. Tao Li, Feng Liang, Sheng Ma, Wei Peng |
| 2005 | Anonymity-preserving data collection. Zhiqiang Yang, Sheng Zhong, Rebecca N. Wright |
| 2005 | Application of kernels to link analysis. Takahiko Ito, Masashi Shimbo, Taku Kudo, Yuji Matsumoto |
| 2005 | Automated detection of frontal systems from numerical model-generated data. Xiang Li, Rahul Ramachandran, Sara J. Graves, Sunil Movva, Bilahari Akkiraju, David Emmitt, Steven Greco, Robert Atlas, Joseph Terry, Juan-Carlos Jusem |
| 2005 | Building connected neighborhood graphs for isometric data embedding. Li Yang |
| 2005 | CLICKS: an effective algorithm for mining subspace clusters in categorical datasets. Mohammed Javeed Zaki, Markus Peters, Ira Assent, Thomas Seidl |
| 2005 | Co-clustering by block value decomposition. Bo Long, Zhongfei (Mark) Zhang, Philip S. Yu |
| 2005 | Combining email models for false positive reduction. Shlomo Hershkop, Salvatore J. Stolfo |
| 2005 | Combining partitions by probabilistic label aggregation. Tilman Lange, Joachim M. Buhmann |
| 2005 | Combining proactive and reactive predictions for data streams. Ying Yang, Xindong Wu, Xingquan Zhu |
| 2005 | Consistent bipartite graph co-partitioning for star-structured high-order heterogeneous data co-clustering. Bin Gao, Tie-Yan Liu, Xin Zheng, QianSheng Cheng, Wei-Ying Ma |
| 2005 | Creating social networks to improve peer-to-peer networking. Andrew S. Fast, David D. Jensen, Brian Neil Levine |
| 2005 | Cross-relational clustering with user's guidance. Xiaoxin Yin, Jiawei Han, Philip S. Yu |
| 2005 | Data mining in the chemical industry. Alex N. Kalos, Tim Rey |
| 2005 | Density-based clustering of uncertain data. Hans-Peter Kriegel, Martin Pfeifle |
| 2005 | Deriving marketing intelligence from online discussion. Natalie S. Glance, Matthew Hurst, Kamal Nigam, Matthew Siegler, Robert Stockton, Takashi Tomokiyo |
| 2005 | Detection of emerging space-time clusters. Daniel B. Neill, Andrew W. Moore, Maheshkumar Sabhnani, Kenny Daniel |
| 2005 | Determining an author's native language by mining a text for errors. Moshe Koppel, Jonathan Schler, Kfir Zigdon |
| 2005 | Dimension induced clustering. Aristides Gionis, Alexander Hinneburg, Spiros Papadimitriou, Panayiotis Tsaparas |
| 2005 | Discovering evolutionary theme patterns from text: an exploration of temporal text mining. Qiaozhu Mei, ChengXiang Zhai |
| 2005 | Discovering frequent topological structures from graph datasets. Ruoming Jin, Chao Wang, Dmitrii Polshakov, Srinivasan Parthasarathy, Gagan Agrawal |
| 2005 | Disease progression modeling from historical clinical databases. Ronald K. Pearson, Robert J. Kingan, Alan Hochberg |
| 2005 | Dynamic syslog mining for network failure monitoring. Kenji Yamanishi, Yuko Maruyama |
| 2005 | Efficient computations via scalable sparse kernel partial least squares and boosted latent features. Michinari Momma |
| 2005 | Email data cleaning. Jie Tang, Hang Li, Yunbo Cao, ZhaoHui Tang |
| 2005 | Enhancing the lift under budget constraints: an application in the mutual fund industry. Lian Yan, Michael Fassino, Patrick Baldasare |
| 2005 | Estimating missed actual positives using independent classifiers. Sandeep Mane, Jaideep Srivastava, San-Yih Hwang |
| 2005 | Evaluating similarity measures: a large-scale study in the Orkut social network. Ellen Spertus, Mehran Sahami, Orkut Buyukkokten |
| 2005 | Failure detection and localization in component based systems by online tracking. Haifeng Chen, Guofei Jiang, Cristian Ungureanu, Kenji Yoshihira |
| 2005 | Fast discovery of unexpected patterns in data, relative to a Bayesian network. Szymon Jaroszewicz, Tobias Scheffer |
| 2005 | Fast window correlations over uncooperative time series. Richard Cole, Dennis E. Shasha, Xiaojian Zhao |
| 2005 | Feature bagging for outlier detection. Aleksandar Lazarevic, Vipin Kumar |
| 2005 | Finding partial orders from unordered 0-1 data. Antti Ukkonen, Mikael Fortelius, Heikki Mannila |
| 2005 | Finding similar files in large document repositories. George Forman, Kave Eshghi, Stephane Chiocchetti |
| 2005 | Formulating distance functions via the kernel trick. Gang Wu, Edward Y. Chang, Navneet Panda |
| 2005 | Generation of synthetic data sets for evaluating the accuracy of knowledge discovery systems. Daniel R. Jeske, Behrokh Samadi, Pengyue J. Lin, Lan Ye, Sean Cox, Rui Xiao, Ted Younglove, Minh Ly, Douglas Holt, Ryan Rich |
| 2005 | Graphs over time: densification laws, shrinking diameters and possible explanations. Jure Leskovec, Jon M. Kleinberg, Christos Faloutsos |
| 2005 | Improving discriminative sequential learning with rare-but-important associations. Xuan Hieu Phan, Minh Le Nguyen, Tu Bao Ho, Susumu Horiguchi |
| 2005 | Incentive networks. Prabhakar Raghavan |
| 2005 | Information retrieval based on collaborative filtering with latent interest semantic map. Noriaki Kawamae, Katsumi Takahashi |
| 2005 | Integration of profile hidden Markov model output into association rule mining. Christopher Besemann, Anne Denton |
| 2005 | Key semantics extraction by dependency tree mining. Satoshi Morinaga, Hiroki Arimura, Takahiro Ikeda, Yosuke Sakao, Susumu Akamine |
| 2005 | LIPED: HMM-based life profiles for adaptive event detection. Chien Chin Chen, Meng Chang Chen, Ming-Syan Chen |
| 2005 | Learning to predict train wheel failures. Chunsheng Yang, Sylvain Létourneau |
| 2005 | Local sparsity control for naive Bayes with extreme misclassification costs. Aleksander Kolcz |
| 2005 | Making holistic schema matching robust: an ensemble approach. Bin He, Kevin Chen-Chuan Chang |
| 2005 | Maximal boasting. Cinda Heeren, Leonard Pitt |
| 2005 | Mining closed relational graphs with connectivity constraints. Xifeng Yan, Xianghong Jasmine Zhou, Jiawei Han |
| 2005 | Mining comparable bilingual text corpora for cross-language information integration. Tao Tao, ChengXiang Zhai |
| 2005 | Mining images on semantics via statistical learning. Jianping Fan, Hangzai Luo, Mohand-Said Hacid |
| 2005 | Mining rare and frequent events in multi-camera surveillance video using self-organizing maps. Valery A. Petrushin |
| 2005 | Mining risk patterns in medical data. Jiuyong Li, Ada Wai-Chee Fu, Hongxing He, Jie Chen, Huidong Jin, Damien McAullay, Graham J. Williams, Ross Sparks, Chris Kelman |
| 2005 | Mining the internet: the eighth wonder of the world. Gian Fulgoni |
| 2005 | Mining tree queries in a graph. Bart Goethals, Eveline Hoekx, Jan Van den Bussche |
| 2005 | Model-based overlapping clustering. Arindam Banerjee, Chase Krumpelman, Joydeep Ghosh, Sugato Basu, Raymond J. Mooney |
| 2005 | Modeling and predicting personal information dissemination behavior. Xiaodan Song, Ching-Yung Lin, Belle L. Tseng, Ming-Ting Sun |
| 2005 | Nomograms for visualizing support vector machines. Aleks Jakulin, Martin Mozina, Janez Demsar, Ivan Bratko, Blaz Zupan |
| 2005 | Non-redundant clustering with conditional ensembles. David Gondek, Thomas Hofmann |
| 2005 | On mining cross-graph quasi-cliques. Jian Pei, Daxin Jiang, Aidong Zhang |
| 2005 | On the use of linear programming for unsupervised text classification. Mark Sandler |
| 2005 | Optimizing time series discretization for knowledge discovery. Fabian Mörchen, Alfred Ultsch |
| 2005 | Parallel mining of closed sequential patterns. Shengnan Cong, Jiawei Han, David A. Padua |
| 2005 | Pattern lattice traversal by selective jumps. Osmar R. Zaïane, Mohammad El-Hajj |
| 2005 | Pattern-based similarity search for microarray data. Haixun Wang, Jian Pei, Philip S. Yu |
| 2005 | Predicting the product purchase patterns of corporate customers. Bhavani Raskutti, Alan Herschtal |
| 2005 | Price prediction and insurance for online auctions. Rayid Ghani |
| 2005 | Privacy-preserving distributed k-means clustering over arbitrarily partitioned data. Geetha Jagannathan, Rebecca N. Wright |
| 2005 | Probabilistic workflow mining. Ricardo Bezerra de Andrade e Silva, Jiji Zhang, James G. Shanahan |
| 2005 | Proceedings of the Eleventh ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Chicago, Illinois, USA, August 21-24, 2005. Robert Grossman, Roberto J. Bayardo, Kristin P. Bennett |
| 2005 | Query chains: learning to rank from implicit feedback. Filip Radlinski, Thorsten Joachims |
| 2005 | Reasoning about sets using redescription mining. Mohammed Javeed Zaki, Naren Ramakrishnan |
| 2005 | Regression error characteristic surfaces. Luís Torgo |
| 2005 | Robust boosting and its relation to bagging. Saharon Rosset |
| 2005 | Rule extraction from linear support vector machines. Glenn Fung, Sathyakama Sandilya, R. Bharat Rao |
| 2005 | SVM selective sampling for ranking with application to data retrieval. Hwanjo Yu |
| 2005 | Sampling-based sequential subgroup mining. Martin Scholz |
| 2005 | Scalable discovery of hidden emails from large folders. Giuseppe Carenini, Raymond T. Ng, Xiaodong Zhou |
| 2005 | Short term performance forecasting in enterprise systems. Rob Powers, Moisés Goldszmidt, Ira Cohen |
| 2005 | Simple and effective visual models for gene expression cancer diagnostics. Gregor Leban, Minca Mramor, Ivan Bratko, Blaz Zupan |
| 2005 | Simultaneous optimization of complex mining tasks with a knowledgeable cache. Ruoming Jin, Kaushik Sinha, Gagan Agrawal |
| 2005 | Streaming feature selection using alpha-investing. Jing Zhou, Dean P. Foster, Robert A. Stine, Lyle H. Ungar |
| 2005 | Summarizing itemset patterns: a profile-based approach. Xifeng Yan, Hong Cheng, Jiawei Han, Dong Xin |
| 2005 | The architecture of complexity: the structure and the dynamics of networks, from the web to the cell. Albert-László Barabási |
| 2005 | The predictive power of online chatter. Daniel Gruhl, Ramanathan V. Guha, Ravi Kumar, Jasmine Novak, Andrew Tomkins |
| 2005 | Towards exploratory test instance specific algorithms for high dimensional classification. Charu C. Aggarwal |
| 2005 | Unweaving a web of documents. Ramanathan V. Guha, Ravi Kumar, D. Sivakumar, Ravi Sundaram |
| 2005 | Using relational knowledge discovery to prevent securities fraud. Jennifer Neville, Özgür Simsek, David D. Jensen, John Komoroske, Kelly Palmer, Henry G. Goldberg |
| 2005 | Using retrieval measures to assess similarity in mining dynamic web clickstreams. Olfa Nasraoui, Cesar Cardona, Carlos Rojas |
| 2005 | Variable latent semantic indexing. Anirban Dasgupta, Ravi Kumar, Prabhakar Raghavan, Andrew Tomkins |
| 2005 | Wavelet synopsis for data streams: minimizing non-Euclidean error. Sudipto Guha, Boulos Harb |
| 2005 | Web mining from competitors' websites. Xin Chen, Yi-fang Brook Wu |
| 2005 | Web object indexing using domain knowledge. Muyuan Wang, Zhiwei Li, Lie Lu, Wei-Ying Ma, Naiyao Zhang |