| 2006 | A Framework for Clustering Massive Text and Categorical Data Streams. Charu C. Aggarwal, Philip S. Yu |
| 2006 | A Framework for Local Supervised Dimensionality Reduction of High Dimensional Data. Charu C. Aggarwal |
| 2006 | A Latent Dirichlet Model for Unsupervised Entity Resolution. Indrajit Bhattacharya, Lise Getoor |
| 2006 | A New Privacy-Preserving Distributed k-Clustering Algorithm. Geetha Jagannathan, Krishnan Pillaipakkamnatt, Rebecca N. Wright |
| 2006 | A Novel Framework for Incorporating Labeled Examples into Anomaly Detection. Jing Gao, Haibin Cheng, Pang-Ning Tan |
| 2006 | A Random Walks Method for Text Classification. Yunpeng Xu, Xing Yi, Changshui Zhang |
| 2006 | A Semantic Approach for Mining Hidden Links from Complementary and Non-interactive Biomedical Literature. Xiaohua Hu, Xiaodan Zhang, Illhoi Yoo, Yanqing Zhang |
| 2006 | A Systematic Cross-Comparison of Sequence Classifiers. Binyamin Rosenfeld, Ronen Feldman, Moshe Fresko |
| 2006 | Adapting K-Medians to Generate Normalized Cluster Centers. Benjamin J. Anderson, Deborah S. Gross, David R. Musicant, Anna M. Ritz, Thomas G. Smith, Leah E. Steinberg |
| 2006 | Advanced Prototype Machines: Exploring Prototypes for Classification. Hans-Peter Kriegel, Matthias Schubert |
| 2006 | Area Under ROC Optimisation using a Ramp Approximation. Alan Herschtal, Bhavani Raskutti, Peter K. Campbell |
| 2006 | Automated Knowledge Discovery from Simulators. Michael C. Burl, Dennis DeCoste, Brian L. Enke, Dominic Mazzoni, William J. Merline, Lucas Scharenbroich |
| 2006 | Bayesian K-Means as a "Maximization-Expectation" Algorithm. Max Welling, Kenichi Kurihara |
| 2006 | CPM: A Covariance-preserving Projection Method. Jieping Ye, Tao Xiong, Ravi Janardan |
| 2006 | Cluster Description Formats, Problems and Algorithms. Byron J. Gao, Martin Ester |
| 2006 | Clustering in the Presence of Bridge-Nodes. Jerry Scripps, Pang-Ning Tan |
| 2006 | Collaborative Document Clustering. Khaled M. Hammouda, Mohamed S. Kamel |
| 2006 | Collaborative Information Extraction and Mining from Multiple Web Documents. Tak-Lam Wong, Wai Lam, Shing-Kit Chan |
| 2006 | Cone Cluster Labeling for Support Vector Clustering. Sei-Hyung Lee, Karen M. Daniels |
| 2006 | Confidence Estimation Methods for Partially Supervised Information Extraction. Eugene Agichtein |
| 2006 | Data-Enhanced Predictive Modeling for Sales Targeting. Saharon Rosset, Richard D. Lawrence |
| 2006 | Density-Based Clustering over an Evolving Data Stream with Noise. Feng Cao, Martin Ester, Weining Qian, Aoying Zhou |
| 2006 | Deriving Private Information from Randomly Perturbed Ratings. Sheng Zhang, James Ford, Fillia Makedon |
| 2006 | Detecting the Change of Clustering Structure in Categorical Data Streams. Keke Chen, Ling Liu |
| 2006 | Discovering Frequent Tree Patterns over Data Streams. Mark Cheng-Enn Hsieh, Yi-Hung Wu, Arbee L. P. Chen |
| 2006 | Discovery of Co-evoluting Spatial Co-located Event Sets. Jin Soung Yoo, Shashi Shekhar, Sangho Kim, Mete Celik |
| 2006 | Dissimilarity Measures for Detecting Hepatotoxicity in Clinical Trial Data. Matthew Eric Otey, Srinivasan Parthasarathy, Donald C. Trost |
| 2006 | Efficient Algorithms for Sequence Segmentation. Evimaria Terzi, Panayiotis Tsaparas |
| 2006 | Efficient Markov Network Structure Discovery using Independence Tests. Facundo Bromberg, Dimitris Margaritis, Vasant G. Honavar |
| 2006 | Efficient Mining of Temporally Annotated Sequences. Fosca Giannotti, Mirco Nanni, Dino Pedreschi |
| 2006 | Fast Mining of Distance-Based Outliers in High Dimensional Datasets. Amol Ghoting, Srinivasan Parthasarathy, Matthew Eric Otey |
| 2006 | Fast optimal bandwidth selection for kernel density estimation. Vikas C. Raykar, Ramani Duraiswami |
| 2006 | Finding Sequential Patterns from a Massive Number of Spatio-Temporal Events. Yan Huang, Liqin Zhang, Pusheng Zhang |
| 2006 | Graph-based Methods for Orbit Classification. Abraham Bagherjeiran, Chandrika Kamath |
| 2006 | Health monitoring of a shaft transmission system via hybrid models of PCR and PLS. Yi Fang, Hyun-Woo Cho, Myong Kee Jeong |
| 2006 | Inference of Node Replacement Recursive Graph Grammars. Jacek P. Kukluk, Lawrence B. Holder, Diane J. Cook |
| 2006 | Item Sets that Compress. Arno Siebes, Jilles Vreeken, Matthijs van Leeuwen |
| 2006 | Joint Cluster Analysis of Attribute Data and Relationship Data: the Connected k-Center Problem. Martin Ester, Rong Ge, Byron J. Gao, Zengjian Hu, Boaz Ben-Moshe |
| 2006 | K-Means Clustering Over a Large, Dynamic Network. Souptik Datta, Chris Giannella, Hillol Kargupta |
| 2006 | Learning Bayesian Networks from Incomplete Data: An Efficient Method for Generating Approximate Predictive Distributions. Carsten Riggelsen |
| 2006 | Learning from Incomplete Ratings Using Non-negative Matrix Factorization. Sheng Zhang, Weihong Wang, James Ford, Fillia Makedon |
| 2006 | Local L2-Thresholding Based Data Mining in Peer-to-Peer Systems. Ran Wolff, Kanishka Bhaduri, Hillol Kargupta |
| 2006 | Mining Approximate Frequent Itemsets In the Presence of Noise: Algorithm and Analysis. Jinze Liu, Susan Paulsen, Xing Sun, Wei Wang, Andrew B. Nobel, Jan F. Prins |
| 2006 | Mining Control Flow Abnormality for Logic Error Isolation. Chao Liu, Xifeng Yan, Jiawei Han |
| 2006 | Mining Frequent Agreement Subtrees in Phylogenetic Databases. Sen Zhang, Jason Tsong-Li Wang |
| 2006 | Mining Frequent Patterns by Differential Refinement of Clustered Bitmaps. Jianwei Li, Alok N. Choudhary, Nan Jiang, Wei-keng Liao |
| 2006 | Mining Interesting Patterns from Very High Dimensional Data: A Top-Down Row Enumeration Approach. Hongyan Liu, Jiawei Han, Dong Xin, Zheng Shao |
| 2006 | Mining Minimal Contrast Subgraph Patterns. Roger Ming Hieng Ting, James Bailey |
| 2006 | Mining and Validating Localized Frequent Itemsets with Dynamic Tolerance. Olfa Nasraoui, Suchandra Goswami |
| 2006 | Mining for Outliers in Sequential Databases. Pei Sun, Sanjay Chawla, Bavani Arunasalam |
| 2006 | Mining frequent closed itemsets out-of-core. Claudio Lucchese, Salvatore Orlando, Raffaele Perego |
| 2006 | Modeling Evolutionary Behaviors for Community-based Dynamic Recommendation. Xiaodan Song, Ching-Yung Lin, Belle L. Tseng, Ming-Ting Sun |
| 2006 | Name Reference Resolution in Organizational Email Archives. Christopher P. Diehl, Lise Getoor, Galileo Namata |
| 2006 | ODAC: Hierarchical Clustering of Time Series Data Streams. Pedro Pereira Rodrigues, João Gama, João Pedro Pedroso |
| 2006 | On Approximate Solutions to Support Vector Machines. Dongwei Cao, Daniel Boley |
| 2006 | On the Necessary and Sufficient Conditions of a Meaningful Distance Function for High Dimensional Data Space. Chih-Ming Hsu, Ming-Syan Chen |
| 2006 | Personalized Knowledge Discovery: Mining Novel Association Rules from Text. Xin Chen, Yi-Fang Wu |
| 2006 | Positive Borders or Negative Borders: How to Make Lossless Generator Based Representations Concise. Guimei Liu, Jinyan Li, Limsoon Wong, Wynne Hsu |
| 2006 | Probabilistic Multi-State Split-Merge Algorithm for Coupling Parameter Estimates. Juan K. Lin |
| 2006 | Proceedings of the Sixth SIAM International Conference on Data Mining, April 20-22, 2006, Bethesda, MD, USA. Joydeep Ghosh, Diane Lambert, David B. Skillicorn, Jaideep Srivastava |
| 2006 | Profiling Protein Families from Partially Aligned Sequences. Saikat Mukherjee, Chang Zhao, I. V. Ramakrishnan |
| 2006 | Representation is Everything: Towards Efficient and Adaptable Similarity Measures for Biological Data. Charu C. Aggarwal |
| 2006 | Risk-Sensitive Learning via Expected Shortfall Minimization. Hisashi Kashima |
| 2006 | Robust Clustering for Tracking Noisy Evolving Data Streams. Olfa Nasraoui, Carlos Rojas |
| 2006 | Robust Estimation for Mixture of Probability Tables based on beta-likelihood. Yu Fujimoto, Noboru Murata |
| 2006 | Scan Detection: A Data Mining Approach. György J. Simon, Hui Xiong, Eric Eilertson, Vipin Kumar |
| 2006 | Segmentation and dimensionality reduction. Ella Bingham, Aristides Gionis, Niina Haiminen, Heli Hiisilä, Heikki Mannila, Evimaria Terzi |
| 2006 | Semi-Supervised Clustering with Partial Background Information. Jing Gao, Pang-Ning Tan, Haibin Cheng |
| 2006 | Spatial Weighted Outlier Detection. Yufeng Kou, Chang-Tien Lu, Dechang Chen |
| 2006 | Toward Semantic XML Clustering. Andrea Tagarelli, Sergio Greco |
| 2006 | Towards the Prediction of Protein Abundance from Tandem Mass Spectrometry Data. Anthony J. Bonner, Han Liu |
| 2006 | Transductive De-Noising and Dimensionality Reduction using Total Bregman Regression. Sreangsu Acharyya |
| 2006 | Transform Regression and the Kolmogorov Superposition Theorem. Edwin P. D. Pednault |
| 2006 | Trend Relational Analysis and Grey-Fuzzy Clustering Method. Zhijie Chen, Weizhen Chen, Qile Chen, Mian-Yun Chen |
| 2006 | Using Compression to Identify Classes of Inauthentic Texts. Mehmet M. Dalkilic, Wyatt T. Clark, James C. Costello, Predrag Radivojac |
| 2006 | WIP: mining Weighted Interesting Patterns with a strong weight and/or support affinity. Unil Yun, John J. Leggett |
| 2006 | Weighted Clustering Ensembles. Muna Al-Razgan, Carlotta Domeniconi |