| 2006 | A Balanced Ensemble Approach to Weighting Classifiers for Text Classification. Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Haixun Wang, David W. Cheung, Huan Liu |
| 2006 | A Data Mining Approach for Capacity Building of Stakeholders in Integrated Flood Management. Peter Owotoki, Natasa Manojlovic, Friedrich Mayer-Lindenberg, Erik Pasche |
| 2006 | A Feature Selection and Evaluation Scheme for Computer Virus Detection. Olivier Henchiri, Nathalie Japkowicz |
| 2006 | A Framework for Regional Association Rule Mining in Spatial Datasets. Wei Ding, Christoph F. Eick, Jing Wang, Xiaojing Yuan |
| 2006 | A Novel Method for Detecting Outlying Subspaces in High-dimensional Databases Using Genetic Algorithm. Ji Zhang, Qigang Gao, Hai H. Wang |
| 2006 | A Novel Scalable Algorithm for Supervised Subspace Learning. Jun Yan, Ning Liu, Benyu Zhang, Qiang Yang, Shuicheng Yan, Zheng Chen |
| 2006 | A Parameterized Probabilistic Model of Network Evolution for Supervised Link Prediction. Hisashi Kashima, Naoki Abe |
| 2006 | A Simple Yet Effective Data Clustering Algorithm. Soujanya Vadapalli, Satyanarayana R. Valluri, Kamalakar Karlapalem |
| 2006 | AC-Close: Efficiently Mining Approximate Closed Itemsets by Core Pattern Recovery. Hong Cheng, Philip S. Yu, Jiawei Han |
| 2006 | Accelerating Newton Optimization for Log-Linear Models through Feature Redundancy. Arpit Mathur, Soumen Chakrabarti |
| 2006 | Active Learning to Maximize Area Under the ROC Curve. Matt Culver, Kun Deng, Stephen Scott |
| 2006 | Adaptive Blocking: Learning to Scale Up Record Linkage. Mikhail Bilenko, Beena Kamath, Raymond J. Mooney |
| 2006 | Adaptive Kernel Principal Component Analysis with Unsupervised Learning of Kernels. Daoqiang Zhang, Zhi-Hua Zhou, Songcan Chen |
| 2006 | Adaptive Parallel Graph Mining for CMP Architectures. Gregory Buehrer, Srinivasan Parthasarathy, Yen-Kuang Chen |
| 2006 | Adding Semantics to Email Clustering. Hua Li, Dou Shen, Benyu Zhang, Zheng Chen, Qiang Yang |
| 2006 | An Efficient Reference-Based Approach to Outlier Detection in Large Datasets. Yaling Pei, Osmar R. Zaïane, Yong Gao |
| 2006 | An Experimental Investigation of Graph Kernels on a Collaborative Recommendation Task. François Fouss, Luh Yen, Alain Pirotte, Marco Saerens |
| 2006 | An Information Theoretic Approach to Detection of Minority Subsets in Database. Shin Ando, Einoshin Suzuki |
| 2006 | An Interactive Semantic Video Mining and Retrieval Platform - Application in Transportation Surveillance Video for Incident Detection. Xin Chen, Chengcui Zhang |
| 2006 | Anytime Classification Using the Nearest Neighbor Algorithm with Applications to Stream Mining. Ken Ueno, Xiaopeng Xi, Eamonn J. Keogh, Dah-Jye Lee |
| 2006 | Applying Data Mining to Pseudo-Relevance Feedback for High Performance Text Retrieval. Xiangji Huang, Yan Rui Huang, Miao Wen, Aijun An, Yang Liu, Josiah Poon |
| 2006 | Automatic Single-Organ Segmentation in Computed Tomography Images. Ruchaneewan Susomboon, Daniela Stan Raicu, Jacob Furst, David S. Channin |
| 2006 | Bayesian State Space Modeling Approach for Measuring the Effectiveness of Marketing Activities and Baseline Sales from POS Data. Tomohiro Ando |
| 2006 | Belief Propagation in Large, Highly Connected Graphs for 3D Part-Based Object Recognition. Frank DiMaio, Jude W. Shavlik |
| 2006 | Biclustering Protein Complex Interactions with a Biclique Finding Algorithm. Chris H. Q. Ding, Ya Zhang, Tao Li, Stephen R. Holbrook |
| 2006 | Boosting Kernel Models for Regression. Ping Sun, Xin Yao |
| 2006 | Boosting for Learning Multiple Classes with Imbalanced Class Distribution. Yanmin Sun, Mohamed S. Kamel, Yang Wang |
| 2006 | Boosting the Feature Space: Text Classification for Unstructured Data on the Web. Yang Song, Ding Zhou, Jian Huang, Isaac G. Councill, Hongyuan Zha, C. Lee Giles |
| 2006 | Bregman Bubble Clustering: A Robust, Scalable Framework for Locating Multiple, Dense Regions in Data. Gunjan Gupta, Joydeep Ghosh |
| 2006 | COALA: A Novel Approach for the Extraction of an Alternate Clustering of High Quality and High Dissimilarity. Eric Bae, James Bailey |
| 2006 | COSMIC: Conceptually Specified Multi-Instance Clusters. Hans-Peter Kriegel, Alexey Pryakhin, Matthias Schubert, Arthur Zimek |
| 2006 | Cluster Analysis of Time-Series Medical Data Based on the Trajectory Representation and Multiscale Comparison Techniques. Shoji Hirano, Shusaku Tsumoto |
| 2006 | Cluster Based Core Vector Machine. Asharaf S., M. Narasimha Murty, Shirish K. Shevade |
| 2006 | Cluster Ranking with an Application to Mining Mailbox Networks. Ziv Bar-Yossef, Ido Guy, Ronny Lempel, Yoëlle S. Maarek, Vladimir Soroka |
| 2006 | Co-clustering Documents and Words Using Bipartite Isoperimetric Graph Partitioning. Manjeet Rege, Ming Dong, Farshad Fotouhi |
| 2006 | CoMiner: An Effective Algorithm for Mining Competitors from the Web. Rui Li, Shenghua Bao, Jin Wang, Yong Yu, Yunbo Cao |
| 2006 | Comparison of Descriptor Spaces for Chemical Compound Retrieval and Classification. Nikil Wale, George Karypis |
| 2006 | Comparisons of K-Anonymization and Randomization Schemes under Linking Attacks. Zhouxuan Teng, Wenliang Du |
| 2006 | Constructing Ensembles for Better Ranking. Jin Huang, Charles X. Ling |
| 2006 | Converting Output Scores from Outlier Detection Algorithms into Probability Estimates. Jing Gao, Pang-Ning Tan |
| 2006 | Corrective Classification: Classifier Ensembling with Corrective and Diverse Base Learners. Yan Zhang, Xingquan Zhu, Xindong Wu |
| 2006 | DSTree: A Tree Structure for the Mining of Frequent Sets from Data Streams. Carson Kai-Sang Leung, Quamrul I. Khan |
| 2006 | Data Mining Approaches to Criminal Career Analysis. Jeroen S. de Bruin, Tim K. Cocx, Walter A. Kosters, Jeroen F. J. Laros, Joost N. Kok |
| 2006 | Data Mining Methods for Modeling Gene Expression Regulation and Their Applications. Weixiong Zhang |
| 2006 | Decision Trees for Functional Variables. Suhrid Balakrishnan, David Madigan |
| 2006 | Deploying Approaches for Pattern Refinement in Text Mining. Sheng-Tang Wu, Yuefeng Li, Yue Xu |
| 2006 | Detecting Link Spam Using Temporal Information. Guoyang Shen, Bin Gao, Tie-Yan Liu, Guang Feng, Shiji Song, Hang Li |
| 2006 | Detection of Interdomain Routing Anomalies Based on Higher-Order Path Analysis. Murat Can Ganiz, Sudhan Kanitkar, Mooi Choo Chuah, William M. Pottenger |
| 2006 | Dimension Reduction for Supervised Ordering. Toshihiro Kamishima, Shotaro Akaho |
| 2006 | Direct Marketing When There Are Voluntary Buyers. Yi-Ting Lai, Ke Wang, Daymond Ling, Hua Shi, Jason Zhang |
| 2006 | Dirichlet Aspect Weighting: A Generalized EM Algorithm for Integrating External Data Fields with Semantically Structured Queries by Using Gradient Projection Method. Atulya Velivelli, Thomas S. Huang |
| 2006 | Discover Bayesian Networks from Incomplete Data Using a Hybrid Evolutionary Algorithm. Man Leung Wong, Yuan Yuan Guo |
| 2006 | Discovering Partial Orders in Binary Data. Deepak Rajan, Philip S. Yu |
| 2006 | Discovering Unrevealed Properties of Probability Estimation Trees: On Algorithm Selection and Performance Explanation. Kun Zhang, Wei Fan, Bill P. Buckles, Xiaojing Yuan, Zujia Xu |
| 2006 | Discovery of Collocation Episodes in Spatiotemporal Data. Huiping Cao, Nikos Mamoulis, David W. Cheung |
| 2006 | Distances and (Indefinite) Kernels for Sets of Objects. Adam Woznica, Alexandros Kalousis, Melanie Hilario |
| 2006 | Diverse Topic Phrase Extraction through Latent Semantic Analysis. Jilin Chen, Jun Yan, Benyu Zhang, Qiang Yang, Zheng Chen |
| 2006 | Efficient Clustering of Uncertain Data. Wang Kay Ngai, Ben Kao, Chun Kit Chui, Reynold Cheng, Michael Chau, Kevin Y. Yip |
| 2006 | Enhancing Text Clustering Using Concept-based Mining Model. Shady Shehata, Fakhri Karray, Mohamed S. Kamel |
| 2006 | Entity Resolution with Markov Logic. Parag Singla, Pedro M. Domingos |
| 2006 | Entropy-based Concept Shift Detection. Peter Vorburger, Abraham Bernstein |
| 2006 | Exploratory Mining in Cube Space. Raghu Ramakrishnan |
| 2006 | Exploratory Under-Sampling for Class-Imbalance Learning. Xu-Ying Liu, Jianxin Wu, Zhi-Hua Zhou |
| 2006 | Fast On-line Kernel Learning for Trees. Fabio Aiolli, Giovanni Da San Martino, Alessandro Sperduti, Alessandro Moschitti |
| 2006 | Fast Random Walk with Restart and Its Applications. Hanghang Tong, Christos Faloutsos, Jia-Yu Pan |
| 2006 | Fast Relevance Discovery in Time Series. Chang-Shing Perng, Haixun Wang, Sheng Ma |
| 2006 | Finding "Who Is Talking to Whom" in VoIP Networks via Progressive Stream Clustering. Olivier Verscheure, Michail Vlachos, Aris Anagnostopoulos, Pascal Frossard, Eric Bouillet, Philip S. Yu |
| 2006 | Forecasting Skewed Biased Stochastic Ozone Days: Analyses and Solutions. Kun Zhang, Wei Fan, Xiaojing Yuan, Ian Davidson, Xiangshang Li |
| 2006 | Frequent Closed Itemset Mining Using Prefix Graphs with an Efficient Flow-Based Pruning Strategy. H. D. K. Moonesinghe, Samah Jamal Fodeh, Pang-Ning Tan |
| 2006 | Geometrically Inspired Itemset Mining. Florian Verhein, Sanjay Chawla |
| 2006 | Getting the Most Out of Ensemble Selection. Rich Caruana, Art Munson, Alexandru Niculescu-Mizil |
| 2006 | Global and Componentwise Extrapolation for Accelerating Data Mining from Large Incomplete Data Sets with the EM Algorithm. Chun-Nan Hsu, Han-Shen Huang, Bo-Hou Yang |
| 2006 | Gradual Cube: Customize Profile on Mobile OLAP. Jun Li, Haofeng Zhou, Wei Wang |
| 2006 | GraphRank: Statistical Modeling and Mining of Significant Subgraphs in the Feature Space. Huahai He, Ambuj K. Singh |
| 2006 | Hierarchical Classification by Expected Utility Maximization. Korinna Bade, Eyke Hüllermeier, Andreas Nürnberger |
| 2006 | High Quality, Efficient Hierarchical Document Clustering Using Closed Interesting Itemsets. Hassan H. Malik, John R. Kender |
| 2006 | High-Performance Unsupervised Relation Extraction from Large Corpora. Binyamin Rosenfeld, Ronen Feldman |
| 2006 | How Bayesians Debug. Chao Liu, Zeng Lian, Jiawei Han |
| 2006 | Identifying Follow-Correlation Itemset-Pairs. Shichao Zhang, Jilian Zhang, Xiaofeng Zhu, Zifang Huang |
| 2006 | Improving Grouped-Entity Resolution Using Quasi-Cliques. Byung-Won On, Ergin Elmacioglu, Dongwon Lee, Jaewoo Kang, Jian Pei |
| 2006 | Improving Nearest Neighbor Classifier Using Tabu Search and Ensemble Distance Metrics. Muhammad Atif Tahir, Jim E. Smith |
| 2006 | Improving Personalization Solutions through Optimal Segmentation of Customer Bases. Tianyi Jiang, Alexander Tuzhilin |
| 2006 | Incremental Mining of Frequent Query Patterns from XML Queries for Caching. Guoliang Li, Jianhua Feng, Jianyong Wang, Yong Zhang, Lizhu Zhou |
| 2006 | Integrating Features from Different Sources for Music Information Retrieval. Tao Li, Mitsunori Ogihara, Shenghuo Zhu |
| 2006 | Intelligent Icons: Integrating Lite-Weight Data Mining and Visualization into GUI Operating Systems. Eamonn J. Keogh, Li Wei, Xiaopeng Xi, Stefano Lonardi, Jin Shieh, Scott Sirowy |
| 2006 | Keyphrase Extraction Using Semantic Networks Structure Analysis. Chong Huang, Yonghong Tian, Zhi Zhou, Charles X. Ling, Tiejun Huang |
| 2006 | LOCI: Load Shedding through Class-Preserving Data Acquisition. Peng Wang, Haixun Wang, Wei Wang, Baile Shi, Philip S. Yu |
| 2006 | Large Scale Detection of Irregularities in Accounting Data. Stephen Bay, Krishna Kumaraswamy, Markus G. Anderle, Rohit Kumar, David M. Steier |
| 2006 | Latent Dirichlet Co-Clustering. M. Mahdi Shafiei, Evangelos E. Milios |
| 2006 | Latent Friend Mining from Blog Data. Dou Shen, Jian-Tao Sun, Qiang Yang, Zheng Chen |
| 2006 | Lazy Associative Classification. Adriano Veloso, Wagner Meira Jr., Mohammed J. Zaki |
| 2006 | Learning to Use a Learned Model: A Two-Stage Approach to Classification. Maria-Luiza Antonie, Osmar R. Zaïane, Robert C. Holte |
| 2006 | Linear and Non-Linear Dimensional Reduction via Class Representatives for Text Classification. Dimitrios Zeimpekis, Efstratios Gallopoulos |
| 2006 | Local Correlation Tracking in Time Series. Spiros Papadimitriou, Jimeng Sun, Philip S. Yu |
| 2006 | MARGIN: Maximal Frequent Subgraph Mining. Lini T. Thomas, Satyanarayana R. Valluri, Kamalakar Karlapalem |
| 2006 | Manifold Clustering of Shapes. Dragomir Yankov, Eamonn J. Keogh |
| 2006 | Meta Clustering. Rich Caruana, Mohamed Farid Elhawary, Nam Nguyen, Casey Smith |
| 2006 | Minimum Enclosing Spheres Formulations for Support Vector Ordinal Regression. Shirish K. Shevade, Wei Chu |
| 2006 | Mining Complex Time-Series Data by Learning Markovian Models. Yi Wang, Lizhu Zhou, Jianhua Feng, Jianyong Wang, Zhi-Qiang Liu |
| 2006 | Mining Correlation between Motifs and Gene Expression. Yi Lu, Shiyong Lu, Adrian E. Platts, Stephen A. Krawetz |
| 2006 | Mining Generalized Graph Patterns Based on User Examples. Pavel A. Dmitriev, Carl Lagoze |
| 2006 | Mining Latent Associations of Objects Using a Typed Mixture Model - A Case Study on Expert/Expertise Mining. Shenghua Bao, Yunbo Cao, Bing Liu, Yong Yu, Hang Li |
| 2006 | Mining Maximal Generalized Frequent Geographic Patterns with Knowledge Constraints. Vania Bogorny, João Francisco Valiati, Sandro da Silva Camargo, Paulo Martins Engel, Bart Kuijpers, Luis Otávio Alvares |
| 2006 | Mining Maximal Quasi-Bicliques to Co-Cluster Stocks and Financial Ratios for Value Investment. Kelvin Sim, Jinyan Li, Vivekanand Gopalkrishnan, Guimei Liu |
| 2006 | Mining for Tree-Query Associations in a Graph. Eveline Hoekx, Jan Van den Bussche |
| 2006 | Mixed-Drove Spatio-Temporal Co-occurence Pattern Mining: A Summary of Results. Mete Celik, Shashi Shekhar, James P. Rogers, James A. Shine, Jin Soung Yoo |
| 2006 | Multi-Tier Granule Mining for Representations of Multidimensional Association Rules. Yuefeng Li, Wanzhong Yang, Yue Xu |
| 2006 | NewsCATS: A News Categorization and Trading System. Marc-André Mittermayer, Gerhard Knolmayer |
| 2006 | Object Identification with Constraints. Steffen Rendle, Lars Schmidt-Thieme |
| 2006 | On Trajectory Representation for Scientific Features. Sameep Mehta, Srinivasan Parthasarathy, Raghu Machiraju |
| 2006 | On the Lower Bound of Local Optimums in K-Means Algorithm. Zhenjie Zhang, Bing Tian Dai, Anthony K. H. Tung |
| 2006 | On the Use of Structure and Sequence-Based Features for Protein Classification and Retrieval. Keith Marsolo, Srinivasan Parthasarathy |
| 2006 | Opening the Black Box of Feature Extraction: Incorporating Visualization into High-Dimensional Data Mining Processes. Jianting Zhang, Le Gruenwald |
| 2006 | Optimal Segmentation Using Tree Models. Robert Gwadera, Aristides Gionis, Heikki Mannila |
| 2006 | P3C: A Robust Projected Clustering Algorithm. Gabriela Moise, Jörg Sander, Martin Ester |
| 2006 | Pattern Mining in Frequent Dynamic Subgraphs. Karsten M. Borgwardt, Hans-Peter Kriegel, Peter Wackersreuther |
| 2006 | Personalization in Context: Does Context Matter When Building Personalized Customer Models? Michele Gorgoglione, Cosimo Palmisano, Alexander Tuzhilin |
| 2006 | Plagiarism Detection in arXiv. Daria Sorokina, Johannes Gehrke, Simeon Warner, Paul Ginsparg |
| 2006 | Probabilistic Enhanced Mapping with the Generative Tabular Model. Rodolphe Priam, Mohamed Nadif |
| 2006 | Probabilistic Segmentation and Analysis of Horizontal Cells. Vebjorn Ljosa, Ambuj K. Singh |
| 2006 | Proceedings of the 6th IEEE International Conference on Data Mining (ICDM 2006), 18-22 December 2006, Hong Kong, China |
| 2006 | Query-Sensitive Similarity Measure for Content-Based Image Retrieval. Zhi-Hua Zhou, Hong-Bin Dai |
| 2006 | Rapid Identification of Column Heterogeneity. Bing Tian Dai, Nick Koudas, Beng Chin Ooi, Divesh Srivastava, Suresh Venkatasubramanian |
| 2006 | Recommendation on Item Graphs. Fei Wang, Sheng Ma, Liuzhong Yang, Tao Li |
| 2006 | Regularized Least Absolute Deviations Regression and an Efficient Algorithm for Parameter Tuning. Li Wang, Michael D. Gordon, Ji Zhu |
| 2006 | Relational Ensemble Classification. Christine Preisach, Lars Schmidt-Thieme |
| 2006 | Resource Management for Networked Classifiers in Distributed Stream Mining Systems. Deepak S. Turaga, Olivier Verscheure, Upendra V. Chaudhari, Lisa Amini |
| 2006 | Rule-Based Platform for Web User Profiling. Jianping Zhang, Manu Shukla |
| 2006 | SAXually Explicit Images: Finding Unusual Shapes. Li Wei, Eamonn J. Keogh, Xiaopeng Xi |
| 2006 | STAGGER: Periodicity Mining of Data Streams Using Expanding Sliding Windows. Mohamed G. Elfeky, Walid G. Aref, Ahmed K. Elmagarmid |
| 2006 | Searching for Pattern Rules. Guichong Li, Howard J. Hamilton |
| 2006 | Secure Distributed k-Anonymous Pattern Mining. Wei Jiang, Maurizio Atzori |
| 2006 | Semantic Kernels for Text Classification Based on Topological Measures of Feature Similarity. Stephan Bloehdorn, Roberto Basili, Marco Cammisa, Alessandro Moschitti |
| 2006 | Semantic Smoothing for Model-based Document Clustering. Xiaodan Zhang, Xiaohua Zhou, Xiaohua Hu |
| 2006 | Semi-Supervised Kernel Regression. Meng Wang, Xian-Sheng Hua, Yan Song, Li-Rong Dai, HongJiang Zhang |
| 2006 | Similarity of Temporal Query Logs Based on ARIMA Model. Ning Liu, Shuzhen Nong, Jun Yan, Benyu Zhang, Zheng Chen, Ying Li |
| 2006 | Social Capital in Friendship-Event Networks. Louis Licamele, Lise Getoor |
| 2006 | Solution Path for Semi-Supervised Classification with Manifold Regularization. Gang Wang, Tao Chen, Dit-Yan Yeung, Frederick H. Lochovsky |
| 2006 | Speedup Clustering with Hierarchical Ranking. Jianjun Zhou, Jörg Sander |
| 2006 | Stability Region Based Expectation Maximization for Model-based Clustering. Chandan K. Reddy, Hsiao-Dong Chiang, Bala Rajaratnam |
| 2006 | Star-Structured High-Order Heterogeneous Data Co-clustering Based on Consistent Information Theory. Bin Gao, Tie-Yan Liu, Wei-Ying Ma |
| 2006 | Subjectivity Categorization of Weblog with Part-of-Speech Based Smoothing. Shen Huang, Jian-Tao Sun, Xuanhui Wang, Hua-Jun Zeng, Zheng Chen |
| 2006 | TOP-COP: Mining TOP-K Strongly Correlated Pairs in Large Databases. Hui Xiong, Mark Brodie, Sheng Ma |
| 2006 | TRIAS - An Algorithm for Mining Iceberg Tri-Lattices. Robert Jäschke, Andreas Hotho, Christoph Schmitz, Bernhard Ganter, Gerd Stumme |
| 2006 | Temporal Data Mining in Dynamic Feature Spaces. Brent Wenerstrom, Christophe G. Giraud-Carrier |
| 2006 | The Influence of Class Imbalance on Cost-Sensitive Learning: An Empirical Study. Xu-Ying Liu, Zhi-Hua Zhou |
| 2006 | The PDD Framework for Detecting Categories of Peculiar Data. Mahesh Shrestha, Howard J. Hamilton, Yiyu Yao, Ken Konkel, Liqiang Geng |
| 2006 | The Relationships Among Various Nonnegative Matrix Factorization Methods for Clustering. Tao Li, Chris H. Q. Ding |
| 2006 | Turning Clusters into Patterns: Rectangle-Based Discriminative Data Description. Byron J. Gao, Martin Ester |
| 2006 | Using an Ensemble of One-Class SVM Classifiers to Harden Payload-based Anomaly Detection Systems. Roberto Perdisci, Guofei Gu, Wenke Lee |
| 2006 | What is the Dimension of Your Binary Data? Nikolaj Tatti, Taneli Mielikäinen, Aristides Gionis, Heikki Mannila |
| 2006 | Who Thinks Who Knows Who? Socio-cognitive Analysis of Email Networks. Nishith Pathak, Sandeep Mane, Jaideep Srivastava |
| 2006 | Window-based Tensor Analysis on High-dimensional and Multi-aspect Streams. Jimeng Sun, Spiros Papadimitriou, Philip S. Yu |
| 2006 | bitSPADE: A Lattice-based Sequential Pattern Mining Algorithm Using Bitmap Representation. Sujeevan Aseervatham, Aomar Osmani, Emmanuel Viennet |
| 2006 | delta-Tolerance Closed Frequent Itemsets. James Cheng, Yiping Ke, Wilfred Ng |