| 2006 | (alpha, k)-anonymity: an enhanced k-anonymity model for privacy preserving data publishing. Raymond Chi-Wing Wong, Jiuyong Li, Ada Wai-Chee Fu, Ke Wang |
| 2006 | A component-based framework for knowledge discovery in bioinformatics. Julien Etienne, Bernd Wachmann, Lei Zhang |
| 2006 | A framework for analysis of dynamic social networks. Tanya Y. Berger-Wolf, Jared Saia |
| 2006 | A general framework for accurate and fast regression by data summarization in random decision trees. Wei Fan, Joe McCloskey, Philip S. Yu |
| 2006 | A large-scale analysis of query logs for assessing personalization opportunities. Steve Wedig, Omid Madani |
| 2006 | A mixture model for contextual text mining. Qiaozhu Mei, ChengXiang Zhai |
| 2006 | A new efficient probabilistic model for mining labeled ordered trees. Kosuke Hashimoto, Kiyoko F. Aoki-Kinoshita, Nobuhisa Ueda, Minoru Kanehisa, Hiroshi Mamitsuka |
| 2006 | A new multi-view regression approach with an application to customer wallet estimation. Srujana Merugu, Saharon Rosset, Claudia Perlich |
| 2006 | Acclimatizing taxonomic semantics for hierarchical content classification from semantics to data-driven taxonomy. Lei Tang, Jianping Zhang, Huan Liu |
| 2006 | Adaptive event detection with time-varying poisson processes. Alexander Ihler, Jon Hutchins, Padhraic Smyth |
| 2006 | Aggregating time partitions. Taneli Mielikäinen, Evimaria Terzi, Panayiotis Tsaparas |
| 2006 | Algorithms for discovering bucket orders from data. Aristides Gionis, Heikki Mannila, Kai Puolamäki, Antti Ukkonen |
| 2006 | Algorithms for storytelling. Deept Kumar, Naren Ramakrishnan, Richard F. Helm, Malcolm Potts |
| 2006 | Algorithms for time series knowledge mining. Fabian Mörchen |
| 2006 | Anonymizing sequential releases. Ke Wang, Benjamin C. M. Fung |
| 2006 | Assessing data mining results via swap randomization. Aristides Gionis, Heikki Mannila, Taneli Mielikäinen, Panayiotis Tsaparas |
| 2006 | Attack detection in time series for recommender systems. Sheng Zhang, Amit Chakrabarti, James Ford, Fillia Makedon |
| 2006 | Automatic mining of fruit fly embryo images. Jia-Yu Pan, André G. R. Balan, Eric P. Xing, Agma J. M. Traina, Christos Faloutsos |
| 2006 | BLOSOM: a framework for mining arbitrary boolean expressions. Lizhuang Zhao, Mohammed J. Zaki, Naren Ramakrishnan |
| 2006 | Beyond classification and ranking: constrained optimization of the ROI. Lian Yan, Patrick Baldasare |
| 2006 | Beyond streams and graphs: dynamic tensor analysis. Jimeng Sun, Dacheng Tao, Christos Faloutsos |
| 2006 | Bias and controversy: beyond the statistical deviation. Hady Wirawan Lauw, Ee-Peng Lim, Ke Wang |
| 2006 | CCCS: a top-down associative classifier for imbalanced class distribution. Bavani Arunasalam, Sanjay Chawla |
| 2006 | CFI-Stream: mining closed frequent itemsets in data streams. Nan Jiang, Le Gruenwald |
| 2006 | Camouflaged fraud detection in domains with complex relationships. Sankar Virdhagriswaran, Gordon Dakin |
| 2006 | Capital One's statistical problems: our top ten list. William Kahn |
| 2006 | Center-piece subgraphs: problem definition and fast solutions. Hanghang Tong, Christos Faloutsos |
| 2006 | Classification features for attack detection in collaborative recommender systems. Robin D. Burke, Bamshad Mobasher, Chad Williams, Runa Bhaumik |
| 2006 | Clustering based large margin classification: a scalable approach using SOCP formulation. J. Saketha Nath, Chiranjib Bhattacharyya, M. Narasimha Murty |
| 2006 | Clustering pair-wise dissimilarity data into partially ordered sets. Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, Jan F. Prins |
| 2006 | Coherent closed quasi-clique discovery from large dense graph databases. Zhiping Zeng, Jianyong Wang, Lizhu Zhou, George Karypis |
| 2006 | Combining linguistic and statistical analysis to extract relations from web documents. Fabian M. Suchanek, Georgiana Ifrim, Gerhard Weikum |
| 2006 | Computer aided detection via asymmetric cascade of sparse hyperplane classifiers. Jinbo Bi, Senthil Periaswamy, Kazunori Okada, Toshiro Kubota, Glenn Fung, Marcos Salganicoff, R. Bharat Rao |
| 2006 | Cryptographically private support vector machines. Sven Laur, Helger Lipmaa, Taneli Mielikäinen |
| 2006 | Data mining challenges in the automotive domain. Michael Cavaretta |
| 2006 | Deriving quantitative models for correlation clusters. Elke Achtert, Christian Böhm, Hans-Peter Kriegel, Peer Kröger, Arthur Zimek |
| 2006 | Detecting outliers using transduction and statistical testing. Daniel Barbará, Carlotta Domeniconi, James P. Rogers |
| 2006 | Discovering interesting patterns through user's interactive feedback. Dong Xin, Xuehua Shen, Qiaozhu Mei, Jiawei Han |
| 2006 | Discovering significant OPSM subspace clusters in massive gene expression data. Byron J. Gao, Obi L. Griffith, Martin Ester, Steven J. M. Jones |
| 2006 | Discovering significant rules. Geoffrey I. Webb |
| 2006 | Dynamic, real-time forecasting of online auctions via functional models. Wolfgang Jank, Galit Shmueli, Shanshan Wang |
| 2006 | Efficient anonymity-preserving data collection. Justin Brickell, Vitaly Shmatikov |
| 2006 | Efficient kernel feature extraction for massive data sets. Ivor W. Tsang, András Kocsor, James T. Kwok |
| 2006 | Efficient multidimensional data representations based on multiple correspondence analysis. Riadh Ben Messaoud, Omar Boussaid, Sabine Loudcher Rabaséda |
| 2006 | Estimating the global pagerank of web communities. Jason V. Davis, Inderjit S. Dhillon |
| 2006 | Event detection from evolution of click-through data. Qiankun Zhao, Tie-Yan Liu, Sourav S. Bhowmick, Wei-Ying Ma |
| 2006 | Evolutionary clustering. Deepayan Chakrabarti, Ravi Kumar, Andrew Tomkins |
| 2006 | Extracting key-substring-group features for text classification. Dell Zhang, Wee Sun Lee |
| 2006 | Extracting redundancy-aware top-k patterns. Dong Xin, Hong Cheng, Xifeng Yan, Jiawei Han |
| 2006 | Fast mining of high dimensional expressive contrast patterns using zero-suppressed binary decision diagrams. Elsa Loekito, James Bailey |
| 2006 | Frequent subgraph mining in outerplanar graphs. Tamás Horváth, Jan Ramon, Stefan Wrobel |
| 2006 | GPLAG: detection of software plagiarism by program dependence graph analysis. Chao Liu, Chen Chen, Jiawei Han, Philip S. Yu |
| 2006 | Generating semantic annotations for frequent patterns with context analysis. Qiaozhu Mei, Dong Xin, Hong Cheng, Jiawei Han, ChengXiang Zhai |
| 2006 | Global distance-based segmentation of trajectories. Aris Anagnostopoulos, Michail Vlachos, Marios Hadjieleftheriou, Eamonn J. Keogh, Philip S. Yu |
| 2006 | Group formation in large social networks: membership, growth, and evolution. Lars Backstrom, Daniel P. Huttenlocher, Jon M. Kleinberg, Xiangyang Lan |
| 2006 | Hierarchical topic segmentation of websites. Ravi Kumar, Kunal Punera, Andrew Tomkins |
| 2006 | Identifying "best bet" web search results by mining past user behavior. Eugene Agichtein, Zijian Zheng |
| 2006 | Identifying bridging rules between conceptual clusters. Shichao Zhang, Feng Chen, Xindong Wu, Chengqi Zhang |
| 2006 | Incremental approximate matrix factorization for speeding up support vector machines. Gang Wu, Edward Y. Chang, Yen-Kuang Chen, Christopher J. Hughes |
| 2006 | Information extraction, data mining and joint inference. Andrew McCallum |
| 2006 | Integration of semantic-based bipartite graph representation and mutual refinement strategy for biomedical literature clustering. Illhoi Yoo, Xiaohua Hu, Il-Yeol Song |
| 2006 | Introducing perpetual analytics. Jeff Jonas |
| 2006 | Is there a grand challenge or X-prize for data mining? Gregory Piatetsky-Shapiro, Robert Grossman, Chabane Djeraba, Ronen Feldman, Lise Getoor, Mohammed Javeed Zaki |
| 2006 | K-means clustering versus validation measures: a data distribution perspective. Hui Xiong, Junjie Wu, Jian Chen |
| 2006 | Learning sparse metrics via linear programming. Rómer Rosales, Glenn Fung |
| 2006 | Learning the unified kernel machines for classification. Steven C. H. Hoi, Michael R. Lyu, Edward Y. Chang |
| 2006 | Learning to rank networked entities. Alekh Agarwal, Soumen Chakrabarti, Sunny Aggarwal |
| 2006 | Linear prediction models with graph regularization for web-page categorization. Tong Zhang, Alexandrin Popescul, Byron Dom |
| 2006 | MONIC: modeling and monitoring cluster transitions. Myra Spiliopoulou, Irene Ntoutsi, Yannis Theodoridis, René Schult |
| 2006 | Maximally informative k-itemsets and their efficient discovery. Arno J. Knobbe, Eric K. Y. Ho |
| 2006 | Maximum profit mining and its application in software development. Charles X. Ling, Victor S. Sheng, Tilmann F. W. Bruckhaus, Nazim H. Madhavji |
| 2006 | Measuring and extracting proximity in networks. Yehuda Koren, Stephen C. North, Chris Volinsky |
| 2006 | Mining citizen science data to predict orevalence of wild bird species. Rich Caruana, Mohamed Farid Elhawary, Art Munson, Mirek Riedewald, Daria Sorokina, Daniel Fink, Wesley M. Hochachka, Steve Kelling |
| 2006 | Mining distance-based outliers from large databases in any metric space. Yufei Tao, Xiaokui Xiao, Shuigeng Zhou |
| 2006 | Mining for misconfigured machines in grid systems. Noam Palatin, Arie Leizarowitz, Assaf Schuster, Ran Wolff |
| 2006 | Mining for proposal reviewers: lessons learned at the national science foundation. Seth Hettich, Michael J. Pazzani |
| 2006 | Mining long-term search history to improve search accuracy. Bin Tan, Xuehua Shen, ChengXiang Zhai |
| 2006 | Mining progressive confident rules. Minghua Zhang, Wynne Hsu, Mong-Li Lee |
| 2006 | Mining quantitative correlated patterns using an information-theoretic approach. Yiping Ke, James Cheng, Wilfred Ng |
| 2006 | Mining rank-correlated sets of numerical attributes. Toon Calders, Bart Goethals, Szymon Jaroszewicz |
| 2006 | Mining relational data through correlation-based multiple view validation. Hongyu Guo, Herna L. Viktor |
| 2006 | Model compression. Cristian Bucila, Rich Caruana, Alexandru Niculescu-Mizil |
| 2006 | Naïve filterbots for robust cold-start recommendations. Seung-Taek Park, David M. Pennock, Omid Madani, Nathan Good, Dennis DeCoste |
| 2006 | NeMoFinder: dissecting genome-wide protein-protein interactions with meso-scale network motifs. Jin Chen, Wynne Hsu, Mong-Li Lee, See-Kiong Ng |
| 2006 | New EM derived from Kullback-Leibler divergence. Longin Jan Latecki, Marc Sobel, Rolf Lakämper |
| 2006 | New cached-sufficient statistics algorithms for quickly answering statistical questions. Andrew W. Moore |
| 2006 | Next frontier. Rakesh Agrawal |
| 2006 | On privacy preservation against adversarial data mining. Charu C. Aggarwal, Jian Pei, Bo Zhang |
| 2006 | Onboard classifiers for science event detection on a remote sensing spacecraft. Rebecca Castaño, Dominic Mazzoni, Nghia Tang, Ronald Greeley, Thomas Doggett, Benjamin Cichy, Steve A. Chien, Ashley Davies |
| 2006 | Opportunity map: identifying causes of failure - a deployed data mining system. Kaidi Zhao, Bing Liu, Jeffrey Benkler, Weimin Xiao |
| 2006 | Orthogonal nonnegative matrix t-factorizations for clustering. Chris H. Q. Ding, Tao Li, Wei Peng, Haesun Park |
| 2006 | Out-of-core frequent pattern mining on a commodity PC. Gregory Buehrer, Srinivasan Parthasarathy, Amol Ghoting |
| 2006 | Outlier detection by active learning. Naoki Abe, Bianca Zadrozny, John Langford |
| 2006 | Outlier detection by sampling with accuracy guarantees. Mingxi Wu, Chris Jermaine |
| 2006 | Polynomial association rules with applications to logistic regression. Szymon Jaroszewicz |
| 2006 | Pragmatic text mining: minimizing human effort to quantify many issues in call logs. George Forman, Evan Kirshenbaum, Jaap Suermondt |
| 2006 | Proceedings of the Twelfth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Philadelphia, PA, USA, August 20-23, 2006 Tina Eliassi-Rad, Lyle H. Ungar, Mark Craven, Dimitrios Gunopulos |
| 2006 | Quantifying trends accurately despite classifier error and class imbalance. George Forman |
| 2006 | Query-time entity resolution. Indrajit Bhattacharya, Lise Getoor, Louis Licamele |
| 2006 | Recommendation method for extending subscription periods. Tomoharu Iwata, Kazumi Saito, Takeshi Yamada |
| 2006 | Reducing the human overhead in text categorization. Arnd Christian König, Eric Brill |
| 2006 | Regularized discriminant analysis for high dimensional, low sample size data. Jieping Ye, Tie Wang |
| 2006 | Reverse testing: an efficient framework to select amongst classifiers under sample selection bias. Wei Fan, Ian Davidson |
| 2006 | Robust information-theoretic clustering. Christian Böhm, Christos Faloutsos, Jia-Yu Pan, Claudia Plant |
| 2006 | Rule interestingness analysis using OLAP operations. Bing Liu, Kaidi Zhao, Jeffrey Benkler, Weimin Xiao |
| 2006 | Sampling from large graphs. Jure Leskovec, Christos Faloutsos |
| 2006 | Self-Organizing wireless sensor networks in action. John A. Stankovic |
| 2006 | Semi-supervised time series classification. Li Wei, Eamonn J. Keogh |
| 2006 | Simultaneous record detection and attribute labeling in web data extraction. Jun Zhu, Zaiqing Nie, Ji-Rong Wen, Bo Zhang, Wei-Ying Ma |
| 2006 | Single-pass online learning: performance, voting schemes and online feature selection. Vitor R. Carvalho, William W. Cohen |
| 2006 | Spatial scan statistics: approximations and performance study. Deepak Agarwal, Andrew McGregor, Jeff M. Phillips, Suresh Venkatasubramanian, Zhengyuan Zhu |
| 2006 | Statistical entity-topic models. David Newman, Chaitanya Chemudugunta, Padhraic Smyth |
| 2006 | Structure and evolution of online social networks. Ravi Kumar, Jasmine Novak, Andrew Tomkins |
| 2006 | Summarizing itemset patterns using probabilistic models. Chao Wang, Srinivasan Parthasarathy |
| 2006 | Supervised probabilistic principal component analysis. Shipeng Yu, Kai Yu, Volker Tresp, Hans-Peter Kriegel, Mingrui Wu |
| 2006 | Suppressing model overfitting in mining concept-drifting data streams. Haixun Wang, Jian Yin, Jian Pei, Philip S. Yu, Jeffrey Xu Yu |
| 2006 | Tensor-CUR decompositions for tensor-based data. Michael W. Mahoney, Mauro Maggioni, Petros Drineas |
| 2006 | Topics over time: a non-Markov continuous-time model of topical trends. Xuerui Wang, Andrew McCallum |
| 2006 | Training linear SVMs in linear time. Thorsten Joachims |
| 2006 | Understandable models Of music collections based on exhaustive feature generation with temporal statistics. Fabian Mörchen, Ingo Mierswa, Alfred Ultsch |
| 2006 | Unsupervised learning on k-partite graphs. Bo Long, Xiaoyun Wu, Zhongfei (Mark) Zhang, Philip S. Yu |
| 2006 | Using structure indices for efficient approximation of network properties. Matthew J. Rattigan, Marc E. Maier, David D. Jensen |
| 2006 | Utility-based anonymization using local recoding. Jian Xu, Wei Wang, Jian Pei, Xiaoyuan Wang, Baile Shi, Ada Wai-Chee Fu |
| 2006 | Very sparse random projections. Ping Li, Trevor Hastie, Kenneth Ward Church |
| 2006 | Visual data mining using principled projection algorithms and information visualization techniques. Dharmesh M. Maniyar, Ian T. Nabney |
| 2006 | Workload-aware anonymization. Kristen LeFevre, David J. DeWitt, Raghu Ramakrishnan |
| 2006 | YALE: rapid prototyping for complex data mining tasks. Ingo Mierswa, Michael Wurst, Ralf Klinkenberg, Martin Scholz, Timm Euler |