| 2002 | A model for discovering customer value for E-content. Srinivasan Jagannathan, Jayanth Nayak, Kevin C. Almeroth, Markus Hofmann |
| 2002 | A new two-phase sampling based algorithm for discovering association rules. Bin Chen, Peter J. Haas, Peter Scheuermann |
| 2002 | A parallel learning algorithm for text classification. Canasai Kruengkrai, Chuleerat Jaruskulchai |
| 2002 | A refinement approach to handling model misfit in text categorization. Haoran Wu, Tong-Heng Phang, Bing Liu, Xiaoli Li |
| 2002 | A robust and efficient clustering algorithm based on cohesion self-merging. Cheng-Ru Lin, Ming-Syan Chen |
| 2002 | A system for real-time competitive market intelligence. Sholom M. Weiss, Naval K. Verma |
| 2002 | A theoretical framework for learning from a pool of disparate data sources. Shai Ben-David, Johannes Gehrke, Reba Schuller |
| 2002 | A unifying framework for detecting outliers and change points from non-stationary time series data. Kenji Yamanishi, Jun'ichi Takeuchi |
| 2002 | ADMIT: anomaly-based data mining for intrusions. Karlton Sequeira, Mohammed Javeed Zaki |
| 2002 | ANF: a fast and scalable tool for data mining in massive graphs. Christopher R. Palmer, Phillip B. Gibbons, Christos Faloutsos |
| 2002 | B-EM: a classifier incorporating bootstrap with EM approach for data mining. Xintao Wu, Jianping Fan, Kalpathi R. Subramanian |
| 2002 | Bayesian analysis of massive datasets via particle filters. Greg Ridgeway, David Madigan |
| 2002 | Bursty and hierarchical structure in streams. Jon M. Kleinberg |
| 2002 | CLOPE: a fast and effective clustering algorithm for transactional data. Yiling Yang, Xudong Guan, Jinyuan You |
| 2002 | CVS: a Correlation-Verification based Smoothing technique on information retrieval and term clustering. Christina Yip Chung, Bin Chen |
| 2002 | Clustering seasonality patterns in the presence of errors. Mahesh Kumar, Nitin R. Patel, Jonathan Woo |
| 2002 | Collaborative crawling: mining user experiences for topical resource discovery. Charu C. Aggarwal |
| 2002 | Collusion in the U.S. crop insurance program: applied data mining. Bertis B. Little, Walter L. Johnston, Ashley C. Lovell, Roderick M. Rejesus, Steve A. Steed |
| 2002 | Combining clustering and co-training to enhance text classification using unlabelled data. Bhavani Raskutti, Herman L. Ferrá, Adam Kowalczyk |
| 2002 | Construct robust rule sets for classification. Jiuyong Li, Rodney W. Topor, Hong Shen |
| 2002 | Customer lifetime value modeling and its use for customer retention planning. Saharon Rosset, Einat Neumann, Uri Eick, Nurit Vatnik, Yizhak Idan |
| 2002 | Discovering informative content blocks from Web documents. Shian-Hua Lin, Jan-Ming Ho |
| 2002 | Discovering word senses from text. Patrick Pantel, Dekang Lin |
| 2002 | Discovery net: towards a grid of knowledge discovery. Vasa Curcin, Moustafa Ghanem, Yike Guo, Martin Köhler, Anthony Rowe, Jameel Syed, Patrick Wendel |
| 2002 | Distributed data mining in a chain store database of short transactions. Cheng-Ru Lin, Chang-Hung Lee, Ming-Syan Chen, Philip S. Yu |
| 2002 | DualMiner: a dual-pruning algorithm for itemsets with constraints. Cristian Bucila, Johannes Gehrke, Daniel Kifer, Walker M. White |
| 2002 | Efficient handling of high-dimensional feature spaces by randomized classifier ensembles. Aleksander Kolcz, Xiaomei Sun, Jugal K. Kalita |
| 2002 | Efficiently mining frequent trees in a forest. Mohammed Javeed Zaki |
| 2002 | Enhanced word clustering for hierarchical text classification. Inderjit S. Dhillon, Subramanyam Mallela, Rahul Kumar |
| 2002 | Evaluating classifiers' performance in a constrained environment. Anna Olecka |
| 2002 | Exploiting response models: optimizing cross-sell and up-sell opportunities in banking. Andrew Storey, Marc-David Cohen |
| 2002 | Exploiting unlabeled data in ensemble methods. Kristin P. Bennett, Ayhan Demiriz, Richard Maclin |
| 2002 | Extracting decision trees from trained neural networks. Olcay Boz |
| 2002 | Finding surprising patterns in a time series database in linear time and space. Eamonn J. Keogh, Stefano Lonardi, Bill Yuan-chi Chiu |
| 2002 | Frequent term-based text clustering. Florian Beil, Martin Ester, Xiaowei Xu |
| 2002 | From run-time behavior to usage scenarios: an interaction-pattern mining approach. Mohammad El-Ramly, Eleni Stroulia, Paul G. Sorenson |
| 2002 | Handling very large numbers of association rules in the analysis of microarray data. Alexander Tuzhilin, Gediminas Adomavicius |
| 2002 | Hierarchical model-based clustering of large datasets through fractionation and refractionation. Jeremy Tantrum, Alejandro Murua, Werner Stuetzle |
| 2002 | Incremental context mining for adaptive document classification. Rey-Long Liu, Yun-Ling Lu |
| 2002 | Instability of decision tree classification algorithms. Ruey-Hsia Li, Geneva G. Belford |
| 2002 | Integrating feature and instance selection for text classification. Dimitris Fragoudis, Dimitris Meretakis, Spiros Likothanassis |
| 2002 | Interactive deduplication using active learning. Sunita Sarawagi, Anuradha Bhamidipaty |
| 2002 | Item selection by "hub-authority" profit ranking. Ke Wang, Ming-Yen Thomas Su |
| 2002 | Learning domain-independent string transformation weights for high accuracy object identification. Sheila Tejada, Craig A. Knoblock, Steven Minton |
| 2002 | Learning nonstationary models of normal network traffic for detecting novel attacks. Matthew V. Mahoney, Philip K. Chan |
| 2002 | Learning to match and cluster large high-dimensional data sets for data integration. William W. Cohen, Jacob Richman |
| 2002 | MARK: a boosting algorithm for heterogeneous kernel models. Kristin P. Bennett, Michinari Momma, Mark J. Embrechts |
| 2002 | Making every bit count: fast nonlinear axis scaling. Leejay Wu, Christos Faloutsos |
| 2002 | Mining complex models from arbitrarily large databases in constant time. Geoff Hulten, Pedro M. Domingos |
| 2002 | Mining frequent item sets by opportunistic projection. Junqiang Liu, Yunhe Pan, Ke Wang, Jiawei Han |
| 2002 | Mining heterogeneous gene expression data with time lagged recurrent neural networks. Yulan Liang, Arpad Kelemen |
| 2002 | Mining intrusion detection alarms for actionable knowledge. Klaus Julisch, Marc Dacier |
| 2002 | Mining knowledge-sharing sites for viral marketing. Matthew Richardson, Pedro M. Domingos |
| 2002 | Mining product reputations on the Web. Satoshi Morinaga, Kenji Yamanishi, Kenji Tateishi, Toshikazu Fukushima |
| 2002 | Non-linear dimensionality reduction techniques for classification and visualization. Michail Vlachos, Carlotta Domeniconi, Dimitrios Gunopulos, George Kollios, Nick Koudas |
| 2002 | On effective classification of strings with wavelets. Charu C. Aggarwal |
| 2002 | On interactive visualization of high-dimensional data using the hyperbolic plane. Jörg A. Walter, Helge J. Ritter |
| 2002 | On the need for time series data mining benchmarks: a survey and empirical demonstration. Eamonn J. Keogh, Shruti Kasetty |
| 2002 | On the potential of domain literature for clustering and Bayesian network learning. Peter Antal, Patrick Glenisson, Geert Fannes |
| 2002 | Optimizing search engines using clickthrough data. Thorsten Joachims |
| 2002 | PEBL: positive example based learning for Web page classification using SVM. Hwanjo Yu, Jiawei Han, Kevin Chen-Chuan Chang |
| 2002 | Pattern discovery in sequences under a Markov assumption. Darya Chudova, Padhraic Smyth |
| 2002 | Predicting rare classes: can boosting make any weak learner strong? Mahesh V. Joshi, Ramesh C. Agarwal, Vipin Kumar |
| 2002 | Privacy preserving association rule mining in vertically partitioned data. Jaideep Vaidya, Chris Clifton |
| 2002 | Privacy preserving mining of association rules. Alexandre V. Evfimievski, Ramakrishnan Srikant, Rakesh Agrawal, Johannes Gehrke |
| 2002 | Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, July 23-26, 2002, Edmonton, Alberta, Canada |
| 2002 | Query, analysis, and visualization of hierarchically structured data using Polaris. Chris Stolte, Diane Tang, Pat Hanrahan |
| 2002 | Querying multiple sets of discovered rules. Alexander Tuzhilin, Bing Liu |
| 2002 | Relational Markov models and their application to adaptive web navigation. Corin R. Anderson, Pedro M. Domingos, Daniel S. Weld |
| 2002 | SECRET: a scalable linear regression tree algorithm. Alin Dobra, Johannes Gehrke |
| 2002 | Scalable robust covariance and correlation estimates for data mining. Fatemah A. Alqallaf, Kjell P. Konis, R. Douglas Martin, Ruben H. Zamar |
| 2002 | Scaling multi-class support vector machines using inter-class confusion. Shantanu Godbole, Sunita Sarawagi, Soumen Chakrabarti |
| 2002 | Selecting the right interestingness measure for association patterns. Pang-Ning Tan, Vipin Kumar, Jaideep Srivastava |
| 2002 | Sequential PAttern mining using a bitmap representation. Jay Ayres, Jason Flannick, Johannes Gehrke, Tomi Yiu |
| 2002 | Sequential cost-sensitive decision making with reinforcement learning. Edwin P. D. Pednault, Naoki Abe, Bianca Zadrozny |
| 2002 | Shrinkage estimator generalizations of Proximal Support Vector Machines. Deepak K. Agarwal |
| 2002 | SimRank: a measure of structural-context similarity. Glen Jeh, Jennifer Widom |
| 2002 | Similarity measure based on partial information of time series. Xiaoming Jin, Yuchang Lu, Chunyi Shi |
| 2002 | Single-shot detection of multiple categories of text using parametric mixture models. Naonori Ueda, Kazumi Saito |
| 2002 | SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets. Hichem Frigui |
| 2002 | Tina Eliassi-Rad, Terence Critchlow, Ghaleb Abdulla. Tina Eliassi-Rad, Terence Critchlow, Ghaleb Abdulla |
| 2002 | Topic-conditioned novelty detection. Yiming Yang, Jian Zhang, Jaime G. Carbonell, Chun Jin |
| 2002 | Topics in 0--1 data. Ella Bingham, Heikki Mannila, Jouni K. Seppänen |
| 2002 | Transforming classifier scores into accurate multiclass probability estimates. Bianca Zadrozny, Charles Elkan |
| 2002 | Transforming data to satisfy privacy constraints. Vijay S. Iyengar |
| 2002 | Tumor cell identification using features rules. Bin Fang, Wynne Hsu, Mong-Li Lee |
| 2002 | Visualization support for a user-centered KDD process. Tu Bao Ho, Trong Dung Nguyen, DucDung Nguyen |
| 2002 | Web site mining: a new way to spot competitors, customers and suppliers in the world wide web. Martin Ester, Hans-Peter Kriegel, Matthias Schubert |
| 2002 | What's the code?: automatic classification of source code archives. Secil Ugurel, Robert Krovetz, C. Lee Giles |