| 2003 | A Dynamic Adaptive Self-Organising Hybrid Model for Text Clustering. Chihli Hung, Stefan Wermter |
| 2003 | A Fast Algorithm for Computing Hypergraph Transversals and its Application in Mining Emerging Patterns. James Bailey, Thomas Manoukian, Kotagiri Ramamohanarao |
| 2003 | A Feature Selection Framework for Text Filtering. Zhaohui Zheng, Rohini K. Srihari, Sargur N. Srihari |
| 2003 | A High-Performance Distributed Algorithm for Mining Association Rules. Assaf Schuster, Ran Wolff, Dan Trock |
| 2003 | A Hybrid Data-Mining Approach in Genomics and Text Structures. Horia-Nicolai L. Teodorescu, Lucian Iulian Fira |
| 2003 | A K-NN Associated Fuzzy Evidential Reasoning Classifier with Adaptive Neighbor Selection. Hongwei Zhu, Otman A. Basir |
| 2003 | A User-driven and Quality-oriented Visualization for Mining Association Rules. Julien Blanchard, Fabrice Guillet, Henri Briand |
| 2003 | A new optimization criterion for generalized discriminant analysis on undersampled problems. Jieping Ye, Ravi Janardan, Cheong Hee Park, Haesun Park |
| 2003 | Active Sampling for Feature Selection. Sriharsha Veeramachaneni, Paolo Avesani |
| 2003 | Algorithms for Spatial Outlier Detection. Chang-Tien Lu, Dechang Chen, Yufeng Kou |
| 2003 | An Algebra for Inductive Query Evaluation. Sau Dan Lee, Luc De Raedt |
| 2003 | An Algorithm for the Exact Computation of the Centroid of Higher Dimensional Polyhedra and its Application to Kernel Machines. Frédéric Maire |
| 2003 | Analyzing High-Dimensional Data by Subspace Validity. Amihood Amir, Reuven Kashi, Nathan S. Netanyahu, Daniel A. Keim, Markus Wawryniuk |
| 2003 | Applying Noise Handling Techniques to Genomic Data: A Case Study. Choh Man Teng |
| 2003 | Association Rule Mining in Peer-to-Peer Systems. Ran Wolff, Assaf Schuster |
| 2003 | Bootstrapping Rule Induction. Lemuel R. Waitman, Douglas H. Fisher, Paul H. King |
| 2003 | Building Text Classifiers Using Positive and Unlabeled Examples. Bing Liu, Yang Dai, Xiaoli Li, Wee Sun Lee, Philip S. Yu |
| 2003 | CBC: Clustering Based Text Classification Requiring Minimal Labeled Data. Hua-Jun Zeng, Xuanhui Wang, Zheng Chen, Hongjun Lu, Wei-Ying Ma |
| 2003 | Center-Based Indexing for Nearest Neighbors Search. Arkadiusz Wojna |
| 2003 | Change Profiles. Taneli Mielikäinen |
| 2003 | Class Decomposition via Clustering: A New Framework for Low-Variance Classifiers. Ricardo Vilalta, Murali-Krishna Achari, Christoph F. Eick |
| 2003 | Clustering Item Data Sets with Association-Taxonomy Similarity. Ching-Huang Yun, Kun-Ta Chuang, Ming-Syan Chen |
| 2003 | Clustering of Time Series Subsequences is Meaningless: Implications for Previous and Future Research. Eamonn J. Keogh, Jessica Lin, Wagner Truppel |
| 2003 | CoMine: Efficient Mining of Correlated Patterns. Young-Koo Lee, Won-Young Kim, Y. Dora Cai, Jiawei Han |
| 2003 | Combining Multiple Weak Clusterings. Alexander P. Topchy, Anil K. Jain, William F. Punch |
| 2003 | Combining the web content and usage mining to understand the visitor behavior in a web site. Juan D. Velásquez, Hiroshi Yasuda, Terumasa Aoki |
| 2003 | Comparing Naive Bayes, Decision Trees, and SVM with AUC and Accuracy. Jin Huang, Jingjing Lu, Charles X. Ling |
| 2003 | Comparing Pure Parallel Ensemble Creation Techniques Against Bagging. Lawrence O. Hall, Kevin W. Bowyer, Robert E. Banfield, Divya Bhadoria, W. Philip Kegelmeyer, Steven Eschrich |
| 2003 | Complex Spatial Relationships. Robert Munro, Sanjay Chawla, Pei Sun |
| 2003 | Cost-Sensitive Learning by Cost-Proportionate Example Weighting. Bianca Zadrozny, John Langford, Naoki Abe |
| 2003 | Detecting Interesting Exceptions from Medical Test Data with Visual Summarization. Einoshin Suzuki, Takeshi Watanabe, Hideto Yokoi, Katsuhiko Takabayashi |
| 2003 | Detecting Patterns of Change Using Enhanced Parallel Coordinates Visualization. Kaidi Zhao, Bing Liu, Thomas M. Tirpak, Andreas Schaller |
| 2003 | Dimensionality Reduction Using Kernel Pooled Local Discriminant Information. Peng Zhang, Jing Peng, Carlotta Domeniconi |
| 2003 | Direct Interesting Rule Generation. Jiuyong Li, Yanchun Zhang |
| 2003 | Dynamic Weighted Majority: A New Ensemble Method for Tracking Concept Drift. Jeremy Z. Kolter, Marcus A. Maloof |
| 2003 | Effectiveness of Information Extraction, Multi-Relational, and Semi-Supervised Learning for Predicting Functional Properties of Genes. Mark-A. Krogel, Tobias Scheffer |
| 2003 | Efficient Data Mining for Maximal Frequent Subtrees. Yongqiao Xiao, Jenq-Foung JF Yao, Zhigang Li, Margaret H. Dunham |
| 2003 | Efficient Mining of Frequent Subgraphs in the Presence of Isomorphism. Jun Huan, Wei Wang, Jan F. Prins |
| 2003 | Efficient Multidimensional Quantitative Hypotheses Generation. Amihood Amir, Reuven Kashi, Nathan S. Netanyahu |
| 2003 | Efficient Nonlinear Dimension Reduction for Clustered Data Using Kernel Functions. Cheong Hee Park, Haesun Park |
| 2003 | Efficient Subsequence Matching in Time Series Databases Under Time and Amplitude Transformations. Tassos Argyros, Charis Ermopoulos |
| 2003 | Enhancing Techniques for Efficient Topic Hierarchy Integration. Jyh-Jong Tsay, Hsuan-Yu Chen, Chi-Feng Chang, Ching-Han Lin |
| 2003 | Ensembles of Cascading Trees. Jinyan Li, Huiqing Liu |
| 2003 | Evolutionary Gabor Filter Optimization with Application to Vehicle Detection. Zehang Sun, George Bebis, Ronald Miller |
| 2003 | ExAMiner: Optimized Level-wise Frequent Pattern Mining with Monotone Constraint. Francesco Bonchi, Fosca Giannotti, Alessio Mazzanti, Dino Pedreschi |
| 2003 | Exploiting Unlabeled Data for Improving Accuracy of Predictive Data Mining. Kang Peng, Slobodan Vucetic, Bo Han, Hongbo M. Xie, Zoran Obradovic |
| 2003 | Facilitating Fuzzy Association Rules Mining by Using Multi-Objective Genetic Algorithms for Automated Clustering. Mehmet Kaya, Reda Alhajj |
| 2003 | Fast PNN-based Clustering Using K-nearest Neighbor Graph. Pasi Fränti, Olli Virmajoki, Ville Hautamäki |
| 2003 | Findings from a Practical Project Concerning Web Usage Mining. Frank Dellmann, Holger Wulff, Stefan Schmitz |
| 2003 | Frequent Sub-Structure-Based Approaches for Classifying Chemical Compounds. Mukund Deshpande, Michihiro Kuramochi, George Karypis |
| 2003 | Frequent-Pattern based Iterative Projected Clustering. Man Lung Yiu, Nikos Mamoulis |
| 2003 | General MC: Estimating Boundary of Positive Class from Small Positive Data. Hwanjo Yu |
| 2003 | Icon-based Visualization of Large High-Dimensional Datasets. Ping Chen, Chenyi Hu, Wei Ding, Heloise Lynn, Yves Simon |
| 2003 | Identifying Markov Blankets with Decision Tree Induction. Lewis J. Frey, Douglas H. Fisher, Ioannis Tsamardinos, Constantin F. Aliferis, Alexander R. Statnikov |
| 2003 | Impact Studies and Sensitivity Analysis in Medical Data Mining with ROC-based Genetic Learning. Michèle Sebag, Jérôme Azé, Noël Lucas |
| 2003 | Improving Home Automation by Discovering Regularly Occurring Device Usage Patterns. Edwin O. Heierman III, Diane J. Cook |
| 2003 | Indexing and Mining Free Trees. Yun Chi, Yirong Yang, Richard R. Muntz |
| 2003 | Inference of Protein-Protein Interactions by Unlikely Profile Pair. Byung-Hoon Park, George Ostrouchov, Gong-Xin Yu, Al Geist, Andrey Gorin, Nagiza F. Samatova |
| 2003 | Information Theoretic Clustering of Sparse Co-Occurrence Data. Inderjit S. Dhillon, Yuqiang Guan |
| 2003 | Integrating Customer Value Considerations into Predictive Modeling. Saharon Rosset, Einat Neumann |
| 2003 | Integrating Fuzziness into OLAP for Multidimensional Fuzzy Association Rules Mining. Reda Alhajj, Mehmet Kaya |
| 2003 | Interactive Visualization and Navigation in Large Data Collections using the Hyperbolic Space. Jörg A. Walter, Jörg Ontrup, Daniel Wessling, Helge J. Ritter |
| 2003 | Interpretations of Association Rules by Granular Computing. Yuefeng Li, Ning Zhong |
| 2003 | Introducing Uncertainty into Pattern Discovery in Temporal Event Sequences. Xingzhi Sun, Maria E. Orlowska, Xue Li |
| 2003 | Is random model better? On its accuracy and efficiency. Wei Fan, Haixun Wang, Philip S. Yu, Sheng Ma |
| 2003 | K-D Decision Tree: An Accelerated and Memory Efficient Nearest Neighbor Classifier. Tomoyuki Shibata, Takekazu Kato, Toshikazu Wada |
| 2003 | Learning Bayesian Networks from Incomplete Data Based on EMI Method. Fengzhan Tian, Hongwei Zhang, Yuchang Lu |
| 2003 | Learning Rules for Anomaly Detection of Hostile Network Traffic. Matthew V. Mahoney, Philip K. Chan |
| 2003 | Links Between Kleinberg's Hubs and Authorities, Correspondence Analysis, and Markov Chains. François Fouss, Marco Saerens, Jean-Michel Renders |
| 2003 | Localized Prediction of Continuous Target Variables Using Hierarchical Clustering. Aleksandar Lazarevic, Ramdev Kanapady, Chandrika Kamath, Vipin Kumar, Kumar K. Tamma |
| 2003 | MPIS: Maximal-Profit Item Selection with Cross-Selling Considerations. Raymond Chi-Wing Wong, Ada Wai-Chee Fu, Ke Wang |
| 2003 | MaPle: A Fast Algorithm for Maximal Pattern-based Clustering. Jian Pei, Xiaoling Zhang, Moonjung Cho, Haixun Wang, Philip S. Yu |
| 2003 | Mining Frequent Itemsets in Distributed and Dynamic Databases. Matthew Eric Otey, Chao Wang, Srinivasan Parthasarathy, Adriano Veloso, Wagner Meira Jr. |
| 2003 | Mining High Utility Itemsets. Raymond Chan, Qiang Yang, Yi-Dong Shen |
| 2003 | Mining Plans for Customer-Class Transformation. Qiang Yang, Hong Cheng |
| 2003 | Mining Production Data with Neural Network & CART. Mingkun Li, Shuo Feng, Ishwar K. Sethi, Jason Luciow, Keith Wagner |
| 2003 | Mining Relevant Text from Unlabelled Documents. Daniel Barbará, Carlotta Domeniconi, Ning Kang |
| 2003 | Mining Semantic Networks for Knowledge Discovery. Kanagasabai Rajaraman, Ah-Hwee Tan |
| 2003 | Mining Significant Pairs of Patterns from Graph Structures with Class Labels. Akihiro Inokuchi, Hisashi Kashima |
| 2003 | Mining Strong Affinity Association Patterns in Data Sets with Skewed Support Distribution. Hui Xiong, Pang-Ning Tan, Vipin Kumar |
| 2003 | Mining the Web to Discover the Meanings of an Ambiguous Word. Raz Tamir, Reinhard Rapp |
| 2003 | Model Stability: A key factor in determining whether an algorithm produces an optimal model from a matching distribution. Kai Ming Ting, Regina Jing Ying Quek |
| 2003 | OP-Cluster: Clustering by Tendency in High Dimensional Space. Jinze Liu, Wei Wang |
| 2003 | Objective and Subjective Algorithms for Grouping Association Rules. Aijun An, Shakil M. Khan, Xiangji Huang |
| 2003 | On Precision and Recall of Multi-Attribute Data Extraction from Semistructured Sources. Guizhen Yang, Saikat Mukherjee, I. V. Ramakrishnan |
| 2003 | On the Privacy Preserving Properties of Random Data Perturbation Techniques. Hillol Kargupta, Souptik Datta, Qi Wang, Krishnamoorthy Sivakumar |
| 2003 | Ontologies Improve Text Document Clustering. Andreas Hotho, Steffen Staab, Gerd Stumme |
| 2003 | Optimized Disjunctive Association Rules via Sampling. Joseph Elble, Cinda Heeren, Leonard Pitt |
| 2003 | Parsing Without a Grammar: Making Sense of Unknown File Formats. Levon Lloyd, Steven Skiena |
| 2003 | Pattern Discovery based on Rule Induction and Taxonomy Generation. Shusaku Tsumoto, Shoji Hirano |
| 2003 | PixelMaps: A New Visual Data Mining Approach for Analyzing Large Spatial Data Sets. Daniel A. Keim, Christian Panse, Mike Sips, Stephen C. North |
| 2003 | Postprocessing Decision Trees to Extract Actionable Knowledge. Qiang Yang, Jie Yin, Charles X. Ling, Tielin Chen |
| 2003 | Predicting distribution of a new forest disease using one-class SVMs. Qinghua Guo, Maggi Kelly, Catherine Graham |
| 2003 | Privacy-Preserving Collaborative Filtering Using Randomized Perturbation Techniques. Huseyin Polat, Wenliang Du |
| 2003 | Privacy-preserving Distributed Clustering using Generative Models. Srujana Merugu, Joydeep Ghosh |
| 2003 | Probabilistic Noise Identification and Data Cleaning. Jeremy Kubica, Andrew W. Moore |
| 2003 | Probabilistic User Behavior Models. Eren Manavoglu, Dmitry Pavlov, C. Lee Giles |
| 2003 | Proceedings of the 3rd IEEE International Conference on Data Mining (ICDM 2003), 19-22 November 2003, Melbourne, Florida, USA |
| 2003 | Protecting Sensitive Knowledge By Data Sanitization. Stanley R. M. Oliveira, Osmar R. Zaïane |
| 2003 | Regression Clustering. Bin Zhang |
| 2003 | Regulatory Element Discovery Using Tree-structured Models. Tu Minh Phuong, Doheon Lee, Kwang Hyung Lee |
| 2003 | Reliable Detection of Episodes in Event Sequences. Robert Gwadera, Mikhail J. Atallah, Wojciech Szpankowski |
| 2003 | SVM Based Models for Predicting Foreign Currency Exchange Rates. Joarder Kamruzzaman, Ruhul A. Sarker, Iftekhar Ahmad |
| 2003 | Scalable Model-based Clustering by Working on Data Summaries. Huidong Jin, Man Leung Wong, Kwong-Sak Leung |
| 2003 | Segmenting Customer Transactions Using a Pattern-Based Clustering Approach. Yinghui Yang, Balaji Padmanabhan |
| 2003 | Semantic Log Analysis Based on a User Query Behavior Model. Noriaki Kawamae, Takeya Mukaigaito, Hanaki Miyoshi |
| 2003 | Semantic Role Parsing: Adding Semantic Structure to Unstructured Text. Sameer S. Pradhan, Kadri Hacioglu, Wayne H. Ward, James H. Martin, Daniel Jurafsky |
| 2003 | Sentiment Analyzer: Extracting Sentiments about a Given Topic using Natural Language Processing Techniques. Jeonghee Yi, Tetsuya Nasukawa, Razvan C. Bunescu, Wayne Niblack |
| 2003 | Sequence Modeling with Mixtures of Conditional Maximum Entropy Distributions. Dmitry Pavlov |
| 2003 | Simple Estimators for Relational Bayesian Classifiers. Jennifer Neville, David D. Jensen, Brian Gallagher |
| 2003 | Spatial Interest Pixels (SIPs): Useful Low-Level Features of Visual Media Data. Qi Li, Jieping Ye, Chandra Kambhamettu |
| 2003 | Statistical Relational Learning for Document Mining. Alexandrin Popescul, Lyle H. Ungar, Steve Lawrence, David M. Pennock |
| 2003 | Structure Search and Stability Enhancement of Bayesian Networks. Hanchuan Peng, Chris H. Q. Ding |
| 2003 | T-Trees, Vertical Partitioning and Distributed Association Rule Mining. Frans Coenen, Paul H. Leng, Shakil Ahmed |
| 2003 | TECNO-STREAMS: Tracking Evolving Clusters in Noisy Data Streams with a Scalable Immune System Learning Model. Olfa Nasraoui, Cesar Cardona Uribe, Carlos Rojas Coronel, Fabio A. González |
| 2003 | TSP: Mining Top-K Closed Sequential Patterns. Petre Tzvetkov, Xifeng Yan, Jiawei Han |
| 2003 | Text Mining for a Clear Picture of Defect Reports: A Praxis Report. Jutta Kreyß, Steve Selvaggio, Michael White, Zach Zakharian |
| 2003 | The Hybrid Poisson Aspect Model for Personalized Shopping Recommendation. Chun-Nan Hsu, Hao-Hsiang Chung, Han-Shen Huang |
| 2003 | The Rough Set Approach to Association Rule Mining. J. W. Guan, David A. Bell, Dayou Liu |
| 2003 | Towards Simple, Easy-to-Understand, yet Accurate Classifiers. Doina Caragea, Dianne Cook, Vasant G. Honavar |
| 2003 | Tractable Group Detection on Large Link Data Sets. Jeremy Kubica, Andrew W. Moore, Jeff G. Schneider |
| 2003 | Tree-structured Partitioning Based on Splitting Histograms of Distances. Longin Jan Latecki, Rajagopal Venugopal, Marc Sobel, Steve Horvat |
| 2003 | Understanding Helicoverpa armigera Pest Population Dynamics related to Chickpea Crop Using Neural Networks. Rajat Gupta, B. V. L. Narayana, P. Krishna Reddy, G. V. Ranga Rao, C. L. L. Gowda, Y. V. R. Reddy, Garimella Rama Murthy |
| 2003 | Unsupervised Link Discovery in Multi-relational Data via Rarity Analysis. Shou-De Lin, Hans Chalupsky |
| 2003 | Using Discriminant Analysis for Multi-class Classification. Tao Li, Shenghuo Zhu, Mitsunori Ogihara |
| 2003 | Validating and Refining Clusters via Visual Rendering. Keke Chen, Ling Liu |
| 2003 | Visualization of Rule's Similarity using Multidimensional Scaling. Shusaku Tsumoto, Shoji Hirano |
| 2003 | Zigzag: a new algorithm for mining large inclusion dependencies in database. Fabien De Marchi, Jean-Marc Petit |