| 2007 | A Cascaded Approach to Biomedical Named Entity Recognition Using a Unified Model. Shing-Kit Chan, Wai Lam, Xiaofeng Yu |
| 2007 | A Computational Approach to Style in American Poetry. David M. Kaplan, David M. Blei |
| 2007 | A Generalization of Proximity Functions for K-Means. Junjie Wu, Hui Xiong, Jian Chen, Wenjun Zhou |
| 2007 | A Novel Criterion for Onset Detection: Differential Information Redundancy with Application to Human Movement Initiation. Gert Van Dijck, Marc M. Van Hulle, Jo Van Vaerenbergh |
| 2007 | A Pairwise Covariance-Preserving Projection Method for Dimension Reduction. Xiaoming Liu, Zhaohui Wang, Zhilin Feng, Jinshan Tang |
| 2007 | A Semantic Kernel for Semi-structured DocumentS. Sujeevan Aseervatham, Emmanuel Viennet, Younès Bennani |
| 2007 | A Support Vector Approach to Censored Targets. Pannagadatta K. Shivaswamy, Wei Chu, Martin Jansche |
| 2007 | A Text Classification Framework with a Local Feature Ranking for Learning Social Networks. Masoud Makrehchi, Mohamed S. Kamel |
| 2007 | Active Learning from Data Streams. Xingquan Zhu, Peng Zhang, Xiaodong Lin, Yong Shi |
| 2007 | An Efficient Spectral Algorithm for Network Community Discovery and Its Applications to Biological and Social Networks. Jianhua Ruan, Weixiong Zhang |
| 2007 | Analyzing and Detecting Review Spam. Nitin Jindal, Bing Liu |
| 2007 | Bandit-Based Algorithms for Budgeted Learning. Kun Deng, Chris Bourke, Stephen Scott, Julie Sunderman, Yaling Zheng |
| 2007 | Bayesian Folding-In with Dirichlet Kernels for PLSI. Alexander Hinneburg, Hans-Henning Gabriel, André Gohr |
| 2007 | Binary Matrix Factorization with Applications. Zhongyuan Zhang, Tao Li, Chris H. Q. Ding, Xiang-Sun Zhang |
| 2007 | Can the Content of Public News Be Used to Forecast Abnormal Stock Market Behaviour? Calum S. Robertson, Shlomo Geva, Rodney C. Wolff |
| 2007 | Change-Point Detection in Time-Series Data Based on Subspace Identification. Yoshinobu Kawahara, Takehisa Yairi, Kazuo Machida |
| 2007 | Clustering Needles in a Haystack: An Information Theoretic Analysis of Minority and Outlier Detection. Shin Ando |
| 2007 | Co-ranking Authors and Documents in a Heterogeneous Network. Ding Zhou, Sergey A. Orshanskiy, Hongyuan Zha, C. Lee Giles |
| 2007 | Cocktail Ensemble for Regression. Yang Yu, Zhi-Hua Zhou, Kai Ming Ting |
| 2007 | Community Learning by Graph Approximation. Bo Long, Xiaoyun Xu, Zhongfei (Mark) Zhang, Philip S. Yu |
| 2007 | Computing Correlation Anomaly Scores Using Stochastic Nearest Neighbors. Tsuyoshi Idé, Spiros Papadimitriou, Michail Vlachos |
| 2007 | Confident Identification of Relevant Objects Based on Nonlinear Rescaling Method and Transductive Inference. Shen-Shyang Ho, Roman A. Polyak |
| 2007 | Connections between Mining Frequent Itemsets and Learning Generative Models. Srivatsan Laxman, Prasad Naldurg, Raja Sripada, Ramarathnam Venkatesan |
| 2007 | Consensus Clusterings. Nam Nguyen, Rich Caruana |
| 2007 | Cross-Mining Binary and Numerical Attributes. Gemma C. Garriga, Hannes Heikinheimo, Jouni K. Seppänen |
| 2007 | DUSC: Dimensionality Unbiased Subspace Clustering. Ira Assent, Ralph Krieger, Emmanuel Müller, Thomas Seidl |
| 2007 | Data Discretization Unification. Ruoming Jin, Yuri Breitbart, Chibuike Muoh |
| 2007 | Depth-Based Novelty Detection and Its Application to Taxonomic Research. Yixin Chen, Henry L. Bart Jr., Xin Dang, Hanxiang Peng |
| 2007 | Detecting Fractures in Classifier Performance. David A. Cieslak, Nitesh V. Chawla |
| 2007 | Detecting Subdimensional Motifs: An Efficient Algorithm for Generalized Multivariate Pattern Discovery. David Minnen, Charles L. Isbell Jr., Irfan A. Essa, Thad Starner |
| 2007 | Discovering Temporal Communities from Social Network Documents. Ding Zhou, Isaac G. Councill, Hongyuan Zha, C. Lee Giles |
| 2007 | Disk Aware Discord Discovery: Finding Unusual Time Series in Terabyte Sized Datasets. Dragomir Yankov, Eamonn J. Keogh, Umaa Rebbapragada |
| 2007 | Document Transformation for Multi-label Feature Selection in Text Categorization. Weizhu Chen, Jun Yan, Benyu Zhang, Zheng Chen, Qiang Yang |
| 2007 | Dynamic Micro Targeting: Fitness-Based Approach to Predicting Individual Preferences. Tianyi Jiang, Alexander Tuzhilin |
| 2007 | Efficient Algorithms for Mining Significant Substructures in Graphs with Quality Guarantees. Huahai He, Ambuj K. Singh |
| 2007 | Efficient Data Sampling in Heterogeneous Peer-to-Peer Networks. Benjamin Arai, Song Lin, Dimitrios Gunopulos |
| 2007 | Efficient Discovery of Frequent Approximate Sequential Patterns. Feida Zhu, Xifeng Yan, Jiawei Han, Philip S. Yu |
| 2007 | Efficient Kernel Discriminant Analysis via Spectral Regression. Deng Cai, Xiaofei He, Jiawei Han |
| 2007 | Exploration of Link Structure and Community-Based Node Roles in Network Analysis. Jerry Scripps, Pang-Ning Tan, Abdol-Hossein Esfahanian |
| 2007 | Extracting Product Comparisons from Discussion Boards. Ronen Feldman, Moshe Fresko, Jacob Goldenberg, Oded Netzer, Lyle H. Ungar |
| 2007 | Failure Prediction in IBM BlueGene/L Event Logs. Yinglung Liang, Yanyong Zhang, Hui Xiong, Ramendra K. Sahoo |
| 2007 | Finding Cohesive Clusters for Analyzing Knowledge Communities. Vasileios Kandylas, S. Phineas Upham, Lyle H. Ungar |
| 2007 | Finding Predictive Runs with LAPS. Suhrid Balakrishnan, David Madigan |
| 2007 | General Averaged Divergence Analysis. Dacheng Tao, Xuelong Li, Xindong Wu, Stephen J. Maybank |
| 2007 | High-Speed Function Approximation. Biswanath Panda, Mirek Riedewald, Johannes Gehrke, Stephen B. Pope |
| 2007 | How Much Noise Is Too Much: A Study in Automatic Text Classification. Sumeet Agarwal, Shantanu Godbole, Diwakar Punjani, Shourya Roy |
| 2007 | Improving Knowledge Discovery in Document Collections through Combining Text Retrieval and Link Analysis Techniques. Wei Jin, Rohini K. Srihari, Hung Hay Ho, Xin Wu |
| 2007 | Improving Text Classification by Using Encyclopedia Knowledge. Pu Wang, Jian Hu, Hua-Jun Zeng, Lijun Chen, Zheng Chen |
| 2007 | Incorporating User Provided Constraints into Document Clustering. Yanhua Chen, Manjeet Rege, Ming Dong, Jing Hua |
| 2007 | Incremental Subspace Clustering over Multiple Data Streams. Qi Zhang, Jinze Liu, Wei Wang |
| 2007 | Language-Independent Set Expansion of Named Entities Using the Web. Richard C. Wang, William W. Cohen |
| 2007 | Latent Dirichlet Conditional Naive-Bayes Models. Arindam Banerjee, Hanhuai Shan |
| 2007 | Lazy Bagging for Classifying Imbalanced Data. Xingquan Zhu |
| 2007 | Lightweight Distributed Trust Propagation. Daniele Quercia, Stephen Hailes, Licia Capra |
| 2007 | Local Probabilistic Models for Link Prediction. Chao Wang, Venu Satuluri, Srinivasan Parthasarathy |
| 2007 | Local Word Bag Model for Text Categorization. Wen Pu, Ning Liu, Shuicheng Yan, Jun Yan, Kunqing Xie, Zheng Chen |
| 2007 | Locally Constrained Support Vector Clustering. Dragomir Yankov, Eamonn J. Keogh, Kin Fai Kan |
| 2007 | Maximum Entropy Based Significance of Itemsets. Nikolaj Tatti |
| 2007 | Mechanism Design for Clustering Aggregation by Selfish Systems. Pinata Winoto, Yiu-ming Cheung, Jiming Liu |
| 2007 | Mining Frequent Itemsets in a Stream. Toon Calders, Nele Dexters, Bart Goethals |
| 2007 | Mining Interpretable Human Strategies: A Case Study. Xiaoli Z. Fern, Chaitanya Komireddy, Margaret M. Burnett |
| 2007 | Mining Statistical Information of Frequent Fault-Tolerant Patterns in Transactional Databases. Ardian Kristanto Poernomo, Vivekanand Gopalkrishnan |
| 2007 | Multilevel Belief Propagation for Fast Inference on Markov Random Fields. Liang Xiong, Fei Wang, Changshui Zhang |
| 2007 | Noise Modeling with Associative Corruption Rules. Yan Zhang, Xindong Wu |
| 2007 | Non-redundant Multi-view Clustering via Orthogonalization. Ying Cui, Xiaoli Z. Fern, Jennifer G. Dy |
| 2007 | ORIGAMI: Mining Representative Orthogonal Graph Patterns. Mohammad Al Hasan, Vineet Chaoji, Saeed Salem, Jérémy Besson, Mohammed Javeed Zaki |
| 2007 | On Appropriate Assumptions to Mine Data Streams: Analysis and Practice. Jing Gao, Wei Fan, Jiawei Han |
| 2007 | On Meta-Learning Rule Learning Heuristics. Frederik Janssen, Johannes Fürnkranz |
| 2007 | Optimal Subsequence Bijection. Longin Jan Latecki, Qiang Wang, Suzan Köknar-Tezel, Vasileios Megalooikonomou |
| 2007 | Optimizing Frequency Queries for Data Mining Applications. Hassan H. Malik, John R. Kender |
| 2007 | Parallel Mining of Frequent Closed Patterns: Harnessing Modern Computer Architectures. Claudio Lucchese, Salvatore Orlando, Raffaele Perego |
| 2007 | Predicting Blogging Behavior Using Temporal and Social Networks. Bi Chen, Qiankun Zhao, Bingjun Sun, Prasenjit Mitra |
| 2007 | Preserving Privacy through Data Generation. Jilles Vreeken, Matthijs van Leeuwen, Arno Siebes |
| 2007 | Prism: A Primal-Encoding Approach for Frequent Sequence Mining. Karam Gouda, Mosab Hassaan, Mohammed Javeed Zaki |
| 2007 | Proceedings of the 7th IEEE International Conference on Data Mining (ICDM 2007), October 28-31, 2007, Omaha, Nebraska, USA |
| 2007 | Recommendation via Query Centered Random Walk on K-Partite Graph. Haibin Cheng, Pang-Ning Tan, Jon Sticklen, William F. Punch |
| 2007 | Rule Cubes for Causal Investigations. Axel Blumenstock, Franz Schweiggert, Markus Müller |
| 2007 | Sample Selection for Maximal Diversity. Feng Pan, Adam Roberts, Leonard McMillan, David Threadgill, Wei Wang |
| 2007 | Sampling for Sequential Pattern Mining: From Static Databases to Data Streams. Chedy Raïssi, Pascal Poncelet |
| 2007 | Scalable Collaborative Filtering with Jointly Derived Neighborhood Interpolation Weights. Robert M. Bell, Yehuda Koren |
| 2007 | Semi-supervised Document Clustering via Active Learning with Pairwise Constraints. Ruizhang Huang, Wai Lam |
| 2007 | Social Network Extraction of Academic Researchers. Jie Tang, Duo Zhang, Limin Yao |
| 2007 | Solving Consensus and Semi-supervised Clustering Problems Using Nonnegative Matrix Factorization. Tao Li, Chris H. Q. Ding, Michael I. Jordan |
| 2007 | Spectral Regression: A Unified Approach for Sparse Subspace Learning. Deng Cai, Xiaofei He, Jiawei Han |
| 2007 | Statistical Learning Algorithm for Tree Similarity. Atsuhiro Takasu, Daiji Fukagawa, Tatsuya Akutsu |
| 2007 | Structure-Based Statistical Features and Multivariate Time Series Clustering. Xiaozhe Wang, Anthony Wirth, Liang Wang |
| 2007 | Succinct Matrix Approximation and Efficient k-NN Classification. Rong Liu, Yong Shi |
| 2007 | Supervised Learning by Training on Aggregate Outputs. David R. Musicant, Janara M. Christensen, Jamie F. Olson |
| 2007 | Temporal Analysis of Semantic Graphs Using ASALSAN. Brett W. Bader, Richard A. Harshman, Tamara G. Kolda |
| 2007 | The Chosen Few: On Identifying Valuable Patterns. Björn Bringmann, Albrecht Zimmermann |
| 2007 | Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval. Xuerui Wang, Andrew McCallum, Xing Wei |
| 2007 | Training Conditional Random Fields by Periodic Step Size Adaptation for Large-Scale Text Mining. Han-Shen Huang, Yu-Ming Chang, Chun-Nan Hsu |
| 2007 | Transitional Patterns and Their Significant Milestones. Qian Wan, Aijun An |
| 2007 | Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks. Ruoming Jin, Scott McCallen, Eivind Almaas |
| 2007 | Understanding Discrete Classifiers with a Case Study in Gene Prediction. Muhammad Subianto, Arno Siebes |
| 2007 | Using Burstiness to Improve Clustering of Topics in News Streams. Qi He, Kuiyu Chang, Ee-Peng Lim |
| 2007 | Using Significant, Positively Associated and Relatively Class Correlated Rules for Associative Classification of Imbalanced Datasets. Florian Verhein, Sanjay Chawla |
| 2007 | Web Site Recommendation Using HTTP Traffic. Ming Jia, Shaozhi Ye, Xing Li, Julie A. Dickerson |
| 2007 | Weighted Additive Criterion for Linear Dimension Reduction. Jing Peng, Stefan A. Robila |
| 2007 | Zonal Co-location Pattern Discovery with Dynamic Parameters. Mete Celik, James M. Kang, Shashi Shekhar |
| 2007 | estMax: Tracing Maximal Frequent Itemsets over Online Data Streams. Ho Jin Woo, Won Suk Lee |
| 2007 | gApprox: Mining Frequent Approximate Patterns from a Massive Network. Chen Chen, Xifeng Yan, Feida Zhu, Jiawei Han |