| 2007 | A Better Alternative to Piecewise Linear Time Series Segmentation. Daniel Lemire |
| 2007 | A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions. Jing Gao, Wei Fan, Jiawei Han, Philip S. Yu |
| 2007 | A PAC Bound for Approximate Support Vector Machines. Dongwei Cao, Daniel Boley |
| 2007 | A System for Keyword Search on Textual Streams. Vagelis Hristidis, Oscar Valdivia, Michail Vlachos, Philip S. Yu |
| 2007 | AC-Framework for Privacy-Preserving Collaboration. Wei Jiang, Chris Clifton |
| 2007 | Active Learning of Constraints for Semi-supervised Text Clustering. Ruizhang Huang, Wai Lam, Zhigang Zhang |
| 2007 | Adaptive Concept Learning through Clustering and Aggregation of Relational Data. Hichem Frigui, Cheul Hwang |
| 2007 | An Analysis of Logistic Models: Exponential Family Connections and Online Performance. Arindam Banerjee |
| 2007 | An incremental data-stream sketch using sparse random projections. Aditya Krishna Menon, Gia Vinh Anh Pham, Sanjay Chawla, Anastasios Viglas |
| 2007 | Approximating Representations for Large Numerical Databases. Szymon Jaroszewicz, Marcin Korzen |
| 2007 | Are approximation algorithms for consensus clustering worthwhile?. Michael Bertolacci, Anthony Wirth |
| 2007 | Bandits for Taxonomies: A Model-based Approach. Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, Vanja Josifovski |
| 2007 | Boosting Optimal Logical Patterns Using Noisy Data. Noam Goldberg, Chung-chieh Shan |
| 2007 | Bursty Feature Representation for Clustering Text Streams. Qi He, Kuiyu Chang, Ee-Peng Lim, Jun Zhang |
| 2007 | Change-Point Detection using Krylov Subspace Learning. Tsuyoshi Idé, Koji Tsuda |
| 2007 | Clustering by weighted cuts in directed graphs. Marina Meila, William Pentney |
| 2007 | Co-Preserving Patterns in Bipartite Partitioning for Topic Identification. Tianming Hu, Hui Xiong, Sam Yuan Sung |
| 2007 | Computing Statistical Profiles of Active Sites in Proteins. Chang Zhao, Jalal Mahmud, I. V. Ramakrishnan, Subramanyam Swaminathan |
| 2007 | Conical Dimension as an Intrinsic Dimension Estimator and its Applications. Xin Yang, Sebastien Michea, Hongyuan Zha |
| 2007 | Constraint-Based Pattern Set Mining. Luc De Raedt, Albrecht Zimmermann |
| 2007 | Discriminating Subsequence Discovery for Sequence Clustering. Jianyong Wang, Yuzhou Zhang, Lizhu Zhou, George Karypis, Charu C. Aggarwal |
| 2007 | Distance Preserving Dimension Reduction for Manifold Learning. Hyunsoo Kim, Haesun Park, Hongyuan Zha |
| 2007 | Distributed Top-K Outlier Detection from Astronomy Catalogs using the DEMAC System. Haimonti Dutta, Chris Giannella, Kirk D. Borne, Hillol Kargupta |
| 2007 | Dynamic Algorithm for Graph Clustering Using Minimum Cut Tree. Barna Saha, Pabitra Mitra |
| 2007 | Efficient Multiclass Boosting Classification with Active Learning. Jian Huang, Seyda Ertekin, Yang Song, Hongyuan Zha, C. Lee Giles |
| 2007 | Estimating False Negatives for Classification Problems with Cluster Structure. György J. Simon, Vipin Kumar, Zhi-Li Zhang |
| 2007 | Fast Best-Match Shape Searching in Rotation Invariant Metric Spaces. Dragomir Yankov, Eamonn J. Keogh, Li Wei, Xiaopeng Xi, Wendy L. Hodges |
| 2007 | Fast Counting with AV-Space for Efficient Rule Induction. Linyan Wang, Aijun An |
| 2007 | Fast Multilevel Transduction on Graphs. Fei Wang, Changshui Zhang |
| 2007 | Fast Newton-type Methods for the Least Squares Nonnegative Matrix Approximation Problem. Dongmin Kim, Suvrit Sra, Inderjit S. Dhillon |
| 2007 | Finding Motifs in a Database of Shapes. Xiaopeng Xi, Eamonn J. Keogh, Li Wei, Agenor Mafra-Neto |
| 2007 | Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach. Bijit Hore, Ravi Chandra Jammalamadaka, Sharad Mehrotra |
| 2007 | HACS: Heuristic Algorithm for Clustering Subsets. Ding Yuan, W. Nick Street |
| 2007 | HP2PC: Scalable Hierarchically-Distributed Peer-to-Peer Clustering. Khaled M. Hammouda, Mohamed S. Kamel |
| 2007 | Harmonium Models for Semantic Video Representation and Classification. Jun Yang, Yan Liu, Eric P. Xing, Alexander G. Hauptmann |
| 2007 | Higher Order Orthogonal Iteration of Tensors (HOOI) and its Relation to PCA and GLRAM. Bernard N. Sheehan, Yousef Saad |
| 2007 | Identifying Bundles of Product Options using Mutual Information Clustering. Claudia Perlich, Saharon Rosset |
| 2007 | Incremental Spectral Clustering With Application to Monitoring of Evolving Blog Communities. Huazhong Ning, Wei Xu, Yun Chi, Yihong Gong, Thomas S. Huang |
| 2007 | Kernel Based Detection of Mislabeled Training Examples. Hamed Valizadegan, Pang-Ning Tan |
| 2007 | Lattice based Clustering of Temporal Gene-Expression Matrices. Yang Huang, Martin Farach-Colton |
| 2007 | Learning from Time-Changing Data with Adaptive Windowing. Albert Bifet, Ricard Gavaldà |
| 2007 | Less is More: Compact Matrix Decomposition for Large Sparse Graphs. Jimeng Sun, Yinglian Xie, Hui Zhang, Christos Faloutsos |
| 2007 | Load Shedding in Classifying Multi-Source Streaming Data: A Bayes Risk Approach. Yijian Bai, Haixun Wang, Carlo Zaniolo |
| 2007 | Localized Support Vector Machine and Its Efficient Algorithm. Haibin Cheng, Pang-Ning Tan, Rong Jin |
| 2007 | Maximizing the Area under the ROC Curve with Decision Lists and Rule Sets. Henrik Boström |
| 2007 | Maximum Margin Classifiers with Specified False Positive and False Negative Error Rates. J. Saketha Nath, Chiranjib Bhattacharyya |
| 2007 | Mining Naturally Smooth Evolution of Clusters from Dynamic Data. Yi Wang, Shi-Xia Liu, Jianhua Feng, Lizhu Zhou |
| 2007 | Mining Visual and Textual Data for Constructing a Multi-Modal Thesaurus. Hichem Frigui, Joshua Caudill |
| 2007 | Multi-way Clustering on Relation Graphs. Arindam Banerjee, Sugato Basu, Srujana Merugu |
| 2007 | Nonlinear Dimensionality Reduction using Approximate Nearest Neighbors. Erion Plaku, Lydia E. Kavraki |
| 2007 | On Anonymization of String Data. Charu C. Aggarwal, Philip S. Yu |
| 2007 | On Demand Phenotype Ranking through Subspace Clustering. Xiang Zhang, Wei Wang, Jun Huan |
| 2007 | On Point Sampling Versus Space Sampling for Dimensionality Reduction. Charu C. Aggarwal |
| 2007 | On Privacy-Preservation of Text and Sparse Binary Data with Sketches. Charu C. Aggarwal, Philip S. Yu |
| 2007 | On Sample Selection Bias and Its Efficient Correction via Model Averaging and Unlabeled Examples. Wei Fan, Ian Davidson |
| 2007 | Patterns of Cascading Behavior in Large Blog Graphs. Jure Leskovec, Mary McGlohon, Christos Faloutsos, Natalie S. Glance, Matthew Hurst |
| 2007 | Performance of Recommendation Systems in Dynamic Streaming Environments. Olfa Nasraoui, Jeff Cerwinske, Carlos Rojas, Fabio A. González |
| 2007 | PoClustering: Lossless Clustering of Dissimilarity Data. Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, Jan F. Prins |
| 2007 | Preventing Information Leaks in Email. Vitor R. Carvalho, William W. Cohen |
| 2007 | Probabilistic Joint Feature Selection for Multi-task Learning. Tao Xiong, Jinbo Bi, R. Bharat Rao, Vladimir Cherkassky |
| 2007 | Proceedings of the Seventh SIAM International Conference on Data Mining, April 26-28, 2007, Minneapolis, Minnesota, USA |
| 2007 | RCMap: Efficiently Creating High-Quality Euclidean Embeddings. Arun Qamra, Edward Y. Chang |
| 2007 | ROAM: Rule- and Motif-Based Anomaly Detection in Massive Moving Object Data Sets. Xiaolei Li, Jiawei Han, Sangkyum Kim, Hector Gonzalez |
| 2007 | Rank Aggregation for Similar Items. D. Sculley |
| 2007 | Robust, Complete, and Efficient Correlation Clustering. Elke Achtert, Christian Böhm, Hans-Peter Kriegel, Peer Kröger, Arthur Zimek |
| 2007 | Scalable Name Disambiguation using Multi-level Graph Partition. Byung-Won On, Dongwon Lee |
| 2007 | Segmentations with Rearrangements. Aristides Gionis, Evimaria Terzi |
| 2007 | Semi-Supervised Dimensionality Reduction. Daoqiang Zhang, Zhi-Hua Zhou, Songcan Chen |
| 2007 | Semi-supervised Feature Selection via Spectral Analysis. Zheng Zhao, Huan Liu |
| 2007 | Sketching Landscapes of Page Farms. Bin Zhou, Jian Pei |
| 2007 | Stacked Graphical Models for Efficient Inference in Markov Random Fields. Zhenzhen Kou, William W. Cohen |
| 2007 | Summarizing Review Scores of "Unequal" Reviewers. Hady Wirawan Lauw, Ee-Peng Lim, Ke Wang |
| 2007 | Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning. Arindam Banerjee, Sugato Basu |
| 2007 | Towards Attack-Resilient Geometric Data Perturbation. Keke Chen, Gordon Sun, Ling Liu |
| 2007 | Understanding and Utilizing the Hierarchy of Abnormal BGP Events. Dejing Dou, Jun Li, Han Qin, Shiwoong Kim, Sheng Zhong |
| 2007 | WAT: Finding Top-K Discords in Time Series Database. Yingyi Bu, Oscar Tat-Wing Leung, Ada Wai-Chee Fu, Eamonn J. Keogh, Jian Pei, Sam Meshkin |