SDM A

76 papers

Year  Title / Authors
2007  A Better Alternative to Piecewise Linear Time Series Segmentation.
      Daniel Lemire
2007  A General Framework for Mining Concept-Drifting Data Streams with Skewed Distributions.
      Jing Gao, Wei Fan, Jiawei Han, Philip S. Yu
2007  A PAC Bound for Approximate Support Vector Machines.
      Dongwei Cao, Daniel Boley
2007  A System for Keyword Search on Textual Streams.
      Vagelis Hristidis, Oscar Valdivia, Michail Vlachos, Philip S. Yu
2007  AC-Framework for Privacy-Preserving Collaboration.
      Wei Jiang, Chris Clifton
2007  Active Learning of Constraints for Semi-supervised Text Clustering.
      Ruizhang Huang, Wai Lam, Zhigang Zhang
2007  Adaptive Concept Learning through Clustering and Aggregation of Relational Data.
      Hichem Frigui, Cheul Hwang
2007  An Analysis of Logistic Models: Exponential Family Connections and Online Performance.
      Arindam Banerjee
2007  An incremental data-stream sketch using sparse random projections.
      Aditya Krishna Menon, Gia Vinh Anh Pham, Sanjay Chawla, Anastasios Viglas
2007  Approximating Representations for Large Numerical Databases.
      Szymon Jaroszewicz, Marcin Korzen
2007  Are approximation algorithms for consensus clustering worthwhile?
      Michael Bertolacci, Anthony Wirth
2007  Bandits for Taxonomies: A Model-based Approach.
      Sandeep Pandey, Deepak Agarwal, Deepayan Chakrabarti, Vanja Josifovski
2007  Boosting Optimal Logical Patterns Using Noisy Data.
      Noam Goldberg, Chung-chieh Shan
2007  Bursty Feature Representation for Clustering Text Streams.
      Qi He, Kuiyu Chang, Ee-Peng Lim, Jun Zhang
2007  Change-Point Detection using Krylov Subspace Learning.
      Tsuyoshi Idé, Koji Tsuda
2007  Clustering by weighted cuts in directed graphs.
      Marina Meila, William Pentney
2007  Co-Preserving Patterns in Bipartite Partitioning for Topic Identification.
      Tianming Hu, Hui Xiong, Sam Yuan Sung
2007  Computing Statistical Profiles of Active Sites in Proteins.
      Chang Zhao, Jalal Mahmud, I. V. Ramakrishnan, Subramanyam Swaminathan
2007  Conical Dimension as an Intrinsic Dimension Estimator and its Applications.
      Xin Yang, Sebastien Michea, Hongyuan Zha
2007  Constraint-Based Pattern Set Mining.
      Luc De Raedt, Albrecht Zimmermann
2007  Discriminating Subsequence Discovery for Sequence Clustering.
      Jianyong Wang, Yuzhou Zhang, Lizhu Zhou, George Karypis, Charu C. Aggarwal
2007  Distance Preserving Dimension Reduction for Manifold Learning.
      Hyunsoo Kim, Haesun Park, Hongyuan Zha
2007  Distributed Top-K Outlier Detection from Astronomy Catalogs using the DEMAC System.
      Haimonti Dutta, Chris Giannella, Kirk D. Borne, Hillol Kargupta
2007  Dynamic Algorithm for Graph Clustering Using Minimum Cut Tree.
      Barna Saha, Pabitra Mitra
2007  Efficient Multiclass Boosting Classification with Active Learning.
      Jian Huang, Seyda Ertekin, Yang Song, Hongyuan Zha, C. Lee Giles
2007  Estimating False Negatives for Classification Problems with Cluster Structure.
      György J. Simon, Vipin Kumar, Zhi-Li Zhang
2007  Fast Best-Match Shape Searching in Rotation Invariant Metric Spaces.
      Dragomir Yankov, Eamonn J. Keogh, Li Wei, Xiaopeng Xi, Wendy L. Hodges
2007  Fast Counting with AV-Space for Efficient Rule Induction.
      Linyan Wang, Aijun An
2007  Fast Multilevel Transduction on Graphs.
      Fei Wang, Changshui Zhang
2007  Fast Newton-type Methods for the Least Squares Nonnegative Matrix Approximation Problem.
      Dongmin Kim, Suvrit Sra, Inderjit S. Dhillon
2007  Finding Motifs in a Database of Shapes.
      Xiaopeng Xi, Eamonn J. Keogh, Li Wei, Agenor Mafra-Neto
2007  Flexible Anonymization For Privacy Preserving Data Publishing: A Systematic Search Based Approach.
      Bijit Hore, Ravi Chandra Jammalamadaka, Sharad Mehrotra
2007  HACS: Heuristic Algorithm for Clustering Subsets.
      Ding Yuan, W. Nick Street
2007  HP2PC: Scalable Hierarchically-Distributed Peer-to-Peer Clustering.
      Khaled M. Hammouda, Mohamed S. Kamel
2007  Harmonium Models for Semantic Video Representation and Classification.
      Jun Yang, Yan Liu, Eric P. Xing, Alexander G. Hauptmann
2007  Higher Order Orthogonal Iteration of Tensors (HOOI) and its Relation to PCA and GLRAM.
      Bernard N. Sheehan, Yousef Saad
2007  Identifying Bundles of Product Options using Mutual Information Clustering.
      Claudia Perlich, Saharon Rosset
2007  Incremental Spectral Clustering With Application to Monitoring of Evolving Blog Communities.
      Huazhong Ning, Wei Xu, Yun Chi, Yihong Gong, Thomas S. Huang
2007  Kernel Based Detection of Mislabeled Training Examples.
      Hamed Valizadegan, Pang-Ning Tan
2007  Lattice based Clustering of Temporal Gene-Expression Matrices.
      Yang Huang, Martin Farach-Colton
2007  Learning from Time-Changing Data with Adaptive Windowing.
      Albert Bifet, Ricard Gavaldà
2007  Less is More: Compact Matrix Decomposition for Large Sparse Graphs.
      Jimeng Sun, Yinglian Xie, Hui Zhang, Christos Faloutsos
2007  Load Shedding in Classifying Multi-Source Streaming Data: A Bayes Risk Approach.
      Yijian Bai, Haixun Wang, Carlo Zaniolo
2007  Localized Support Vector Machine and Its Efficient Algorithm.
      Haibin Cheng, Pang-Ning Tan, Rong Jin
2007  Maximizing the Area under the ROC Curve with Decision Lists and Rule Sets.
      Henrik Boström
2007  Maximum Margin Classifiers with Specified False Positive and False Negative Error Rates.
      J. Saketha Nath, Chiranjib Bhattacharyya
2007  Mining Naturally Smooth Evolution of Clusters from Dynamic Data.
      Yi Wang, Shi-Xia Liu, Jianhua Feng, Lizhu Zhou
2007  Mining Visual and Textual Data for Constructing a Multi-Modal Thesaurus.
      Hichem Frigui, Joshua Caudill
2007  Multi-way Clustering on Relation Graphs.
      Arindam Banerjee, Sugato Basu, Srujana Merugu
2007  Nonlinear Dimensionality Reduction using Approximate Nearest Neighbors.
      Erion Plaku, Lydia E. Kavraki
2007  On Anonymization of String Data.
      Charu C. Aggarwal, Philip S. Yu
2007  On Demand Phenotype Ranking through Subspace Clustering.
      Xiang Zhang, Wei Wang, Jun Huan
2007  On Point Sampling Versus Space Sampling for Dimensionality Reduction.
      Charu C. Aggarwal
2007  On Privacy-Preservation of Text and Sparse Binary Data with Sketches.
      Charu C. Aggarwal, Philip S. Yu
2007  On Sample Selection Bias and Its Efficient Correction via Model Averaging and Unlabeled Examples.
      Wei Fan, Ian Davidson
2007  Patterns of Cascading Behavior in Large Blog Graphs.
      Jure Leskovec, Mary McGlohon, Christos Faloutsos, Natalie S. Glance, Matthew Hurst
2007  Performance of Recommendation Systems in Dynamic Streaming Environments.
      Olfa Nasraoui, Jeff Cerwinske, Carlos Rojas, Fabio A. González
2007  PoClustering: Lossless Clustering of Dissimilarity Data.
      Jinze Liu, Qi Zhang, Wei Wang, Leonard McMillan, Jan F. Prins
2007  Preventing Information Leaks in Email.
      Vitor R. Carvalho, William W. Cohen
2007  Probabilistic Joint Feature Selection for Multi-task Learning.
      Tao Xiong, Jinbo Bi, R. Bharat Rao, Vladimir Cherkassky
2007  Proceedings of the Seventh SIAM International Conference on Data Mining, April 26-28, 2007, Minneapolis, Minnesota, USA
2007  RCMap: Efficiently Creating High-Quality Euclidean Embeddings.
      Arun Qamra, Edward Y. Chang
2007  ROAM: Rule- and Motif-Based Anomaly Detection in Massive Moving Object Data Sets.
      Xiaolei Li, Jiawei Han, Sangkyum Kim, Hector Gonzalez
2007  Rank Aggregation for Similar Items.
      D. Sculley
2007  Robust, Complete, and Efficient Correlation Clustering.
      Elke Achtert, Christian Böhm, Hans-Peter Kriegel, Peer Kröger, Arthur Zimek
2007  Scalable Name Disambiguation using Multi-level Graph Partition.
      Byung-Won On, Dongwon Lee
2007  Segmentations with Rearrangements.
      Aristides Gionis, Evimaria Terzi
2007  Semi-Supervised Dimensionality Reduction.
      Daoqiang Zhang, Zhi-Hua Zhou, Songcan Chen
2007  Semi-supervised Feature Selection via Spectral Analysis.
      Zheng Zhao, Huan Liu
2007  Sketching Landscapes of Page Farms.
      Bin Zhou, Jian Pei
2007  Stacked Graphical Models for Efficient Inference in Markov Random Fields.
      Zhenzhen Kou, William W. Cohen
2007  Summarizing Review Scores of "Unequal" Reviewers.
      Hady Wirawan Lauw, Ee-Peng Lim, Ke Wang
2007  Topic Models over Text Streams: A Study of Batch and Online Unsupervised Learning.
      Arindam Banerjee, Sugato Basu
2007  Towards Attack-Resilient Geometric Data Perturbation.
      Keke Chen, Gordon Sun, Ling Liu
2007  Understanding and Utilizing the Hierarchy of Abnormal BGP Events.
      Dejing Dou, Jun Li, Han Qin, Shiwoong Kim, Sheng Zhong
2007  WAT: Finding Top-K Discords in Time Series Database.
      Yingyi Bu, Oscar Tat-Wing Leung, Ada Wai-Chee Fu, Eamonn J. Keogh, Jian Pei, Sam Meshkin