KDD A*

115 papers

YearTitle / Authors
2007A concept-based model for enhancing text categorization.
Shady Shehata, Fakhri Karray, Mohamed Kamel
2007A fast algorithm for finding frequent episodes in event streams.
Srivatsan Laxman, P. S. Sastry, K. P. Unnikrishnan
2007A framework for classification and segmentation of massive audio data streams.
Charu C. Aggarwal
2007A framework for community identification in dynamic social networks.
Chayant Tantipathananandh, Tanya Y. Berger-Wolf, David Kempe
2007A framework for simultaneous co-clustering and learning from complex data.
Meghana Deodhar, Joydeep Ghosh
2007A learning framework using Green's function and kernel regularization with application to recommender system.
Chris H. Q. Ding, Rong Jin, Tao Li, Horst D. Simon
2007A probabilistic framework for relational clustering.
Bo Long, Zhongfei (Mark) Zhang, Philip S. Yu
2007A scalable modular convex solver for regularized risk minimization.
Choon Hui Teo, Alexander J. Smola, S. V. N. Vishwanathan, Quoc V. Le
2007A spectral clustering approach to optimally combining numericalvectors with a modular network.
Motoki Shiga, Ichigaku Takigawa, Hiroshi Mamitsuka
2007Active exploration for learning rankings from clickthrough data.
Filip Radlinski, Thorsten Joachims
2007An event-based framework for characterizing the evolutionary behavior of interaction graphs.
Sitaram Asur, Srinivasan Parthasarathy, Duygu Ucar
2007Applying collaborative filtering techniques to movie search for better ranking and browsing.
Seung-Taek Park, David M. Pennock
2007Association analysis-based transformations for protein interaction networks: a function prediction case study.
Gaurav Pandey, Michael S. Steinbach, Rohit Gupta, Tushar Garg, Vipin Kumar
2007Automatic labeling of multinomial topic models.
Qiaozhu Mei, Xuehua Shen, ChengXiang Zhai
2007BoostCluster: boosting clustering by pairwise constraints.
Yi Liu, Rong Jin, Anil K. Jain
2007Calculating latent demand in the long tail.
Chris Anderson
2007Canonicalization of database records using adaptive similarity measures.
Aron Culotta, Michael L. Wick, Robert J. Hall, Matthew Marzilli, Andrew McCallum
2007Challenges in mining social network data: processes, privacy, and paradoxes.
Jon M. Kleinberg
2007Characterising the difference.
Jilles Vreeken, Matthijs van Leeuwen, Arno Siebes
2007Cleaning disguised missing data: a heuristic approach.
Ming Hua, Jian Pei
2007Co-clustering based classification for out-of-domain documents.
Wenyuan Dai, Gui-Rong Xue, Qiang Yang, Yong Yu
2007Constraint-driven clustering.
Rong Ge, Martin Ester, Wen Jin, Ian Davidson
2007Content-based document routing and index partitioning for scalable similarity-based searches in a large corpus.
Deepavali Bhagwat, Kave Eshghi, Pankaj Mehra
2007Correlation search in graph databases.
Yiping Ke, James Cheng, Wilfred Ng
2007Corroborate and learn facts from the web.
Shubin Zhao, Jonathan Betz
2007Cost-effective outbreak detection in networks.
Jure Leskovec, Andreas Krause, Carlos Guestrin, Christos Faloutsos, Jeanne M. VanBriesen, Natalie S. Glance
2007Cross-language information retrieval using PARAFAC2.
Peter A. Chew, Brett W. Bader, Tamara G. Kolda, Ahmed Abdelali
2007Data mining at the crossroads: successes, failures and learning from them.
Srinivasan Parthasarathy
2007Density-based clustering for real-time stream data.
Yixin Chen, Li Tu
2007Detecting anomalous records in categorical datasets.
Kaustav Das, Jeff G. Schneider
2007Detecting changes in large data sets of payment card data: a case study.
Chris Curry, Robert L. Grossman, David Locke, Steve Vejcik, Joseph Bugajski
2007Detecting research topics via the correlation between graphs and texts.
Yookyung Jo, Carl Lagoze, C. Lee Giles
2007Detecting time series motifs under uniform scaling.
Dragomir Yankov, Eamonn J. Keogh, Jose Medina, Bill Yuan-chi Chiu, Victor B. Zordan
2007Development of NeuroElectroMagnetic ontologies(NEMO): a framework for mining brainwave ontologies.
Dejing Dou, Gwen A. Frishkoff, Jiawei Rong, Robert M. Frank, Allen D. Malony, Don M. Tucker
2007Discovering the hidden structure of house prices with a non-parametric latent manifold model.
Sumit Chopra, Trivikraman Thampy, John Leahy, Andrew Caplin, Yann LeCun
2007Distributed classification in peer-to-peer networks.
Ping Luo, Hui Xiong, Kevin Lü, Zhongzhi Shi
2007Domain-constrained semi-supervised mining of tracking models in sensor networks.
Rong Pan, Junhui Zhao, Vincent Wenchen Zheng, Jeffrey Junfeng Pan, Dou Shen, Sinno Jialin Pan, Qiang Yang
2007Dynamic hybrid clustering of bioinformatics by incorporating text mining and citation analysis.
Frizo A. L. Janssens, Wolfgang Glänzel, Bart De Moor
2007Efficient and effective explanation of change in hierarchical summaries.
Deepak Agarwal, Dhiman Barman, Dimitrios Gunopulos, Neal E. Young, Flip Korn, Divesh Srivastava
2007Efficient incremental constrained clustering.
Ian Davidson, S. S. Ravi, Martin Ester
2007Efficient mining of iterative patterns for software specification discovery.
David Lo, Siau-Cheng Khoo, Chao Liu
2007Enhanced max margin learning on multimodal data mining in a multimedia database.
Zhen Guo, Zhongfei Zhang, Eric P. Xing, Christos Faloutsos
2007Enhancing semi-supervised clustering: a feature projection perspective.
Wei Tang, Hui Xiong, Shi Zhong, Jie Wu
2007Estimating rates of rare events at multiple resolutions.
Deepak Agarwal, Andrei Z. Broder, Deepayan Chakrabarti, Dejan Diklic, Vanja Josifovski, Mayssam Sayyadian
2007Event summarization for system management.
Wei Peng, Charles Perng, Tao Li, Haixun Wang
2007Evolutionary spectral clustering by incorporating temporal smoothness.
Yun Chi, Xiaodan Song, Dengyong Zhou, Koji Hino, Belle L. Tseng
2007Expertise modeling for matching papers with reviewers.
David M. Mimno, Andrew McCallum
2007Exploiting duality in summarization with deterministic guarantees.
Panagiotis Karras, Dimitris Sacharidis, Nikos Mamoulis
2007Exploiting underrepresented query aspects for automatic query expansion.
Daniel Crabtree, Peter Andreae, Xiaoying Gao
2007Extracting relevant named entities for automated expense reimbursement.
Guangyu Zhu, Timothy J. Bethea, Vikas Krishna
2007Extracting semantic relations from query logs.
Ricardo A. Baeza-Yates, Alessandro Tiberi
2007Fast best-effort pattern matching in large attributed graphs.
Hanghang Tong, Christos Faloutsos, Brian Gallagher, Tina Eliassi-Rad
2007Fast direction-aware proximity for graph mining.
Hanghang Tong, Christos Faloutsos, Yehuda Koren
2007Feature selection methods for text classification.
Anirban Dasgupta, Petros Drineas, Boulos Harb, Vanja Josifovski, Michael W. Mahoney
2007Finding low-entropy sets and trees from binary data.
Hannes Heikinheimo, Jouni K. Seppänen, Eino Hinkkanen, Heikki Mannila, Taneli Mielikäinen
2007Finding tribes: identifying close-knit individuals from employment patterns.
Lisa Friedland, David D. Jensen
2007From frequent itemsets to semantically meaningful visual patterns.
Junsong Yuan, Ying Wu, Ming Yang
2007From mining the web to inventing the new sciences underlying the internet.
Usama M. Fayyad
2007Generalized component analysis for text with heterogeneous attributes.
Xuerui Wang, Chris Pal, Andrew McCallum
2007GraphScope: parameter-free mining of large time-evolving graphs.
Jimeng Sun, Christos Faloutsos, Spiros Papadimitriou, Philip S. Yu
2007Hierarchical mixture models: a probabilistic analysis.
Mark Sandler
2007High-quantile modeling for customer wallet estimation and other applications.
Claudia Perlich, Saharon Rosset, Richard D. Lawrence, Bianca Zadrozny
2007IMDS: intelligent malware detection system.
Yanfang Ye, Dingding Wang, Tao Li, Dongyi Ye
2007Information distance from a question to an answer.
Xian Zhang, Yu Hao, Xiaoyan Zhu, Ming Li, David R. Cheriton
2007Information genealogy: uncovering the flow of ideas in non-hyperlinked document databases.
Benyah Shaparenko, Thorsten Joachims
2007Joint cluster analysis of attribute and relationship data withouta-priori specification of the number of clusters.
Flavia Moser, Rong Ge, Martin Ester
2007Joint optimization of wrapper generation and template detection.
Shuyi Zheng, Ruihua Song, Ji-Rong Wen, Di Wu
2007Knowledge discovery of multiple-topic document using parametric mixture model with dirichlet prior.
Issei Sato, Hiroshi Nakagawa
2007Learning the kernel matrix in discriminant analysis via quadratically constrained quadratic programming.
Jieping Ye, Shuiwang Ji, Jianhui Chen
2007Local decomposition for rare class analysis.
Junjie Wu, Hui Xiong, Peng Wu, Jian Chen
2007LungCAD: a clinically approved, machine learning system for lung cancer detection.
R. Bharat Rao, Jinbo Bi, Glenn Fung, Marcos Salganicoff, Nancy Obuchowski, David P. Naidich
2007Machine learning for stock selection.
Robert J. Yan, Charles X. Ling
2007Making generative classifiers robust to selection bias.
Andrew T. Smith, Charles Elkan
2007Mining complex power networks for blackout prevention.
Junhua Zhao, Zhao Yang Dong, Pei Zhang
2007Mining correlated bursty topic patterns from coordinated text streams.
Xuanhui Wang, ChengXiang Zhai, Xiao Hu, Richard Sproat
2007Mining favorable facets.
Raymond Chi-Wing Wong, Jian Pei, Ada Wai-Chee Fu, Ke Wang
2007Mining optimal decision trees from itemset lattices.
Siegfried Nijssen, Élisa Fromont
2007Mining statistically important equivalence classes and delta-discriminative emerging patterns.
Jinyan Li, Guimei Liu, Limsoon Wong
2007Mining templates from search result records of search engines.
Hongkun Zhao, Weiyi Meng, Clement T. Yu
2007Model-shared subspace boosting for multi-label classification.
Rong Yan, Jelena Tesic, John R. Smith
2007Modeling relationships at multiple scales to improve accuracy of large recommender systems.
Robert M. Bell, Yehuda Koren, Chris Volinsky
2007Multiscale topic tomography.
Ramesh Nallapati, Susan Ditmore, John D. Lafferty, Kin Ung
2007Nestedness and segmented nestedness.
Heikki Mannila, Evimaria Terzi
2007Nonlinear adaptive distance metric learning for clustering.
Jianhui Chen, Zheng Zhao, Jieping Ye, Huan Liu
2007On string classification in data streams.
Charu C. Aggarwal, Philip S. Yu
2007On-board analysis of uncalibrated data for a spacecraft at mars.
Rebecca Castaño, Kiri Wagstaff, Steve A. Chien, Timothy M. Stough, Benyang Tang
2007Partial example acquisition in cost-sensitive learning.
Victor S. Sheng, Charles X. Ling
2007Practical guide to controlled experiments on the web: listen to your customers not to the hippo.
Ron Kohavi, Randal M. Henne, Dan Sommerfield
2007Practical learning from one-sided feedback.
D. Sculley
2007Predictive discrete latent factor models for large scale dyadic data.
Deepak Agarwal, Srujana Merugu
2007Privacy-preservation for gradient descent methods.
Li Wan, Wee Keong Ng, Shuguo Han, Vincent C. S. Lee
2007Proceedings of the 13th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Jose, California, USA, August 12-15, 2007
Pavel Berkhin, Rich Caruana, Xindong Wu
2007Raising the baseline for high-precision text classifiers.
Aleksander Kolcz, Wen-tau Yih
2007Real-time ranking with concept drift using expert advice.
Hila Becker, Marta Arias
2007Relational data pre-processing techniques for improved securities fraud detection.
Andrew S. Fast, Lisa Friedland, Marc E. Maier, Brian J. Taylor, David D. Jensen, Henry G. Goldberg, John Komoroske
2007SCAN: a structural clustering algorithm for networks.
Xiaowei Xu, Nurcan Yuruk, Zhidan Feng, Thomas A. J. Schweiger
2007Scalable look-ahead linear regression trees.
David S. Vogel, Ognian Asparouhov, Tobias Scheffer
2007Semi-supervised classification with hybrid generative/discriminative methods.
Gregory Druck, Chris Pal, Andrew McCallum, Xiaojin Zhu
2007Show me the money!: deriving the pricing power of product features by mining consumer reviews.
Nikolay Archak, Anindya Ghose, Panagiotis G. Ipeirotis
2007Statistical change detection for multi-dimensional data.
Xiuyao Song, Mingxi Wu, Christopher M. Jermaine, Sanjay Ranka
2007Stochastic processes and temporal data mining.
Paul Cotofrei, Kilian Stoffel
2007Structural and temporal analysis of the blogosphere through community factorization.
Yun Chi, Shenghuo Zhu, Xiaodan Song, Jun'ichi Tatemura, Belle L. Tseng
2007Support feature machine for classification of abnormal brain activity.
Wanpracha Art Chaovalitwongse, Ya-Ju Fan, Rajesh C. Sachdeo
2007Temporal causal modeling with graphical granger methods.
Andrew Arnold, Yan Liu, Naoki Abe
2007The minimum consistent subset cover problem and its applications in data mining.
Byron J. Gao, Martin Ester, Jin-Yi Cai, Oliver Schulte, Hui Xiong
2007Time-dependent event hierarchy construction.
Gabriel Pui Cheong Fung, Jeffrey Xu Yu, Huan Liu, Philip S. Yu
2007Tracking multiple topics for finding interesting articles.
Raymond K. Pon, Alfonso F. Cardenas, David Buttler, Terence Critchlow
2007Trajectory pattern mining.
Fosca Giannotti, Mirco Nanni, Fabio Pinelli, Dino Pedreschi
2007Truth discovery with multiple conflicting information providers on the web.
Xiaoxin Yin, Jiawei Han, Philip S. Yu
2007Use of ranked cross document evidence trails for hypothesis generation.
Rohini K. Srihari, Li Xu, Tushar Saxena
2007Using hierarchical clustering for learning theontologies used in recommendation systems.
Vincent Schickel-Zuber, Boi Faltings
2007Very sparse stable random projections for dimension reduction in
Ping Li
2007Webpage understanding: an integrated approach.
Jun Zhu, Bo Zhang, Zaiqing Nie, Ji-Rong Wen, Hsiao-Wuen Hon
2007Weighting versus pruning in rule validation for detecting network and host anomalies.
Gaurav Tandon, Philip K. Chan
2007Xproj: a framework for projected structural clustering of xml documents.
Charu C. Aggarwal, Na Ta, Jianyong Wang, Jianhua Feng, Mohammed Javeed Zaki