| 2009 | A LRT framework for fast spatial anomaly detection. Mingxi Wu, Xiuyao Song, Chris Jermaine, Sanjay Ranka, John Gums |
| 2009 | A case study of behavior-driven conjoint analysis on Yahoo!: front page today module. Wei Chu, Seung-Taek Park, Todd Beaupre, Nitin Motgi, Amit Phadke, Seinjuti Chakraborty, Joe Zachariah |
| 2009 | A generalized Co-HITS algorithm and its application to bipartite graphs. Hongbo Deng, Michael R. Lyu, Irwin King |
| 2009 | A multi-relational approach to spatial classification. Richard Frank, Martin Ester, Arno J. Knobbe |
| 2009 | A principled and flexible framework for finding alternative clusterings. Zijie Qi, Ian Davidson |
| 2009 | A viewpoint-based approach for interaction graph analysis. Sitaram Asur, Srinivasan Parthasarathy |
| 2009 | Adapting the right measures for K-means clustering. Junjie Wu, Hui Xiong, Jian Chen |
| 2009 | Address standardization with latent semantic association. Honglei Guo, Huijia Zhu, Zhili Guo, Xiaoxun Zhang, Zhong Su |
| 2009 | An association analysis approach to biclustering. Gaurav Pandey, Gowtham Atluri, Michael S. Steinbach, Chad L. Myers, Vipin Kumar |
| 2009 | Analyzing patterns of user content generation in online social networks. Lei Guo, Enhua Tan, Songqing Chen, Xiaodong Zhang, Yihong Eric Zhao |
| 2009 | Anomalous window discovery through scan statistics for linear intersecting paths (SSLIP). Lei Shi, Vandana Pursnani Janeja |
| 2009 | Anonymizing healthcare data: a case study on the blood transfusion service. Noman Mohammed, Benjamin C. M. Fung, Patrick C. K. Hung, Cheuk-kwong Lee |
| 2009 | Applying syntactic similarity algorithms for enterprise information management. Ludmila Cherkasova, Kave Eshghi, Charles B. Morrey III, Joseph A. Tucek, Alistair C. Veitch |
| 2009 | Audience selection for on-line brand advertising: privacy-friendly social network targeting. Foster J. Provost, Brian Dalessandro, Rod Hook, Xiaohan Zhang, Alan Murray |
| 2009 | Augmenting the generalized Hough transform to enable the mining of petroglyphs. Qiang Zhu, Xiaoyue Wang, Eamonn J. Keogh, Sang-Hee Lee |
| 2009 | BBM: Bayesian browsing model from petabyte-scale data. Chao Liu, Fan Guo, Christos Faloutsos |
| 2009 | BGP-lens: patterns and anomalies in internet routing updates. B. Aditya Prakash, Nicholas Valler, David G. Andersen, Michalis Faloutsos, Christos Faloutsos |
| 2009 | Beyond blacklists: learning to detect malicious web sites from suspicious URLs. Justin Ma, Lawrence K. Saul, Stefan Savage, Geoffrey M. Voelker |
| 2009 | COA: finding novel patents through text analysis. Mohammad Al Hasan, W. Scott Spangler, Thomas D. Griffin, Alfredo Alba |
| 2009 | CP-summary: a concise representation for browsing frequent itemsets. Ardian Kristanto Poernomo, Vivekanand Gopalkrishnan |
| 2009 | Can we learn a template-independent wrapper for news article extraction from a single training site? Junfeng Wang, Chun Chen, Can Wang, Jian Pei, Jiajun Bu, Ziyu Guan, Wei Vivian Zhang |
| 2009 | Cartesian contour: a concise representation for a collection of frequent sets. Ruoming Jin, Yang Xiang, Lin Liu |
| 2009 | Catching the drift: learning broad matches from clickthrough data. Sonal Gupta, Mikhail Bilenko, Matthew Richardson |
| 2009 | Category detection using hierarchical mean shift. Pavan Vatturi, Weng-Keen Wong |
| 2009 | Causality quantification and its applications: structuring and modeling of multivariate time series. Takashi Shibuya, Tatsuya Harada, Yasuo Kuniyoshi |
| 2009 | Characteristic relational patterns. Arne Koopman, Arno Siebes |
| 2009 | Characterizing individual communication patterns. R. Dean Malmgren, Jake M. Hofman, Luís A. Nunes Amaral, Duncan J. Watts |
| 2009 | Classification of software behaviors for failure detection: a discriminative pattern mining approach. David Lo, Hong Cheng, Jiawei Han, Siau-Cheng Khoo, Chengnian Sun |
| 2009 | Clustering event logs using iterative partitioning. Adetokunbo Makanju, Nur Zincir-Heywood, Evangelos E. Milios |
| 2009 | Co-clustering on manifolds. Quanquan Gu, Jie Zhou |
| 2009 | Co-evolution of social and affiliation networks. Elena Zheleva, Hossam Sharara, Lise Getoor |
| 2009 | CoCo: coding cost for parameter-free outlier detection. Christian Böhm, Katrin Haegler, Nikola S. Müller, Claudia Plant |
| 2009 | Collaborative filtering with temporal dynamics. Yehuda Koren |
| 2009 | Collective annotation of Wikipedia entities in web text. Sayali Kulkarni, Amit Singh, Ganesh Ramakrishnan, Soumen Chakrabarti |
| 2009 | Collusion-resistant anonymous data collection method. Mafruz Zaman Ashrafi, See-Kiong Ng |
| 2009 | Combining link and content for community detection: a discriminative approach. Tianbao Yang, Rong Jin, Yun Chi, Shenghuo Zhu |
| 2009 | Connections between the lines: augmenting social networks with text. Jonathan D. Chang, Jordan L. Boyd-Graber, David M. Blei |
| 2009 | Consensus group stable feature selection. Steven Loscalzo, Lei Yu, Chris H. Q. Ding |
| 2009 | Constant-factor approximation algorithms for identifying dynamic communities. Chayant Tantipathananandh, Tanya Y. Berger-Wolf |
| 2009 | Constrained optimization for validation-guided conditional random field learning. Minmin Chen, Yixin Chen, Michael R. Brent, Aaron E. Tenney |
| 2009 | Correlated itemset mining in ROC space: a constraint programming approach. Siegfried Nijssen, Tias Guns, Luc De Raedt |
| 2009 | Cross domain distribution adaptation via kernel mapping. Erheng Zhong, Wei Fan, Jing Peng, Kun Zhang, Jiangtao Ren, Deepak S. Turaga, Olivier Verscheure |
| 2009 | DOULION: counting triangles in massive graphs with a coin. Charalampos E. Tsourakakis, U Kang, Gary L. Miller, Christos Faloutsos |
| 2009 | Data mining at NASA: from theory to applications. Ashok N. Srivastava |
| 2009 | Detection of unique temporal segments by information theoretic meta-clustering. Shin Ando, Einoshin Suzuki |
| 2009 | Differentially Private Recommender Systems: Building Privacy into the Netflix Prize Contenders. Frank McSherry, Ilya Mironov |
| 2009 | Drosophila gene expression pattern annotation using sparse features and term-term interactions. Shuiwang Ji, Lei Yuan, Ying-Xin Li, Zhi-Hua Zhou, Sudhir Kumar, Jieping Ye |
| 2009 | DynaMMo: mining and summarization of coevolving sequences with missing values. Lei Li, James McCann, Nancy S. Pollard, Christos Faloutsos |
| 2009 | Effective multi-label active learning for text classification. Bishan Yang, Jian-Tao Sun, Tengjiao Wang, Zheng Chen |
| 2009 | Efficient anomaly monitoring over moving object trajectory streams. Yingyi Bu, Lei Chen, Ada Wai-Chee Fu, Dawei Liu |
| 2009 | Efficient influence maximization in social networks. Wei Chen, Yajun Wang, Siyu Yang |
| 2009 | Efficient methods for topic model inference on streaming document collections. Limin Yao, David M. Mimno, Andrew McCallum |
| 2009 | Efficiently learning the accuracy of labeling sources for selective sampling. Pinar Donmez, Jaime G. Carbonell, Jeff G. Schneider |
| 2009 | Enabling analysts in managed services for CRM analytics. Indrajit Bhattacharya, Shantanu Godbole, Ajay Gupta, Ashish Verma, Jeff Achtermann, Kevin English |
| 2009 | Entity discovery and assignment for opinion mining applications. Xiaowen Ding, Bing Liu, Lei Zhang |
| 2009 | Exploiting Wikipedia as external knowledge for document clustering. Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, Xiaohua Zhou |
| 2009 | Exploring social tagging graph for web object classification. Zhijun Yin, Rui Li, Qiaozhu Mei, Jiawei Han |
| 2009 | Extracting discriminative concepts for domain adaptation in text mining. Bo Chen, Wai Lam, Ivor W. Tsang, Tak-Lam Wong |
| 2009 | Fast approximate spectral clustering. Donghui Yan, Ling Huang, Michael I. Jordan |
| 2009 | Feature shaping for linear SVM classifiers. George Forman, Martin Scholz, Shyamsundar Rajaram |
| 2009 | Finding a team of experts in social networks. Theodoros Lappas, Kun Liu, Evimaria Terzi |
| 2009 | Frequent pattern mining with uncertain data. Charu C. Aggarwal, Yan Li, Jianyong Wang, Jing Wang |
| 2009 | Genre-based decomposition of email class noise. Aleksander Kolcz, Gordon V. Cormack |
| 2009 | Grocery shopping recommendations based on basket-sensitive random walk. Ming Li, M. Benjamin Dias, Ian H. Jarman, Wael El-Deredy, Paulo J. G. Lisboa |
| 2009 | Grouped graphical Granger modeling methods for temporal causal modeling. Aurélie C. Lozano, Naoki Abe, Yan Liu, Saharon Rosset |
| 2009 | Heterogeneous source consensus learning via decision propagation and negotiation. Jing Gao, Wei Fan, Yizhou Sun, Jiawei Han |
| 2009 | Improving classification accuracy using automatically extracted training data. Ariel Fuxman, Anitha Kannan, Andrew B. Goldberg, Rakesh Agrawal, Panayiotis Tsaparas, John C. Shafer |
| 2009 | Improving clustering stability with combinatorial MRFs. Ron Bekkerman, Martin Scholz, Krishnamurthy Viswanathan |
| 2009 | Improving data mining utility with projective sampling. Mark Last |
| 2009 | Incorporating site-level knowledge for incremental crawling of web forums: a list-wise strategy. Jiang-Ming Yang, Rui Cai, Chunsong Wang, Hua Huang, Lei Zhang, Wei-Ying Ma |
| 2009 | Information theoretic regularization for semi-supervised boosting. Lei Zheng, Shaojun Wang, Yan Liu, Chi-Hoon Lee |
| 2009 | Intelligent file scoring system for malware detection from the gray list. Yanfang Ye, Tao Li, Qingshan Jiang, Zhixue Han, Li Wan |
| 2009 | Issues in evaluation of stream learning algorithms. João Gama, Raquel Sebastião, Pedro Pereira Rodrigues |
| 2009 | Large human communication networks: patterns and a utility-driven generator. Nan Du, Christos Faloutsos, Bai Wang, Leman Akoglu |
| 2009 | Large-scale behavioral targeting. Ye Chen, Dmitry Pavlov, John F. Canny |
| 2009 | Large-scale graph mining using backbone refinement classes. Andreas Maunz, Christoph Helma, Stefan Kramer |
| 2009 | Large-scale sparse logistic regression. Jun Liu, Jianhui Chen, Jieping Ye |
| 2009 | Learning dynamic temporal graphs for oil-production equipment monitoring system. Yan Liu, Jayant R. Kalagnanam, Oivind Johnsen |
| 2009 | Learning optimal ranking with tensor factorization for tag recommendation. Steffen Rendle, Leandro Balby Marinho, Alexandros Nanopoulos, Lars Schmidt-Thieme |
| 2009 | Learning patterns in the dynamics of biological networks. Chang Hun You, Lawrence B. Holder, Diane J. Cook |
| 2009 | Learning with a non-exhaustive training dataset: a case study: detection of bacteria cultures using optical-scattering technology. Murat Dundar, E. Daniel Hirleman, Arun K. Bhunia, J. Paul Robinson, Bartek Rajwa |
| 2009 | Learning, indexing, and diagnosing network faults. Ting Wang, Mudhakar Srivatsa, Dakshi Agrawal, Ling Liu |
| 2009 | Measuring the effects of preprocessing decisions and network forces in dynamic network analysis. Jerry Scripps, Pang-Ning Tan, Abdol-Hossein Esfahanian |
| 2009 | Meme-tracking and the dynamics of the news cycle. Jure Leskovec, Lars Backstrom, Jon M. Kleinberg |
| 2009 | MetaFac: community discovery via relational hypergraph factorization. Yu-Ru Lin, Jimeng Sun, Paul C. Castro, Ravi B. Konuru, Hari Sundaram, Aisling Kelliher |
| 2009 | Migration motif: a spatial-temporal pattern mining approach for financial markets. Xiaoxi Du, Ruoming Jin, Liang Ding, Victor E. Lee, John H. Thornton Jr. |
| 2009 | Mind the gaps: weighting the unknown in large-scale one-class collaborative filtering. Rong Pan, Martin Scholz |
| 2009 | Mining brain region connectivity for Alzheimer's disease study via sparse inverse covariance estimation. Liang Sun, Rinkal Patel, Jun Liu, Kewei Chen, Teresa Wu, Jing Li, Eric Reiman, Jieping Ye |
| 2009 | Mining broad latent query aspects from search sessions. Xuanhui Wang, Deepayan Chakrabarti, Kunal Punera |
| 2009 | Mining discrete patterns via binary matrix factorization. Bao-Hong Shen, Shuiwang Ji, Jieping Ye |
| 2009 | Mining for the most certain predictions from dyadic data. Meghana Deodhar, Joydeep Ghosh |
| 2009 | Mining rich session context to improve web search. Guangyu Zhu, Gilad Mishne |
| 2009 | Mining social networks for personalized email prioritization. Shinjae Yoo, Yiming Yang, Frank Lin, Il-Chul Moon |
| 2009 | Mining web logs: applications and challenges. Ravi Kumar |
| 2009 | Mismatched models, wrong results, and dreadful decisions: on choosing appropriate data mining tools. David J. Hand |
| 2009 | Modeling and predicting user behavior in sponsored search. Josh Attenberg, Sandeep Pandey, Torsten Suel |
| 2009 | Multi-focal learning and its application to customer service support. Yong Ge, Hui Xiong, Wenjun Zhou, Ramendra K. Sahoo, Xiaofeng Gao, Weili Wu |
| 2009 | Name-ethnicity classification from open sources. Anurag Ambekar, Charles B. Ward, Jahangir Mohammed, Swapna Male, Steven Skiena |
| 2009 | Named entity mining from click-through data using weakly supervised latent Dirichlet allocation. Gu Xu, Shuang-Hong Yang, Hang Li |
| 2009 | Network anomaly detection based on Eigen equation compression. Shunsuke Hirose, Kenji Yamanishi, Takayuki Nakata, Ryohei Fujimaki |
| 2009 | Network science: an introduction to recent statistical approaches. Stanley Wasserman |
| 2009 | New ensemble methods for evolving data streams. Albert Bifet, Geoffrey Holmes, Bernhard Pfahringer, Richard Kirkby, Ricard Gavaldà |
| 2009 | OLAP on search logs: an infrastructure supporting data-driven applications in search engines. Bin Zhou, Daxin Jiang, Jian Pei, Hang Li |
| 2009 | On burstiness-aware search for document sequences. Theodoros Lappas, Benjamin Arai, Manolis Platakis, Dimitrios Kotsakos, Dimitrios Gunopulos |
| 2009 | On compressing social networks. Flavio Chierichetti, Ravi Kumar, Silvio Lattanzi, Michael Mitzenmacher, Alessandro Panconesi, Prabhakar Raghavan |
| 2009 | On the tradeoff between privacy and utility in data publishing. Tiancheng Li, Ninghui Li |
| 2009 | Open standards and cloud computing: KDD-2009 panel report. Michael Zeller, Robert Grossman, Christoph Lingenfelder, Michael R. Berthold, Erik Marcadé, Rick Pechter, Mike Hoskins, Wayne Thompson, Rich Holada |
| 2009 | OpinionMiner: a novel machine learning system for web opinion mining and extraction. Wei Jin, Hung Hay Ho, Rohini K. Srihari |
| 2009 | Optimizing web traffic via the media scheduling problem. Lars Backstrom, Jon M. Kleinberg, Ravi Kumar |
| 2009 | PSkip: estimating relevance ranking quality from web search clickthrough data. Kuansan Wang, Toby Walker, Zijian Zheng |
| 2009 | Parallel community detection on large networks with propinquity dynamics. Yuzhou Zhang, Jianyong Wang, Yi Wang, Lizhu Zhou |
| 2009 | Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data. Srivatsava Daruru, Nena M. Marin, Matt Walker, Joydeep Ghosh |
| 2009 | Predicting bounce rates in sponsored search advertisements. D. Sculley, Robert G. Malkin, Sugato Basu, Roberto J. Bayardo |
| 2009 | Primal sparse Max-margin Markov networks. Jun Zhu, Eric P. Xing, Bo Zhang |
| 2009 | Probabilistic frequent itemset mining in uncertain databases. Thomas Bernecker, Hans-Peter Kriegel, Matthias Renz, Florian Verhein, Andreas Züfle |
| 2009 | Proceedings of the 15th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Paris, France, June 28 - July 1, 2009. John F. Elder IV, Françoise Fogelman-Soulié, Peter A. Flach, Mohammed Javeed Zaki |
| 2009 | Quantification and semi-supervised classification methods for handling changes in class distribution. Jack Chongjie Xue, Gary M. Weiss |
| 2009 | Query result clustering for object-level search. Jongwuk Lee, Seung-won Hwang, Zaiqing Nie, Ji-Rong Wen |
| 2009 | Randomization methods in data mining. Heikki Mannila |
| 2009 | Ranking-based clustering of heterogeneous information networks with star network schema. Yizhou Sun, Yintao Yu, Jiawei Han |
| 2009 | Regression-based latent factor models. Deepak Agarwal, Bee-Chung Chen |
| 2009 | Regret-based online ranking for a growing digital library. Erick Delage |
| 2009 | Relational learning via latent social dimensions. Lei Tang, Huan Liu |
| 2009 | SNARE: a link analytic system for graph labeling and risk detection. Mary McGlohon, Stephen Bay, Markus G. Anderle, David M. Steier, Christos Faloutsos |
| 2009 | Scalable graph clustering using stochastic flows: applications to community discovery. Venu Satuluri, Srinivasan Parthasarathy |
| 2009 | Scalable pseudo-likelihood estimation in hybrid random fields. Antonino Freno, Edmondo Trentin, Marco Gori |
| 2009 | Sentiment analysis of blogs by combining lexical knowledge with text classification. Prem Melville, Wojciech Gryc, Richard D. Lawrence |
| 2009 | Seven pitfalls to avoid when running controlled experiments on the web. Thomas Crook, Brian Frasca, Ron Kohavi, Roger Longbotham |
| 2009 | Social influence analysis in large-scale networks. Jie Tang, Jimeng Sun, Chi Wang, Zi Yang |
| 2009 | Spatial-temporal causal modeling for climate change attribution. Aurélie C. Lozano, Hongfei Li, Alexandru Niculescu-Mizil, Yan Liu, Claudia Perlich, Jonathan R. M. Hosking, Naoki Abe |
| 2009 | Structured correspondence topic models for mining captioned figures in biological literature. Amr Ahmed, Eric P. Xing, William W. Cohen, Robert F. Murphy |
| 2009 | Sustainable operation and management of data center chillers using temporal data mining. Debprakash Patnaik, Manish Marwah, Ratnesh K. Sharma, Naren Ramakrishnan |
| 2009 | TANGENT: a novel, 'Surprise me', recommendation algorithm. Kensuke Onuma, Hanghang Tong, Christos Faloutsos |
| 2009 | Tell me something I don't know: randomization strategies for iterative data mining. Sami Hanhijärvi, Markus Ojala, Niko Vuokko, Kai Puolamäki, Nikolaj Tatti, Heikki Mannila |
| 2009 | Temporal mining for interactive workflow data analysis. Michele Berlingerio, Fabio Pinelli, Mirco Nanni, Fosca Giannotti |
| 2009 | The offset tree for learning with partial labels. Alina Beygelzimer, John Langford |
| 2009 | Time series shapelets: a new primitive for data mining. Lexiang Ye, Eamonn J. Keogh |
| 2009 | Toward autonomic grids: analyzing the job flow with affinity streaming. Xiangliang Zhang, Cyril Furtlehner, Julien Perez, Cécile Germain-Renaud, Michèle Sebag |
| 2009 | Towards a universal marketplace over the web: statistical multi-label classification of service provider forms with simulated annealing. Kivanc M. Ozonat, Donald Young |
| 2009 | Towards combining web classification and web information extraction: a case study. Ping Luo, Fen Lin, Yuhong Xiong, Yong Zhao, Zhongzhi Shi |
| 2009 | Towards efficient mining of proportional fault-tolerant frequent itemsets. Ardian Kristanto Poernomo, Vivekanand Gopalkrishnan |
| 2009 | Turning down the noise in the blogosphere. Khalid El-Arini, Gaurav Veda, Dafna Shahaf, Carlos Guestrin |
| 2009 | User grouping behavior in online forums. Xiaolin Shi, Jun Zhu, Rui Cai, Lei Zhang |
| 2009 | Using graph-based metrics with empirical risk minimization to speed up active learning on networked data. Sofus A. Macskassy |
| 2009 | WhereNext: a location predictor on trajectory pattern mining. Anna Monreale, Fabio Pinelli, Roberto Trasarti, Fosca Giannotti |