| 2008 | A bayesian mixture model with linear regression mixing proportions. Xiuyao Song, Chris Jermaine, Sanjay Ranka, John Gums |
| 2008 | A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances. Luh Yen, Marco Saerens, Amin Mantrach, Masashi Shimbo |
| 2008 | A sequential dual method for large scale multi-class linear svms. S. Sathiya Keerthi, S. Sundararajan, Kai-Wei Chang, Cho-Jui Hsieh, Chih-Jen Lin |
| 2008 | A software system for buzz-based recommendations. Hill Nguyen, Nish Parikh, Neel Sundaresan |
| 2008 | A unified approach for schema matching, coreference and canonicalization. Michael L. Wick, Khashayar Rohanimanesh, Karl Schultz, Andrew McCallum |
| 2008 | A visual-analytic toolkit for dynamic interaction graphs. Xintian Yang, Sitaram Asur, Srinivasan Parthasarathy, Sameep Mehta |
| 2008 | Active learning with direct query construction. Charles X. Ling, Jun Du |
| 2008 | An inductive database prototype based on virtual mining views. Hendrik Blockeel, Toon Calders, Élisa Fromont, Bart Goethals, Adriana Prado, Céline Robardet |
| 2008 | An integrated system for automatic customer satisfaction analysis in the services industry. Shantanu Godbole, Shourya Roy |
| 2008 | Angle-based outlier detection in high-dimensional data. Hans-Peter Kriegel, Matthias Schubert, Arthur Zimek |
| 2008 | Anomaly pattern detection in categorical datasets. Kaustav Das, Jeff G. Schneider, Daniel B. Neill |
| 2008 | Anonymizing transaction databases for publication. Yabo Xu, Ke Wang, Ada Wai-Chee Fu, Philip S. Yu |
| 2008 | Anticipating annotations and emerging trends in biomedical literature. Fabian Mörchen, Mathäus Dejori, Dmitriy Fradkin, Julien Etienne, Bernd Wachmann, Markus Bundschus |
| 2008 | ArnetMiner: extraction and mining of academic social networks. Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, Zhong Su |
| 2008 | Asymmetric support vector machines: low false-positive learning under the user tolerance. Shan-Hung Wu, Keng-Pei Lin, Chung-Min Chen, Ming-Syan Chen |
| 2008 | Automated cyclone discovery and tracking using knowledge sharing in multiple heterogeneous satellite data. Shen-Shyang Ho, Ashit Talukder |
| 2008 | Automatic identification of quasi-experimental designs for discovering causal knowledge. David D. Jensen, Andrew S. Fast, Brian J. Taylor, Marc E. Maier |
| 2008 | Automatic record linkage using seeded nearest neighbour and support vector machine classification. Peter Christen |
| 2008 | Banded structure in binary matrices. Gemma C. Garriga, Esa Junttila, Heikki Mannila |
| 2008 | Bridging centrality: graph mining from element level to group level. Woochang Hwang, Taehyong Kim, Murali Ramanathan, Aidong Zhang |
| 2008 | Building semantic kernels for text classification using wikipedia. Pu Wang, Carlotta Domeniconi |
| 2008 | Bypass rates: reducing query abandonment using negative inferences. Atish Das Sarma, Sreenivas Gollapudi, Samuel Ieong |
| 2008 | Can complex network metrics predict the behavior of NBA teams? Pedro O. S. Vaz de Melo, Virgílio A. F. Almeida, Antonio Alfredo Ferreira Loureiro |
| 2008 | Categorizing and mining concept drifting data streams. Peng Zhang, Xingquan Zhu, Yong Shi |
| 2008 | Classification with partial labels. Nam Nguyen, Rich Caruana |
| 2008 | Colibri: fast mining of large static and dynamic graphs. Hanghang Tong, Spiros Papadimitriou, Jimeng Sun, Philip S. Yu, Christos Faloutsos |
| 2008 | Combinational collaborative filtering for personalized community recommendation. WenYen Chen, Dong Zhang, Edward Y. Chang |
| 2008 | Community evolution in dynamic multi-mode networks. Lei Tang, Huan Liu, Jianping Zhang, Zohreh Nazeri |
| 2008 | Composition attacks and auxiliary information in data privacy. Srivatsava Ranjit Ganta, Shiva Prasad Kasiviswanathan, Adam D. Smith |
| 2008 | Constraint programming for itemset mining. Luc De Raedt, Tias Guns, Siegfried Nijssen |
| 2008 | Constructing comprehensive summaries of large event sequences. Jerry Kiernan, Evimaria Terzi |
| 2008 | Context-aware query suggestion by mining click-through and session data. Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhong Chen, Hang Li |
| 2008 | Customer targeting models using actively-selected web content. Prem Melville, Saharon Rosset, Richard D. Lawrence |
| 2008 | Cut-and-stitch: efficient parallel learning of linear dynamical systems on smps. Lei Li, Wenjie Fu, Fan Guo, Todd C. Mowry, Christos Faloutsos |
| 2008 | Cuts3vm: a fast semi-supervised svm algorithm. Bin Zhao, Fei Wang, Changshui Zhang |
| 2008 | Data mining using high performance data clouds: experimental studies using sector and sphere. Robert L. Grossman, Yunhong Gu |
| 2008 | De-duping URLs via rewrite rules. Anirban Dasgupta, Ravi Kumar, Amit Sasturkar |
| 2008 | Detecting privacy leaks using corpus-based association rules. Richard Chow, Philippe Golle, Jessica Staddon |
| 2008 | DiMaC: a disguised missing data cleaning tool. Ming Hua, Jian Pei |
| 2008 | Direct mining of discriminative and essential frequent patterns via model-based search tree. Wei Fan, Kun Zhang, Hong Cheng, Jing Gao, Xifeng Yan, Jiawei Han, Philip S. Yu, Olivier Verscheure |
| 2008 | Discrimination-aware data mining. Dino Pedreschi, Salvatore Ruggieri, Franco Turini |
| 2008 | Effective and efficient itemset pattern summarization: regression-based approaches. Ruoming Jin, Muad Abu-Ata, Yang Xiang, Ning Ruan |
| 2008 | Effective label acquisition for collective classification. Mustafa Bilgic, Lise Getoor |
| 2008 | Efficient computation of personal aggregate queries on blogs. Ka Cheung Sia, Junghoo Cho, Yun Chi, Belle L. Tseng |
| 2008 | Efficient semi-streaming algorithms for local triangle counting in massive graphs. Luca Becchetti, Paolo Boldi, Carlos Castillo, Aristides Gionis |
| 2008 | Efficient ticket routing by resolution sequence mining. Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis |
| 2008 | Entity categorization over large document collections. Venkatesh Ganti, Arnd Christian König, Rares Vernica |
| 2008 | Experimental comparison of scalable online ad serving. Gang Wu, Brendan Kitts |
| 2008 | Extracting shared subspace for multi-label classification. Shuiwang Ji, Lei Tang, Shipeng Yu, Jieping Ye |
| 2008 | FAST: a roc-based feature selection metric for small samples and imbalanced data classification problems. Xue-wen Chen, Michael Wasikowski |
| 2008 | Factorization meets the neighborhood: a multifaceted collaborative filtering model. Yehuda Koren |
| 2008 | Fast collapsed gibbs sampling for latent dirichlet allocation. Ian Porteous, David Newman, Alexander Ihler, Arthur U. Asuncion, Padhraic Smyth, Max Welling |
| 2008 | Fast logistic regression for text categorization with variable-length n-grams. Georgiana Ifrim, Gökhan H. Bakir, Gerhard Weikum |
| 2008 | Fastanova: an efficient algorithm for genome-wide association study. Xiang Zhang, Fei Zou, Wei Wang |
| 2008 | Febrl -: an open source data cleaning, deduplication and record linkage system with a graphical user interface. Peter Christen |
| 2008 | Feedback effects between similarity and social influence in online communities. David J. Crandall, Dan Cosley, Daniel P. Huttenlocher, Jon M. Kleinberg, Siddharth Suri |
| 2008 | Finding non-redundant, statistically significant regions in high dimensional data: a novel approach to projected and subspace clustering. Gabriela Moise, Jörg Sander |
| 2008 | Generating succinct titles for web URLs. Deepayan Chakrabarti, Ravi Kumar, Kunal Punera |
| 2008 | Genesis of postal address reading, current state and future prospects: thirty years of pattern recognition on duty of postal services. Udo Miletzki |
| 2008 | Get another label? improving data quality and data mining using multiple, noisy labelers. Victor S. Sheng, Foster J. Provost, Panagiotis G. Ipeirotis |
| 2008 | Heterogeneous data fusion for alzheimer's disease study. Jieping Ye, Kewei Chen, Teresa Wu, Jing Li, Zheng Zhao, Rinkal Patel, Min Bae, Ravi Janardan, Huan Liu, Gene E. Alexander, Eric Reiman |
| 2008 | Hypergraph spectral learning for multi-label classification. Liang Sun, Shuiwang Ji, Jieping Ye |
| 2008 | Identifying authoritative actors in question-answering forums: the case of Yahoo! answers. Mohamed Bouguessa, Benoît Dumoulin, Shengrui Wang |
| 2008 | Identifying biologically relevant genes via multiple heterogeneous data sources. Zheng Zhao, Jiangxin Wang, Huan Liu, Jieping Ye, Yung Chang |
| 2008 | Identifying domain expertise of developers from source code. Renuka Sindhgatta |
| 2008 | Influence and correlation in social networks. Aris Anagnostopoulos, Ravi Kumar, Mohammad Mahdian |
| 2008 | Information extraction from Wikipedia: moving down the long tail. Fei Wu, Raphael Hoffmann, Daniel S. Weld |
| 2008 | Internet advertising and optimal auction design. Benjamin Edelman, Michael Schwarz |
| 2008 | Interpretable nonnegative matrix decompositions. Saara Hyvönen, Pauli Miettinen, Evimaria Terzi |
| 2008 | Joint latent topic models for text and citations. Ramesh Nallapati, Amr Ahmed, Eric P. Xing, William W. Cohen |
| 2008 | Knowledge discovery of semantic relationships between words using nonparametric bayesian graph model. Issei Sato, Minoru Yoshida, Hiroshi Nakagawa |
| 2008 | Knowledge transfer via multiple model local structure mapping. Jing Gao, Wei Fan, Jing Jiang, Jiawei Han |
| 2008 | Land cover change detection: a case study. Shyam Boriah, Vipin Kumar, Michael S. Steinbach, Christopher Potter, Steven A. Klooster |
| 2008 | Large scale data analysis and modelling in online services and advertising. Thore Graepel, Ralf Herbrich |
| 2008 | Learning classifiers from only positive and unlabeled data. Charles Elkan, Keith Noto |
| 2008 | Learning from multi-topic web documents for contextual advertisement. Yi Zhang, Arun C. Surendran, John C. Platt, Mukund Narasimhan |
| 2008 | Learning methods for lung tumor markerless gating in image-guided radiotherapy. Ying Cui, Jennifer G. Dy, Gregory C. Sharp, Brian M. Alexander, Steve B. Jiang |
| 2008 | Learning subspace kernels for classification. Jianhui Chen, Shuiwang Ji, Betul Ceran, Qi Li, Mingrui Wu, Jieping Ye |
| 2008 | Local peculiarity factor and its application in outlier detection. Jian Yang, Ning Zhong, Yiyu Yao, Jue Wang |
| 2008 | Locality sensitive hash functions based on concomitant rank order statistics. Kave Eshghi, Shyamsundar Rajaram |
| 2008 | Microscopic evolution of social networks. Jure Leskovec, Lars Backstrom, Ravi Kumar, Andrew Tomkins |
| 2008 | Mining adaptively frequent closed unlabeled rooted trees in data streams. Albert Bifet, Ricard Gavaldà |
| 2008 | Mining multi-faceted overviews of arbitrary topics in a text collection. Xu Ling, Qiaozhu Mei, ChengXiang Zhai, Bruce R. Schatz |
| 2008 | Mining preferences from superior and inferior examples. Bin Jiang, Jian Pei, Xuemin Lin, David W. Cheung, Jiawei Han |
| 2008 | Mobile call graphs: beyond power-law and lognormal distributions. Mukund Seshadri, Sridhar Machiraju, Ashwin Sridharan, Jean Bolot, Christos Faloutsos, Jure Leskovec |
| 2008 | Model-based document clustering with a collapsed gibbs sampler. Daniel David Walker, Eric K. Ringger |
| 2008 | Morpheus: interactive exploration of subspace clustering. Emmanuel Müller, Ira Assent, Ralph Krieger, Timm Jansen, Thomas Seidl |
| 2008 | Multi-class cost-sensitive boosting with p-norm loss functions. Aurélie C. Lozano, Naoki Abe |
| 2008 | On updates that constrain the features' connections during learning. Omid Madani, Jian Huang |
| 2008 | Partial least squares regression for graph mining. Hiroto Saigo, Nicole Krämer, Koji Tsuda |
| 2008 | Partitioned logistic regression for spam filtering. Ming-Wei Chang, Wen-tau Yih, Christopher Meek |
| 2008 | Pattern-Miner: integrated management and mining over data mining models. Evangelos E. Kotsifakos, Irene Ntoutsi, Yannis Vrahoritis, Yannis Theodoridis |
| 2008 | Permu-pattern: discovery of mutable permutation patterns with proximity constraint. Meng Hu, Jiong Yang, Wei Su |
| 2008 | Pictor: an interactive system for importing data from a website. Shuyi Zheng, Matthew R. Scott, Ruihua Song, Ji-Rong Wen |
| 2008 | Privacy-preserving cox regression for survival analysis. Shipeng Yu, Glenn Fung, Rómer Rosales, Sriram Krishnan, R. Bharat Rao, Cary Dehing-Oberije, Philippe Lambin |
| 2008 | Probabilistic latent semantic visualization: topic model for visualizing documents. Tomoharu Iwata, Takeshi Yamada, Naonori Ueda |
| 2008 | Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, August 24-27, 2008 Ying Li, Bing Liu, Sunita Sarawagi |
| 2008 | Quantitative evaluation of approximate frequent pattern mining algorithms. Rohit Gupta, Gang Fang, Blayne Field, Michael S. Steinbach, Vipin Kumar |
| 2008 | Reconstructing chemical reaction networks: data mining meets system identification. Yong Ju Cho, Naren Ramakrishnan, Yang Cao |
| 2008 | Regularization paths and coordinate descent. Trevor Hastie, Jerome H. Friedman, Robert Tibshirani |
| 2008 | Relational learning via collective matrix factorization. Ajit Paul Singh, Geoffrey J. Gordon |
| 2008 | SAIL: summation-based incremental learning for information-theoretic clustering. Junjie Wu, Hui Xiong, Jian Chen |
| 2008 | SPIRAL: efficient and exact model identification for hidden Markov models. Yasuhiro Fujiwara, Yasushi Sakurai, Masashi Yamamuro |
| 2008 | Scalable and near real-time burst detection from eCommerce queries. Nish Parikh, Neel Sundaresan |
| 2008 | Scaling up text classification for large file systems. George Forman, Shyamsundar Rajaram |
| 2008 | Semi-supervised approach to rapid and reliable labeling of large data sets. György J. Simon, Vipin Kumar, Zhi-Li Zhang |
| 2008 | Semi-supervised learning with data calibration for long-term time series forecasting. Haibin Cheng, Pang-Ning Tan |
| 2008 | Simultaneous tensor subspace selection and clustering: the equivalence of high order svd and k-means clustering. Heng Huang, Chris H. Q. Ding, Dijun Luo, Tao Li |
| 2008 | Social networks: looking ahead. Ravi Kumar, Alexander Tuzhilin, Christos Faloutsos, David D. Jensen, Gueorgi Kossinets, Jure Leskovec, Andrew Tomkins |
| 2008 | Spectral domain-transfer learning. Xiao Ling, Wenyuan Dai, Gui-Rong Xue, Qiang Yang, Yong Yu |
| 2008 | Spotting out emerging artists using geo-aware analysis of P2P query strings. Noam Koenigstein, Yuval Shavitt, Tomer Tankel |
| 2008 | Stable feature selection via dense feature groups. Lei Yu, Chris H. Q. Ding, Steven Loscalzo |
| 2008 | Stream prediction using a generative model based on frequent episodes in event sequences. Srivatsan Laxman, Vikram Tankasali, Ryen W. White |
| 2008 | Structured entity identification and document categorization: two tasks with one joint model. Indrajit Bhattacharya, Shantanu Godbole, Sachindra Joshi |
| 2008 | Structured learning for non-smooth ranking losses. Soumen Chakrabarti, Rajiv Khanna, Uma Sawant, Chiru Bhattacharyya |
| 2008 | Structured metric learning for high dimensional problems. Jason V. Davis, Inderjit S. Dhillon |
| 2008 | Succinct summarization of transactional databases: an overlapped hyperrectangle scheme. Yang Xiang, Ruoming Jin, David Fuhry, Feodor F. Dragan |
| 2008 | Tagmark: reliable estimations of RFID tags for business processes. Leonardo Weiss Ferreira Chaves, Erik Buchmann, Klemens Böhm |
| 2008 | Temporal pattern discovery for trends and transient effects: its application to patient records. G. Niklas Norén, Andrew Bate, Johan Hopstadius, Kristina Star, I. Ralph Edwards |
| 2008 | Text classification, business intelligence, and interactivity: automating C-Sat analysis for services industry. Shantanu Godbole, Shourya Roy |
| 2008 | The cost of privacy: destruction of data-mining utility in anonymized data publishing. Justin Brickell, Vitaly Shmatikov |
| 2008 | The future of image search. Jitendra Malik |
| 2008 | The persuasive phase of visualization. Christine H. Chih, Douglas Stott Parker Jr. |
| 2008 | The structure of information pathways in a social communication network. Gueorgi Kossinets, Jon M. Kleinberg, Duncan J. Watts |
| 2008 | Topical query decomposition. Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis |
| 2008 | Training structural svms with kernels using sampled cuts. Chun-Nam John Yu, Thorsten Joachims |
| 2008 | Unsupervised deduplication using cross-field dependencies. Robert J. Hall, Charles Sutton, Andrew McCallum |
| 2008 | Unsupervised feature selection for principal components analysis. Christos Boutsidis, Michael W. Mahoney, Petros Drineas |
| 2008 | Using Luigi Di Caro, K. Selçuk Candan, Maria Luisa Sapino |
| 2008 | Using ghost edges for classification in sparsely labeled networks. Brian Gallagher, Hanghang Tong, Tina Eliassi-Rad, Christos Faloutsos |
| 2008 | Using predictive analysis to improve invoice-to-cash collection. Sai Zeng, Prem Melville, Christian A. Lang, Ioana M. Boier-Martin, Conrad Murphy |
| 2008 | Volatile correlation computation: a checkpoint view. Wenjun Zhou, Hui Xiong |
| 2008 | Weighted graphs and disconnected components: patterns and a generator. Mary McGlohon, Leman Akoglu, Christos Faloutsos |