KDD A*

133 papers

Year Title / Authors
2008 A Bayesian mixture model with linear regression mixing proportions.
Xiuyao Song, Chris Jermaine, Sanjay Ranka, John Gums
2008 A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances.
Luh Yen, Marco Saerens, Amin Mantrach, Masashi Shimbo
2008 A sequential dual method for large scale multi-class linear SVMs.
S. Sathiya Keerthi, S. Sundararajan, Kai-Wei Chang, Cho-Jui Hsieh, Chih-Jen Lin
2008 A software system for buzz-based recommendations.
Hill Nguyen, Nish Parikh, Neel Sundaresan
2008 A unified approach for schema matching, coreference and canonicalization.
Michael L. Wick, Khashayar Rohanimanesh, Karl Schultz, Andrew McCallum
2008 A visual-analytic toolkit for dynamic interaction graphs.
Xintian Yang, Sitaram Asur, Srinivasan Parthasarathy, Sameep Mehta
2008 Active learning with direct query construction.
Charles X. Ling, Jun Du
2008 An inductive database prototype based on virtual mining views.
Hendrik Blockeel, Toon Calders, Élisa Fromont, Bart Goethals, Adriana Prado, Céline Robardet
2008 An integrated system for automatic customer satisfaction analysis in the services industry.
Shantanu Godbole, Shourya Roy
2008 Angle-based outlier detection in high-dimensional data.
Hans-Peter Kriegel, Matthias Schubert, Arthur Zimek
2008 Anomaly pattern detection in categorical datasets.
Kaustav Das, Jeff G. Schneider, Daniel B. Neill
2008 Anonymizing transaction databases for publication.
Yabo Xu, Ke Wang, Ada Wai-Chee Fu, Philip S. Yu
2008 Anticipating annotations and emerging trends in biomedical literature.
Fabian Mörchen, Mathäus Dejori, Dmitriy Fradkin, Julien Etienne, Bernd Wachmann, Markus Bundschus
2008 ArnetMiner: extraction and mining of academic social networks.
Jie Tang, Jing Zhang, Limin Yao, Juanzi Li, Li Zhang, Zhong Su
2008 Asymmetric support vector machines: low false-positive learning under the user tolerance.
Shan-Hung Wu, Keng-Pei Lin, Chung-Min Chen, Ming-Syan Chen
2008 Automated cyclone discovery and tracking using knowledge sharing in multiple heterogeneous satellite data.
Shen-Shyang Ho, Ashit Talukder
2008 Automatic identification of quasi-experimental designs for discovering causal knowledge.
David D. Jensen, Andrew S. Fast, Brian J. Taylor, Marc E. Maier
2008 Automatic record linkage using seeded nearest neighbour and support vector machine classification.
Peter Christen
2008 Banded structure in binary matrices.
Gemma C. Garriga, Esa Junttila, Heikki Mannila
2008 Bridging centrality: graph mining from element level to group level.
Woochang Hwang, Taehyong Kim, Murali Ramanathan, Aidong Zhang
2008 Building semantic kernels for text classification using Wikipedia.
Pu Wang, Carlotta Domeniconi
2008 Bypass rates: reducing query abandonment using negative inferences.
Atish Das Sarma, Sreenivas Gollapudi, Samuel Ieong
2008 Can complex network metrics predict the behavior of NBA teams?
Pedro O. S. Vaz de Melo, Virgílio A. F. Almeida, Antonio Alfredo Ferreira Loureiro
2008 Categorizing and mining concept drifting data streams.
Peng Zhang, Xingquan Zhu, Yong Shi
2008 Classification with partial labels.
Nam Nguyen, Rich Caruana
2008 Colibri: fast mining of large static and dynamic graphs.
Hanghang Tong, Spiros Papadimitriou, Jimeng Sun, Philip S. Yu, Christos Faloutsos
2008 Combinational collaborative filtering for personalized community recommendation.
WenYen Chen, Dong Zhang, Edward Y. Chang
2008 Community evolution in dynamic multi-mode networks.
Lei Tang, Huan Liu, Jianping Zhang, Zohreh Nazeri
2008 Composition attacks and auxiliary information in data privacy.
Srivatsava Ranjit Ganta, Shiva Prasad Kasiviswanathan, Adam D. Smith
2008 Constraint programming for itemset mining.
Luc De Raedt, Tias Guns, Siegfried Nijssen
2008 Constructing comprehensive summaries of large event sequences.
Jerry Kiernan, Evimaria Terzi
2008 Context-aware query suggestion by mining click-through and session data.
Huanhuan Cao, Daxin Jiang, Jian Pei, Qi He, Zhen Liao, Enhong Chen, Hang Li
2008 Customer targeting models using actively-selected web content.
Prem Melville, Saharon Rosset, Richard D. Lawrence
2008 Cut-and-stitch: efficient parallel learning of linear dynamical systems on SMPs.
Lei Li, Wenjie Fu, Fan Guo, Todd C. Mowry, Christos Faloutsos
2008 CutS3VM: a fast semi-supervised SVM algorithm.
Bin Zhao, Fei Wang, Changshui Zhang
2008 Data mining using high performance data clouds: experimental studies using Sector and Sphere.
Robert L. Grossman, Yunhong Gu
2008 De-duping URLs via rewrite rules.
Anirban Dasgupta, Ravi Kumar, Amit Sasturkar
2008 Detecting privacy leaks using corpus-based association rules.
Richard Chow, Philippe Golle, Jessica Staddon
2008 DiMaC: a disguised missing data cleaning tool.
Ming Hua, Jian Pei
2008 Direct mining of discriminative and essential frequent patterns via model-based search tree.
Wei Fan, Kun Zhang, Hong Cheng, Jing Gao, Xifeng Yan, Jiawei Han, Philip S. Yu, Olivier Verscheure
2008 Discrimination-aware data mining.
Dino Pedreschi, Salvatore Ruggieri, Franco Turini
2008 Effective and efficient itemset pattern summarization: regression-based approaches.
Ruoming Jin, Muad Abu-Ata, Yang Xiang, Ning Ruan
2008 Effective label acquisition for collective classification.
Mustafa Bilgic, Lise Getoor
2008 Efficient computation of personal aggregate queries on blogs.
Ka Cheung Sia, Junghoo Cho, Yun Chi, Belle L. Tseng
2008 Efficient semi-streaming algorithms for local triangle counting in massive graphs.
Luca Becchetti, Paolo Boldi, Carlos Castillo, Aristides Gionis
2008 Efficient ticket routing by resolution sequence mining.
Qihong Shao, Yi Chen, Shu Tao, Xifeng Yan, Nikos Anerousis
2008 Entity categorization over large document collections.
Venkatesh Ganti, Arnd Christian König, Rares Vernica
2008 Experimental comparison of scalable online ad serving.
Gang Wu, Brendan Kitts
2008 Extracting shared subspace for multi-label classification.
Shuiwang Ji, Lei Tang, Shipeng Yu, Jieping Ye
2008 FAST: a ROC-based feature selection metric for small samples and imbalanced data classification problems.
Xue-wen Chen, Michael Wasikowski
2008 Factorization meets the neighborhood: a multifaceted collaborative filtering model.
Yehuda Koren
2008 Fast collapsed Gibbs sampling for latent Dirichlet allocation.
Ian Porteous, David Newman, Alexander Ihler, Arthur U. Asuncion, Padhraic Smyth, Max Welling
2008 Fast logistic regression for text categorization with variable-length n-grams.
Georgiana Ifrim, Gökhan H. Bakir, Gerhard Weikum
2008 FastANOVA: an efficient algorithm for genome-wide association study.
Xiang Zhang, Fei Zou, Wei Wang
2008 Febrl: an open source data cleaning, deduplication and record linkage system with a graphical user interface.
Peter Christen
2008 Feedback effects between similarity and social influence in online communities.
David J. Crandall, Dan Cosley, Daniel P. Huttenlocher, Jon M. Kleinberg, Siddharth Suri
2008 Finding non-redundant, statistically significant regions in high dimensional data: a novel approach to projected and subspace clustering.
Gabriela Moise, Jörg Sander
2008 Generating succinct titles for web URLs.
Deepayan Chakrabarti, Ravi Kumar, Kunal Punera
2008 Genesis of postal address reading, current state and future prospects: thirty years of pattern recognition on duty of postal services.
Udo Miletzki
2008 Get another label? improving data quality and data mining using multiple, noisy labelers.
Victor S. Sheng, Foster J. Provost, Panagiotis G. Ipeirotis
2008 Heterogeneous data fusion for Alzheimer's disease study.
Jieping Ye, Kewei Chen, Teresa Wu, Jing Li, Zheng Zhao, Rinkal Patel, Min Bae, Ravi Janardan, Huan Liu, Gene E. Alexander, Eric Reiman
2008 Hypergraph spectral learning for multi-label classification.
Liang Sun, Shuiwang Ji, Jieping Ye
2008 Identifying authoritative actors in question-answering forums: the case of Yahoo! Answers.
Mohamed Bouguessa, Benoît Dumoulin, Shengrui Wang
2008 Identifying biologically relevant genes via multiple heterogeneous data sources.
Zheng Zhao, Jiangxin Wang, Huan Liu, Jieping Ye, Yung Chang
2008 Identifying domain expertise of developers from source code.
Renuka Sindhgatta
2008 Influence and correlation in social networks.
Aris Anagnostopoulos, Ravi Kumar, Mohammad Mahdian
2008 Information extraction from Wikipedia: moving down the long tail.
Fei Wu, Raphael Hoffmann, Daniel S. Weld
2008 Internet advertising and optimal auction design.
Benjamin Edelman, Michael Schwarz
2008 Interpretable nonnegative matrix decompositions.
Saara Hyvönen, Pauli Miettinen, Evimaria Terzi
2008 Joint latent topic models for text and citations.
Ramesh Nallapati, Amr Ahmed, Eric P. Xing, William W. Cohen
2008 Knowledge discovery of semantic relationships between words using nonparametric Bayesian graph model.
Issei Sato, Minoru Yoshida, Hiroshi Nakagawa
2008 Knowledge transfer via multiple model local structure mapping.
Jing Gao, Wei Fan, Jing Jiang, Jiawei Han
2008 Land cover change detection: a case study.
Shyam Boriah, Vipin Kumar, Michael S. Steinbach, Christopher Potter, Steven A. Klooster
2008 Large scale data analysis and modelling in online services and advertising.
Thore Graepel, Ralf Herbrich
2008 Learning classifiers from only positive and unlabeled data.
Charles Elkan, Keith Noto
2008 Learning from multi-topic web documents for contextual advertisement.
Yi Zhang, Arun C. Surendran, John C. Platt, Mukund Narasimhan
2008 Learning methods for lung tumor markerless gating in image-guided radiotherapy.
Ying Cui, Jennifer G. Dy, Gregory C. Sharp, Brian M. Alexander, Steve B. Jiang
2008 Learning subspace kernels for classification.
Jianhui Chen, Shuiwang Ji, Betul Ceran, Qi Li, Mingrui Wu, Jieping Ye
2008 Local peculiarity factor and its application in outlier detection.
Jian Yang, Ning Zhong, Yiyu Yao, Jue Wang
2008 Locality sensitive hash functions based on concomitant rank order statistics.
Kave Eshghi, Shyamsundar Rajaram
2008 Microscopic evolution of social networks.
Jure Leskovec, Lars Backstrom, Ravi Kumar, Andrew Tomkins
2008 Mining adaptively frequent closed unlabeled rooted trees in data streams.
Albert Bifet, Ricard Gavaldà
2008 Mining multi-faceted overviews of arbitrary topics in a text collection.
Xu Ling, Qiaozhu Mei, ChengXiang Zhai, Bruce R. Schatz
2008 Mining preferences from superior and inferior examples.
Bin Jiang, Jian Pei, Xuemin Lin, David W. Cheung, Jiawei Han
2008 Mobile call graphs: beyond power-law and lognormal distributions.
Mukund Seshadri, Sridhar Machiraju, Ashwin Sridharan, Jean Bolot, Christos Faloutsos, Jure Leskovec
2008 Model-based document clustering with a collapsed Gibbs sampler.
Daniel David Walker, Eric K. Ringger
2008 Morpheus: interactive exploration of subspace clustering.
Emmanuel Müller, Ira Assent, Ralph Krieger, Timm Jansen, Thomas Seidl
2008 Multi-class cost-sensitive boosting with p-norm loss functions.
Aurélie C. Lozano, Naoki Abe
2008 On updates that constrain the features' connections during learning.
Omid Madani, Jian Huang
2008 Partial least squares regression for graph mining.
Hiroto Saigo, Nicole Krämer, Koji Tsuda
2008 Partitioned logistic regression for spam filtering.
Ming-Wei Chang, Wen-tau Yih, Christopher Meek
2008 Pattern-Miner: integrated management and mining over data mining models.
Evangelos E. Kotsifakos, Irene Ntoutsi, Yannis Vrahoritis, Yannis Theodoridis
2008 Permu-pattern: discovery of mutable permutation patterns with proximity constraint.
Meng Hu, Jiong Yang, Wei Su
2008 Pictor: an interactive system for importing data from a website.
Shuyi Zheng, Matthew R. Scott, Ruihua Song, Ji-Rong Wen
2008 Privacy-preserving Cox regression for survival analysis.
Shipeng Yu, Glenn Fung, Rómer Rosales, Sriram Krishnan, R. Bharat Rao, Cary Dehing-Oberije, Philippe Lambin
2008 Probabilistic latent semantic visualization: topic model for visualizing documents.
Tomoharu Iwata, Takeshi Yamada, Naonori Ueda
2008 Proceedings of the 14th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Las Vegas, Nevada, USA, August 24-27, 2008
Ying Li, Bing Liu, Sunita Sarawagi
2008 Quantitative evaluation of approximate frequent pattern mining algorithms.
Rohit Gupta, Gang Fang, Blayne Field, Michael S. Steinbach, Vipin Kumar
2008 Reconstructing chemical reaction networks: data mining meets system identification.
Yong Ju Cho, Naren Ramakrishnan, Yang Cao
2008 Regularization paths and coordinate descent.
Trevor Hastie, Jerome H. Friedman, Robert Tibshirani
2008 Relational learning via collective matrix factorization.
Ajit Paul Singh, Geoffrey J. Gordon
2008 SAIL: summation-based incremental learning for information-theoretic clustering.
Junjie Wu, Hui Xiong, Jian Chen
2008 SPIRAL: efficient and exact model identification for hidden Markov models.
Yasuhiro Fujiwara, Yasushi Sakurai, Masashi Yamamuro
2008 Scalable and near real-time burst detection from eCommerce queries.
Nish Parikh, Neel Sundaresan
2008 Scaling up text classification for large file systems.
George Forman, Shyamsundar Rajaram
2008 Semi-supervised approach to rapid and reliable labeling of large data sets.
György J. Simon, Vipin Kumar, Zhi-Li Zhang
2008 Semi-supervised learning with data calibration for long-term time series forecasting.
Haibin Cheng, Pang-Ning Tan
2008 Simultaneous tensor subspace selection and clustering: the equivalence of high order SVD and k-means clustering.
Heng Huang, Chris H. Q. Ding, Dijun Luo, Tao Li
2008 Social networks: looking ahead.
Ravi Kumar, Alexander Tuzhilin, Christos Faloutsos, David D. Jensen, Gueorgi Kossinets, Jure Leskovec, Andrew Tomkins
2008 Spectral domain-transfer learning.
Xiao Ling, Wenyuan Dai, Gui-Rong Xue, Qiang Yang, Yong Yu
2008 Spotting out emerging artists using geo-aware analysis of P2P query strings.
Noam Koenigstein, Yuval Shavitt, Tomer Tankel
2008 Stable feature selection via dense feature groups.
Lei Yu, Chris H. Q. Ding, Steven Loscalzo
2008 Stream prediction using a generative model based on frequent episodes in event sequences.
Srivatsan Laxman, Vikram Tankasali, Ryen W. White
2008 Structured entity identification and document categorization: two tasks with one joint model.
Indrajit Bhattacharya, Shantanu Godbole, Sachindra Joshi
2008 Structured learning for non-smooth ranking losses.
Soumen Chakrabarti, Rajiv Khanna, Uma Sawant, Chiru Bhattacharyya
2008 Structured metric learning for high dimensional problems.
Jason V. Davis, Inderjit S. Dhillon
2008 Succinct summarization of transactional databases: an overlapped hyperrectangle scheme.
Yang Xiang, Ruoming Jin, David Fuhry, Feodor F. Dragan
2008 Tagmark: reliable estimations of RFID tags for business processes.
Leonardo Weiss Ferreira Chaves, Erik Buchmann, Klemens Böhm
2008 Temporal pattern discovery for trends and transient effects: its application to patient records.
G. Niklas Norén, Andrew Bate, Johan Hopstadius, Kristina Star, I. Ralph Edwards
2008 Text classification, business intelligence, and interactivity: automating C-Sat analysis for services industry.
Shantanu Godbole, Shourya Roy
2008 The cost of privacy: destruction of data-mining utility in anonymized data publishing.
Justin Brickell, Vitaly Shmatikov
2008 The future of image search.
Jitendra Malik
2008 The persuasive phase of visualization.
Christine H. Chih, Douglas Stott Parker Jr.
2008 The structure of information pathways in a social communication network.
Gueorgi Kossinets, Jon M. Kleinberg, Duncan J. Watts
2008 Topical query decomposition.
Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis
2008 Training structural SVMs with kernels using sampled cuts.
Chun-Nam John Yu, Thorsten Joachims
2008 Unsupervised deduplication using cross-field dependencies.
Robert J. Hall, Charles Sutton, Andrew McCallum
2008 Unsupervised feature selection for principal components analysis.
Christos Boutsidis, Michael W. Mahoney, Petros Drineas
2008 Using
Luigi Di Caro, K. Selçuk Candan, Maria Luisa Sapino
2008 Using ghost edges for classification in sparsely labeled networks.
Brian Gallagher, Hanghang Tong, Tina Eliassi-Rad, Christos Faloutsos
2008 Using predictive analysis to improve invoice-to-cash collection.
Sai Zeng, Prem Melville, Christian A. Lang, Ioana M. Boier-Martin, Conrad Murphy
2008 Volatile correlation computation: a checkpoint view.
Wenjun Zhou, Hui Xiong
2008 Weighted graphs and disconnected components: patterns and a generator.
Mary McGlohon, Leman Akoglu, Christos Faloutsos