| 2008 | A decoupled approach to exemplar-based unsupervised learning. Sebastian Nowozin, Gökhan H. Bakir |
| 2008 | A distance model for rhythms. Jean-François Paiement, Yves Grandvalet, Samy Bengio, Douglas Eck |
| 2008 | A dual coordinate descent method for large-scale linear SVM. Cho-Jui Hsieh, Kai-Wei Chang, Chih-Jen Lin, S. Sathiya Keerthi, S. Sundararajan |
| 2008 | A generalization of Haussler's convolution kernel: mapping kernel. Kilho Shin, Tetsuji Kuboyama |
| 2008 | A least squares formulation for canonical correlation analysis. Liang Sun, Shuiwang Ji, Jieping Ye |
| 2008 | A quasi-Newton approach to non-smooth convex optimization. Jin Yu, S. V. N. Vishwanathan, Simon Günter, Nicol N. Schraudolph |
| 2008 | A rate-distortion one-class model and its applications to clustering. Koby Crammer, Partha Pratim Talukdar, Fernando C. N. Pereira |
| 2008 | A reproducing kernel Hilbert space framework for pairwise time series distances. Zhengdong Lu, Todd K. Leen, Yonghong Huang, Deniz Erdogmus |
| 2008 | A semiparametric statistical approach to model-free policy evaluation. Tsuyoshi Ueno, Motoaki Kawanabe, Takeshi Mori, Shin-ichi Maeda, Shin Ishii |
| 2008 | A unified architecture for natural language processing: deep neural networks with multitask learning. Ronan Collobert, Jason Weston |
| 2008 | A worst-case comparison between temporal difference and residual gradient with linear function approximation. Lihong Li |
| 2008 | Accurate max-margin training for structured output spaces. Sunita Sarawagi, Rahul Gupta |
| 2008 | Active kernel learning. Steven C. H. Hoi, Rong Jin |
| 2008 | Active reinforcement learning. Arkady Epshteyn, Adam Vogel, Gerald DeJong |
| 2008 | Actively learning level-sets of composite functions. Brent Bryan, Jeff G. Schneider |
| 2008 | Adaptive p-posterior mixture-model kernels for multiple instance learning. Hua-Yan Wang, Qiang Yang, Hongbin Zha |
| 2008 | An HDP-HMM for systems with state persistence. Emily B. Fox, Erik B. Sudderth, Michael I. Jordan, Alan S. Willsky |
| 2008 | An RKHS for multi-view learning and manifold co-regularization. Vikas Sindhwani, David S. Rosenberg |
| 2008 | An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning. Ronald Parr, Lihong Li, Gavin Taylor, Christopher Painter-Wakefield, Michael L. Littman |
| 2008 | An analysis of reinforcement learning with function approximation. Francisco S. Melo, Sean P. Meyn, M. Isabel Ribeiro |
| 2008 | An asymptotic analysis of generative, discriminative, and pseudolikelihood estimators. Percy Liang, Michael I. Jordan |
| 2008 | An empirical evaluation of supervised learning in high dimensions. Rich Caruana, Nikolaos Karampatziakis, Ainur Yessenalina |
| 2008 | An object-oriented representation for efficient reinforcement learning. Carlos Diuk, Andre Cohen, Michael L. Littman |
| 2008 | Apprenticeship learning using linear programming. Umar Syed, Michael H. Bowling, Robert E. Schapire |
| 2008 | Automatic discovery and transfer of MAXQ hierarchies. Neville Mehta, Soumya Ray, Prasad Tadepalli, Thomas G. Dietterich |
| 2008 | Autonomous geometric precision error estimation in low-level computer vision tasks. Andrés Corrada-Emmanuel, Howard J. Schultz |
| 2008 | Bayes optimal classification for decision trees. Siegfried Nijssen |
| 2008 | Bayesian multiple instance learning: automatic feature selection and inductive transfer. Vikas C. Raykar, Balaji Krishnapuram, Jinbo Bi, Murat Dundar, R. Bharat Rao |
| 2008 | Bayesian probabilistic matrix factorization using Markov chain Monte Carlo. Ruslan Salakhutdinov, Andriy Mnih |
| 2008 | Beam sampling for the infinite hidden Markov model. Jurgen Van Gael, Yunus Saatci, Yee Whye Teh, Zoubin Ghahramani |
| 2008 | Bi-level path following for cross validated solution of kernel quantile regression. Saharon Rosset |
| 2008 | Bolasso: model consistent Lasso estimation through the bootstrap. Francis R. Bach |
| 2008 | Boosting with incomplete information. Gholamreza Haffari, Yang Wang, Shaojun Wang, Greg Mori, Feng Jiao |
| 2008 | Causal modelling combining instantaneous and lagged effects: an identifiable model based on non-Gaussianity. Aapo Hyvärinen, Shohei Shimizu, Patrik O. Hoyer |
| 2008 | Classification using discriminative restricted Boltzmann machines. Hugo Larochelle, Yoshua Bengio |
| 2008 | Closed-form supervised dimensionality reduction with generalized linear models. Irina Rish, Genady Grabarnik, Guillermo A. Cecchi, Francisco Pereira, Geoffrey J. Gordon |
| 2008 | Composite kernel learning. Marie Szafranski, Yves Grandvalet, Alain Rakotomamonjy |
| 2008 | Compressed sensing and Bayesian experimental design. Matthias W. Seeger, Hannes Nickisch |
| 2008 | Confidence-weighted linear classification. Mark Dredze, Koby Crammer, Fernando Pereira |
| 2008 | Cost-sensitive multi-class classification from probability estimates. Deirdre B. O'Brien, Maya R. Gupta, Robert M. Gray |
| 2008 | Data spectroscopy: learning mixture models using eigenspaces of convolution operators. Tao Shi, Mikhail Belkin, Bin Yu |
| 2008 | Deep learning via semi-supervised embedding. Jason Weston, Frédéric Ratle, Ronan Collobert |
| 2008 | Democratic approximation of lexicographic preference models. Fusun Yaman, Thomas J. Walsh, Michael L. Littman, Marie desJardins |
| 2008 | Detecting statistical interactions with additive groves of trees. Daria Sorokina, Rich Caruana, Mirek Riedewald, Daniel Fink |
| 2008 | Dirichlet component analysis: feature extraction for compositional data. Hua-Yan Wang, Qiang Yang, Hong Qin, Hongbin Zha |
| 2008 | Discriminative parameter learning for Bayesian networks. Jiang Su, Harry Zhang, Charles X. Ling, Stan Matwin |
| 2008 | Discriminative structure and parameter learning for Markov logic networks. Tuyen N. Huynh, Raymond J. Mooney |
| 2008 | Efficient bandit algorithms for online multiclass prediction. Sham M. Kakade, Shai Shalev-Shwartz, Ambuj Tewari |
| 2008 | Efficient multiclass maximum margin clustering. Bin Zhao, Fei Wang, Changshui Zhang |
| 2008 | Efficient projections onto the ℓ1-ball for learning in high dimensions. John C. Duchi, Shai Shalev-Shwartz, Yoram Singer, Tushar Chandra |
| 2008 | Efficiently learning linear-linear exponential family predictive representations of state. David Wingate, Satinder Singh |
| 2008 | Efficiently solving convex relaxations for MAP estimation. M. Pawan Kumar, Philip H. S. Torr |
| 2008 | Empirical Bernstein stopping. Volodymyr Mnih, Csaba Szepesvári, Jean-Yves Audibert |
| 2008 | Estimating labels from label proportions. Novi Quadrianto, Alexander J. Smola, Tibério S. Caetano, Quoc V. Le |
| 2008 | Estimating local optimums in EM algorithm over Gaussian mixture model. Zhenjie Zhang, Bing Tian Dai, Anthony K. H. Tung |
| 2008 | Expectation-maximization for sparse and non-negative PCA. Christian D. Sigg, Joachim M. Buhmann |
| 2008 | Exploration scavenging. John Langford, Alexander L. Strehl, Jennifer Wortman |
| 2008 | Extracting and composing robust features with denoising autoencoders. Pascal Vincent, Hugo Larochelle, Yoshua Bengio, Pierre-Antoine Manzagol |
| 2008 | Fast Gaussian process methods for point process intensity estimation. John P. Cunningham, Krishna V. Shenoy, Maneesh Sahani |
| 2008 | Fast estimation of first-order clause coverage through randomization and maximum likelihood. Ondrej Kuzelka, Filip Zelezný |
| 2008 | Fast incremental proximity search in large graphs. Purnamrita Sarkar, Andrew W. Moore, Amit Prakash |
| 2008 | Fast nearest neighbor retrieval for Bregman divergences. Lawrence Cayton |
| 2008 | Fast solvers and efficient implementations for distance metric learning. Kilian Q. Weinberger, Lawrence K. Saul |
| 2008 | Fast support vector machine training and classification on graphics processors. Bryan Catanzaro, Narayanan Sundaram, Kurt Keutzer |
| 2008 | Fully distributed EM for very large datasets. Jason Andrew Wolfe, Aria Haghighi, Dan Klein |
| 2008 | Gaussian process product models for nonparametric nonstationarity. Ryan Prescott Adams, Oliver Stegle |
| 2008 | Graph kernels between point clouds. Francis R. Bach |
| 2008 | Graph transduction via alternating minimization. Jun Wang, Tony Jebara, Shih-Fu Chang |
| 2008 | Grassmann discriminant analysis: a unifying view on subspace-based learning. Jihun Ham, Daniel D. Lee |
| 2008 | Hierarchical kernel stick-breaking process for multi-task image analysis. Qi An, Chunping Wang, Ivo Shterev, Eric Wang, Lawrence Carin, David B. Dunson |
| 2008 | Hierarchical sampling for active learning. Sanjoy Dasgupta, Daniel J. Hsu |
| 2008 | ICA and ISA using Schweizer-Wolff measure of dependence. Sergey Kirshner, Barnabás Póczos |
| 2008 | Improved Nyström low-rank approximation and error analysis. Kai Zhang, Ivor W. Tsang, James T. Kwok |
| 2008 | Inverting the Viterbi algorithm: an abstract framework for structure design. Michael Schnall-Levin, Leonid Chindelevitch, Bonnie Berger |
| 2008 | Knows what it knows: a framework for self-aware learning. Lihong Li, Michael L. Littman, Thomas J. Walsh |
| 2008 | Laplace maximum margin Markov networks. Jun Zhu, Eric P. Xing, Bo Zhang |
| 2008 | Large scale manifold transduction. Michael Karlen, Jason Weston, Ayse Erkan, Ronan Collobert |
| 2008 | Learning all optimal policies with multiple criteria. Leon Barrett, Srini Narayanan |
| 2008 | Learning dissimilarities by ranking: from SDP to QP. Hua Ouyang, Alexander G. Gray |
| 2008 | Learning diverse rankings with multi-armed bandits. Filip Radlinski, Robert Kleinberg, Thorsten Joachims |
| 2008 | Learning for control from multiple demonstrations. Adam Coates, Pieter Abbeel, Andrew Y. Ng |
| 2008 | Learning from incomplete data with infinite imputations. Uwe Dick, Peter Haider, Tobias Scheffer |
| 2008 | Learning to classify with missing and corrupted features. Ofer Dekel, Ohad Shamir |
| 2008 | Learning to learn implicit queries from gaze patterns. Kai Puolamäki, Antti Ajanki, Samuel Kaski |
| 2008 | Learning to sportscast: a test of grounded language acquisition. David L. Chen, Raymond J. Mooney |
| 2008 | Listwise approach to learning to rank: theory and algorithm. Fen Xia, Tie-Yan Liu, Jue Wang, Wensheng Zhang, Hang Li |
| 2008 | Local likelihood modeling of temporal text streams. Guy Lebanon, Yang Zhao |
| 2008 | Localized multiple kernel learning. Mehmet Gönen, Ethem Alpaydin |
| 2008 | Machine Learning, Proceedings of the Twenty-Fifth International Conference (ICML 2008), Helsinki, Finland, June 5-9, 2008. William W. Cohen, Andrew McCallum, Sam T. Roweis |
| 2008 | Manifold alignment using Procrustes analysis. Chang Wang, Sridhar Mahadevan |
| 2008 | ManifoldBoost: stagewise function approximation for fully-, semi- and un-supervised learning. Nicolas Loeff, David A. Forsyth, Deepak Ramachandran |
| 2008 | Maximum likelihood rule ensembles. Krzysztof Dembczynski, Wojciech Kotlowski, Roman Slowinski |
| 2008 | Memory bounded inference in topic models. Ryan Gomes, Max Welling, Pietro Perona |
| 2008 | Message-passing for graph-structured linear programs: proximal projections, convergence and rounding schemes. Pradeep Ravikumar, Alekh Agarwal, Martin J. Wainwright |
| 2008 | Metric embedding for kernel classification rules. Bharath K. Sriperumbudur, Omer A. Lang, Gert R. G. Lanckriet |
| 2008 | Modeling interleaved hidden processes. Niels Landwehr |
| 2008 | Modified MMI/MPE: a direct evaluation of the margin in speech recognition. Georg Heigold, Thomas Deselaers, Ralf Schlüter, Hermann Ney |
| 2008 | Multi-classification by categorical features via clustering. Yevgeny Seldin, Naftali Tishby |
| 2008 | Multi-task compressive sensing with Dirichlet process priors. Yuting Qi, Dehong Liu, David B. Dunson, Lawrence Carin |
| 2008 | Multi-task learning for HIV therapy screening. Steffen Bickel, Jasmina Bogojeska, Thomas Lengauer, Tobias Scheffer |
| 2008 | Multiple instance ranking. Charles Bergeron, Jed Zaretzki, Curt M. Breneman, Kristin P. Bennett |
| 2008 | Nearest hyperdisk methods for high-dimensional classification. Hakan Cevikalp, Bill Triggs, Robi Polikar |
| 2008 | No-regret learning in convex games. Geoffrey J. Gordon, Amy Greenwald, Casey Marks |
| 2008 | Non-parametric policy gradients: a unified treatment of propositional and relational domains. Kristian Kersting, Kurt Driessens |
| 2008 | Nonextensive entropic kernels. André F. T. Martins, Mário A. T. Figueiredo, Pedro M. Q. Aguiar, Noah A. Smith, Eric P. Xing |
| 2008 | Nonnegative matrix factorization via rank-one downdate. Michael Biggs, Ali Ghodsi, Stephen A. Vavasis |
| 2008 | On multi-view active learning and the combination with semi-supervised learning. Wei Wang, Zhi-Hua Zhou |
| 2008 | On partial optimality in multi-label MRFs. Pushmeet Kohli, Alexander Shekhovtsov, Carsten Rother, Vladimir Kolmogorov, Philip H. S. Torr |
| 2008 | On the chance accuracies of large collections of classifiers. Mark Palatucci, Andrew Carlson |
| 2008 | On the hardness of finding symmetries in Markov decision processes. Shravan Matthur Narayanamurthy, Balaraman Ravindran |
| 2008 | On the quantitative analysis of deep belief networks. Ruslan Salakhutdinov, Iain Murray |
| 2008 | On-line discovery of temporal-difference networks. Takaki Makino, Toshihisa Takagi |
| 2008 | Online kernel selection for Bayesian reinforcement learning. Joseph Reisinger, Peter Stone, Risto Miikkulainen |
| 2008 | Optimized cutting plane algorithm for support vector machines. Vojtech Franc, Sören Sonnenburg |
| 2008 | Optimizing estimated loss reduction for active sampling in rank learning. Pinar Donmez, Jaime G. Carbonell |
| 2008 | Pairwise constraint propagation by semidefinite programming for semi-supervised classification. Zhenguo Li, Jianzhuang Liu, Xiaoou Tang |
| 2008 | Pointwise exact bootstrap distributions of cost curves. Charles Dugas, David Gadoury |
| 2008 | Polyhedral classifier for target detection: a case study: colorectal cancer. Murat Dundar, Matthias Wolf, Sarang Lakare, Marcos Salganicoff, Vikas C. Raykar |
| 2008 | Preconditioned temporal difference learning. Hengshuai Yao, Zhi-Qiang Liu |
| 2008 | Predicting diverse subsets using structural SVMs. Yisong Yue, Thorsten Joachims |
| 2008 | Prediction with expert advice for the Brier game. Vladimir Vovk, Fedor Zhdanov |
| 2008 | Privacy-preserving reinforcement learning. Jun Sakuma, Shigenobu Kobayashi, Rebecca N. Wright |
| 2008 | Query-level stability and generalization in learning to rank. Yanyan Lan, Tie-Yan Liu, Tao Qin, Zhiming Ma, Hang Li |
| 2008 | Random classification noise defeats all convex potential boosters. Philip M. Long, Rocco A. Servedio |
| 2008 | Rank minimization via online learning. Raghu Meka, Prateek Jain, Constantine Caramanis, Inderjit S. Dhillon |
| 2008 | Reinforcement learning in the presence of rare events. Jordan Frank, Shie Mannor, Doina Precup |
| 2008 | Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs. Finale Doshi, Joelle Pineau, Nicholas Roy |
| 2008 | Robust matching and recognition using context-dependent kernels. Hichem Sahbi, Jean-Yves Audibert, Jaonary Rabarisoa, Renaud Keriven |
| 2008 | SVM optimization: inverse dependence on training set size. Shai Shalev-Shwartz, Nathan Srebro |
| 2008 | Sample-based learning and search with permanent and transient memories. David Silver, Richard S. Sutton, Martin Müller |
| 2008 | Self-taught clustering. Wenyuan Dai, Qiang Yang, Gui-Rong Xue, Yong Yu |
| 2008 | Semi-supervised learning of compact document representations with deep networks. Marc'Aurelio Ranzato, Martin Szummer |
| 2008 | Sequence kernels for predicting protein essentiality. Cyril Allauzen, Mehryar Mohri, Ameet Talwalkar |
| 2008 | Space-indexed dynamic programming: learning to follow trajectories. J. Zico Kolter, Adam Coates, Andrew Y. Ng, Yi Gu, Charles DuHadway |
| 2008 | Sparse Bayesian nonparametric regression. Francois Caron, Arnaud Doucet |
| 2008 | Sparse multiscale Gaussian process regression. Christian Walder, Kwang In Kim, Bernhard Schölkopf |
| 2008 | Spectral clustering with inconsistent advice. Tom Coleman, James Saunderson, Anthony Wirth |
| 2008 | Stability of transductive regression algorithms. Corinna Cortes, Mehryar Mohri, Dmitry Pechyony, Ashish Rastogi |
| 2008 | Statistical models for partial membership. Katherine A. Heller, Sinead Williamson, Zoubin Ghahramani |
| 2008 | Stopping conditions for exact computation of leave-one-out error in support vector machines. Vojtech Franc, Pavel Laskov, Klaus-Robert Müller |
| 2008 | Strategy evaluation in extensive games with importance sampling. Michael H. Bowling, Michael Johanson, Neil Burch, Duane Szafron |
| 2008 | Structure compilation: trading structure for features. Percy Liang, Hal Daumé III, Dan Klein |
| 2008 | Tailoring density estimation via reproducing kernel moment matching. Le Song, Xinhua Zhang, Alexander J. Smola, Arthur Gretton, Bernhard Schölkopf |
| 2008 | The Group-Lasso for generalized linear models: uniqueness of solutions and efficient algorithms. Volker Roth, Bernd Fischer |
| 2008 | The asymptotics of semi-supervised learning in discriminative probabilistic models. Nataliya Sokolovska, Olivier Cappé, François Yvon |
| 2008 | The dynamic hierarchical Dirichlet process. Lu Ren, David B. Dunson, Lawrence Carin |
| 2008 | The many faces of optimism: a unifying approach. Istvan Szita, András Lörincz |
| 2008 | The projectron: a bounded kernel-based Perceptron. Francesco Orabona, Joseph Keshet, Barbara Caputo |
| 2008 | The skew spectrum of graphs. Risi Kondor, Karsten M. Borgwardt |
| 2008 | Topologically-constrained latent variable models. Raquel Urtasun, David J. Fleet, Andreas Geiger, Jovan Popovic, Trevor Darrell, Neil D. Lawrence |
| 2008 | Training SVM with indefinite kernels. Jianhui Chen, Jieping Ye |
| 2008 | Training restricted Boltzmann machines using approximations to the likelihood gradient. Tijmen Tieleman |
| 2008 | Training structural SVMs when exact inference is intractable. Thomas Finley, Thorsten Joachims |
| 2008 | Transfer of samples in batch reinforcement learning. Alessandro Lazaric, Marcello Restelli, Andrea Bonarini |
| 2008 | Uncorrelated multilinear principal component analysis through successive variance maximization. Haiping Lu, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos |
| 2008 | Unsupervised rank aggregation with distance-based models. Alexandre Klementiev, Dan Roth, Kevin Small |