ICML A*

88 papers

YearTitle / Authors
2002A Boosted Maximum Entropy Model for Learning Text Chunking.
Seong-Bae Park, Byoung-Tak Zhang
2002A Fast Dual Algorithm for Kernel Logistic Regression.
S. Sathiya Keerthi, Kaibo Duan, Shirish K. Shevade, Aun Neow Poo
2002A Necessary Condition of Convergence for Reinforcement Learning with Function Approximation.
Artur Merke, Ralf Schoknecht
2002A New Statistical Approach to Personal Name Extraction.
Zheng Chen, Liu Wenyin, Feng Zhang
2002A Unified Decomposition of Ensemble Loss for Predicting Ensemble Performance.
Michael Goebel, Patricia J. Riddle, Mike Barley
2002Action Refinement in Reinforcement Learning by Probability Smoothing.
Thomas G. Dietterich, Dídac Busquets, Ramón López de Mántaras, Carles Sierra
2002Active + Semi-supervised Learning = Robust Multi-View Learning.
Ion Muslea, Steven Minton, Craig A. Knoblock
2002Adaptive View Validation: A First Step Towards Automatic View Detection.
Ion Muslea, Steven Minton, Craig A. Knoblock
2002Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs.
Carlos Guestrin, Relu Patrascu, Dale Schuurmans
2002An Alternate Objective Function for Markovian Fields.
Sham M. Kakade, Yee Whye Teh, Sam T. Roweis
2002An Analysis of Functional Trees.
João Gama
2002An epsilon-Optimal Grid-Based Algorithm for Partially Observable Markov Decision Processes.
Blai Bonet
2002Anytime Interval-Valued Outputs for Kernel Machines: Fast Support Vector Machine Classification via Distance Geometry.
Dennis DeCoste
2002Approximately Optimal Approximate Reinforcement Learning.
Sham M. Kakade, John Langford
2002Classification Value Grouping.
Colin K. M. Ho
2002Combining Labeled and Unlabeled Data for MultiClass Text Categorization.
Rayid Ghani
2002Combining Trainig Set and Test Set Bounds.
John Langford
2002Competitive Analysis of the Explore/Exploit Tradeoff.
John Langford, Martin Zinkevich, Sham M. Kakade
2002Constraint-based Learning of Long Relational Concepts.
Jacques Ales Bianchetti, Céline Rouveirol, Michèle Sebag
2002Content-Based Image Retrieval Using Multiple-Instance Learning.
Qi Zhang, Sally A. Goldman, Wei Yu, Jason E. Fritts
2002Coordinated Reinforcement Learning.
Carlos Guestrin, Michail G. Lagoudakis, Ronald Parr
2002Cranking: Combining Rankings Using Conditional Probability Models on Permutations.
Guy Lebanon, John D. Lafferty
2002Descriptive Induction through Subgroup Discovery: A Case Study in a Medical Domain.
Dragan Gamberger, Nada Lavrac
2002Diffusion Kernels on Graphs and Other Discrete Input Spaces.
Risi Kondor, John D. Lafferty
2002Discovering Hierarchy in Reinforcement Learning with HEXQ.
Bernhard Hengst
2002Discriminative Feature Selection via Multiclass Variable Memory Markov Model.
Noam Slonim, Gill Bejerano, Shai Fine, Naftali Tishby
2002Exact model averaging with naive Bayesian classifiers.
Denver Dash, Gregory F. Cooper
2002Exploiting Relations Among Concepts to Acquire Weakly Labeled Training Data.
Joseph Bockhorst, Mark Craven
2002Fast Minimum Training Error Discretization.
Tapio Elomaa, Juho Rousu
2002Feature Selection with Selective Sampling.
Huan Liu, Hiroshi Motoda, Lei Yu
2002Feature Subset Selection and Inductive Logic Programming.
Érick Alphonse, Stan Matwin
2002Finding an Optimal Gain-Ratio Subset-Split Test for a Set-Valued Attribute in Decision Tree Induction.
Fumio Takechi, Einoshin Suzuki
2002From Instance-level Constraints to Space-Level Constraints: Making the Most of Prior Knowledge in Data Clustering.
Dan Klein, Sepandar D. Kamvar, Christopher D. Manning
2002Graph-Based Relational Concept Learning.
Jesus A. Gonzalez, Lawrence B. Holder, Diane J. Cook
2002Hierarchically Optimal Average Reward Reinforcement Learning.
Mohammad Ghavamzadeh, Sridhar Mahadevan
2002How to Make Stacking Better and Faster While Also Taking Care of an Unknown Weakness.
Alexander K. Seewald
2002IEMS - The Intelligent Email Sorter.
Elisabeth Crawford, Judy Kay, Eric McCreath
2002Incorporating Prior Knowledge into Boosting.
Robert E. Schapire, Marie Rochery, Mazin G. Rahim, Narendra K. Gupta
2002Inducing Process Models from Continuous Data.
Pat Langley, Javier Nicolás Sánchez, Ljupco Todorovski, Saso Dzeroski
2002Integrating Experimentation and Guidance in Relational Reinforcement Learning.
Kurt Driessens, Saso Dzeroski
2002Interpreting and Extending Classical Agglomerative Clustering Algorithms using a Model-Based approach.
Sepandar D. Kamvar, Dan Klein, Christopher D. Manning
2002Investigating the Maximum Likelihood Alternative to TD(lambda).
Fletcher Lu, Relu Patrascu, Dale Schuurmans
2002Is Combining Classifiers Better than Selecting the Best One.
Saso Dzeroski, Bernard Zenko
2002Issues in Classifier Evaluation using Optimal Cost Curves.
Kai Ming Ting
2002Kernels for Semi-Structured Data.
Hisashi Kashima, Teruo Koyanagi
2002Learning Decision Rules by Randomized Iterative Local Search.
Michael Chisholm, Prasad Tadepalli
2002Learning Decision Trees Using the Area Under the ROC Curve.
César Ferri, Peter A. Flach, José Hernández-Orallo
2002Learning Spatial and Temporal Correlation for Navigation in a 2-Dimensional Continuous World.
Anand Panangadan, Michael G. Dyer
2002Learning from Scarce Experience.
Leonid Peshkin, Christian R. Shelton
2002Learning k-Reversible Context-Free Grammars from Positive Structural Examples.
Tim Oates, Devina Desai, Vinay Bhat
2002Learning the Kernel Matrix with Semi-Definite Programming.
Gert R. G. Lanckriet, Nello Cristianini, Peter L. Bartlett, Laurent El Ghaoui, Michael I. Jordan
2002Learning to Fly by Controlling Dynamic Instabilities.
David Stirling
2002Learning to Share Distributed Probabilistic Beliefs.
Christopher Leckie, Kotagiri Ramamohanarao
2002Learning word normalization using word suffix and context from unlabeled data.
Dunja Mladenic
2002Linkage and Autocorrelation Cause Feature Selection Bias in Relational Learning.
David D. Jensen, Jennifer Neville
2002MMIHMM: Maximum Mutual Information Hidden Markov Models.
Nuria Oliver, Ashutosh Garg
2002Machine Learning, Proceedings of the Nineteenth International Conference (ICML 2002), University of New South Wales, Sydney, Australia, July 8-12, 2002
Claude Sammut, Achim G. Hoffmann
2002Markov Chain Monte Carlo Sampling using Direct Search Optimization.
Malcolm J. A. Strens, Mark Bernhardt, Nicholas Everett
2002Mining Both Positive and Negative Association Rules.
Xindong Wu, Chengqi Zhang, Shichao Zhang
2002Model-based Hierarchical Average-reward Reinforcement Learning.
Sandeep Seri, Prasad Tadepalli
2002Modeling Auction Price Uncertainty Using Boosting-based Conditional Density Estimation.
Robert E. Schapire, Peter Stone, David A. McAllester, Michael L. Littman, János A. Csirik
2002Modeling for Optimal Probability Prediction.
Yong Wang, Ian H. Witten
2002Multi-Instance Kernels.
Thomas Gärtner, Peter A. Flach, Adam Kowalczyk, Alexander J. Smola
2002Non-Disjoint Discretization for Naive-Bayes Classifiers.
Ying Yang, Geoffrey I. Webb
2002On generalization bounds, projection profile, and margin distribution.
Ashutosh Garg, Sariel Har-Peled, Dan Roth
2002On the Existence of Fixed Points for Q-Learning and Sarsa in Partially Observable Domains.
Theodore J. Perkins, Mark D. Pendrith
2002Partially Supervised Classification of Text Documents.
Bing Liu, Wee Sun Lee, Philip S. Yu, Xiaoli Li
2002PolicyBlocks: An Algorithm for Creating Useful Macro-Actions in Reinforcement Learning.
Marc Pickett, Andrew G. Barto
2002Pruning Improves Heuristic Search for Cost-Sensitive Learning.
Valentina Bayer Zubek, Thomas G. Dietterich
2002Qualitative reverse engineering.
Dorian Suc, Ivan Bratko
2002Randomized Variable Elimination.
David J. Stracuzzi, Paul E. Utgoff
2002Refining the Wrapper Approach - Smoothed Error Estimates for Feature Selection.
Loo-Nin Teow, Haifeng Liu, Hwee Tou Ng, Eric Yap
2002Reinforcement Learning and Shaping: Encouraging Intended Behaviors.
Adam Laud, Gerald DeJong
2002Representational Upper Bounds of Bayesian Networks.
Huajie Zhang, Charles X. Ling
2002Scalable Internal-State Policy-Gradient Methods for POMDPs.
Douglas Aberdeen, Jonathan Baxter
2002Semi-supervised Clustering by Seeding.
Sugato Basu, Arindam Banerjee, Raymond J. Mooney
2002Separating Skills from Preference: Using Learning to Program by Reward.
Daniel G. Shapiro, Pat Langley
2002Sparse Bayesian Learning for Regression and Classification using Markov Chain Monte Carlo.
Shien-Shin Tham, Arnaud Doucet, Kotagiri Ramamohanarao
2002Statistical Behavior and Consistency of Support Vector Machines, Boosting, and Beyond.
Tong Zhang
2002Stock Trading System Using Reinforcement Learning with Cooperative Agents.
Jangmin O, Jae Won Lee, Byoung-Tak Zhang
2002Sufficient Dimensionality Reduction - A novel Analysis Method.
Amir Globerson, Naftali Tishby
2002Syllables and other String Kernel Extensions.
Craig Saunders, Hauke Tschach, John Shawe-Taylor
2002The Perceptron Algorithm with Uneven Margins.
Yaoyong Li, Hugo Zaragoza, Ralf Herbrich, John Shawe-Taylor, Jaz S. Kandola
2002Towards "Large Margin" Speech Recognizers by Boosting and Discriminative Training.
Carsten Meyer, Peter Beyerlein
2002Transformation-Based Regression.
Björn Bringmann, Stefan Kramer, Friedrich Neubarth, Hannes Pirker, Gerhard Widmer
2002Univariate Polynomial Inference by Monte Carlo Message Length Approximation.
Leigh J. Fitzgibbon, David L. Dowe, Lloyd Allison
2002Using Abstract Models of Behaviours to Automatically Generate Reinforcement Learning Hierarchies.
Malcolm R. K. Ryan
2002Using Unlabelled Data for Text Classification through Addition of Cluster Parameters.
Bhavani Raskutti, Herman L. Ferrá, Adam Kowalczyk