ALT B

36 papers

YearTitle / Authors
2011Accelerated Training of Max-Margin Markov Networks with Kernels.
Xinhua Zhang, Ankan Saha, S. V. N. Vishwanathan
2011Adaptive and Optimal Online Linear Regression on ℓ1-Balls.
Sébastien Gerchinovitz, Jia Yuan Yu
2011Algorithmic Learning Theory - 22nd International Conference, ALT 2011, Espoo, Finland, October 5-7, 2011. Proceedings
Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann
2011Approximate Reduction from AUC Maximization to 1-Norm Soft Margin Optimization.
Daiki Suehiro, Kohei Hatano, Eiji Takimoto
2011Asymptotically Optimal Agents.
Tor Lattimore, Marcus Hutter
2011Axioms for Rational Reinforcement Learning.
Peter Sunehag, Marcus Hutter
2011Combining Initial Segments of Lists.
Manfred K. Warmuth, Wouter M. Koolen, David P. Helmbold
2011Competing against the Best Nearest Neighbor Filter in Regression.
Arnak S. Dalalyan, Joseph Salmon
2011Deviations of Stochastic Bandit Regret.
Antoine Salomon, Jean-Yves Audibert
2011Distributional Learning of Simple Context-Free Tree Grammars.
Anna Kasprzik, Ryo Yoshinaka
2011Domain Adaptation in Regression.
Corinna Cortes, Mehryar Mohri
2011Editors' Introduction.
Jyrki Kivinen, Csaba Szepesvári, Esko Ukkonen, Thomas Zeugmann
2011Erratum: Learning without Coding.
Samuel E. Moelius, Sandra Zilles
2011Information Distance and Its Extensions.
Ming Li
2011Iterative Learning from Positive Data and Counters.
Timo Kötzing
2011Learning Relational Patterns.
Michael Geilke, Sandra Zilles
2011Learning a Classifier when the Labeling Is Known.
Shalev Ben-David, Shai Ben-David
2011Learning and Classifying.
Sanjay Jain, Eric Martin, Frank Stephan
2011Learning from Label Preferences.
Eyke Hüllermeier, Johannes Fürnkranz
2011Lipschitz Bandits without the Lipschitz Constant.
Sébastien Bubeck, Gilles Stoltz, Jia Yuan Yu
2011Making Online Decisions with Bounded Memory.
Chi-Jen Lu, Wei-Fu Lu
2011Models for Autonomously Motivated Exploration in Reinforcement Learning - (Extended Abstract).
Peter Auer, Shiau Hong Lim, Chris Watkins
2011On Noise-Tolerant Learning of Sparse Parities and Related Problems.
Elena Grigorescu, Lev Reyzin, Santosh S. Vempala
2011On Upper-Confidence Bound Policies for Switching Bandit Problems.
Aurélien Garivier, Eric Moulines
2011On the Expressive Power of Deep Architectures.
Yoshua Bengio, Olivier Delalleau
2011Optimal Estimation.
Jorma Rissanen
2011Re-adapting the Regularization of Weights for Non-stationary Regression.
Nina Vaits, Koby Crammer
2011Regret Minimization Algorithms for Pricing Lookback Options.
Eyal Gofer, Yishay Mansour
2011Robust Learning of Automatic Classes of Languages.
Sanjay Jain, Eric Martin, Frank Stephan
2011Semantic Communication for Simple Goals Is Equivalent to On-line Learning.
Brendan Juba, Santosh S. Vempala
2011Supervised Learning and Co-training.
Malte Darnstädt, Hans Ulrich Simon, Balázs Szörényi
2011The Perceptron with Dynamic Margin.
Constantinos Panagiotakopoulos, Petroula Tsampouka
2011Time Consistent Discounting.
Tor Lattimore, Marcus Hutter
2011Universal Knowledge-Seeking Agents.
Laurent Orseau
2011Universal Prediction of Selected Bits.
Tor Lattimore, Marcus Hutter, Vaibhav Gavane
2011Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits.
Alexandra Carpentier, Alessandro Lazaric, Mohammad Ghavamzadeh, Rémi Munos, Peter Auer