Odalric-Ambrym Maillard

(Former)

Publications

  1. 2014
  2. Published

    Selecting Near-Optimal Approximate State Representations in Reinforcement Learning

    Ortner, R., Maillard, O-A. & Ryabko, D. 2014 Algorithmic Learning Theory - 25th International Conference, ALT 2014, Bled, October 8-10, 2014. pp. 140-154

    Research output: Research - Contribution to conference proceedings

  3. 2013
  4. Published

    Competing with an Infinite Set of Models in Reinforcement Learning

    Nguyen, P., Maillard, O-A., Ryabko, D. & Ortner, R. 2013 JMLR Workshop and Conference Proceedings Volume 31: Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. pp. 463-471

    Research output: Research - Contribution to conference proceedings

  5. Published

    Linear regression with random projections.

    Maillard, O-A. 2013 in: Journal of Machine Learning Research (JMLR). 13, pp. 1-1

    Research output: Research - (peer-reviewed) Article

  6. Published

    Optimal regret bounds for selecting the state representation in reinforcement learning.

    Maillard, O-A., Nguyen, P., Ortner, R. & Ryabko, D. 2013 JMLR Workshop and Conference Proceedings Volume 28: Proceedings of The 30th International Conference on Machine Learning. pp. 543-551

    Research output: Research - Contribution to conference proceedings

  7. 2011
  8. Published

    Adaptive bandits: Towards the best history-dependent strategy

    Maillard, O-A. 2011 Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics. pp. 570-578

    Research output: Research - Contribution to conference proceedings

  9. Published

    Finite-Time Analysis of Multi-armed Bandits Problems with Kullback-Leibler Divergences

    Maillard, O-A. 2011 Proceedings of the 24th Annual Conference on Learning Theory. pp. 497-514

    Research output: Research - Contribution to conference proceedings

  10. Published

    Selecting the State-Representation in Reinforcement Learning

    Maillard, O-A. 2011 Advances in Neural Information Processing Systems 24. pp. 2627-2635

    Research output: Research - Contribution to conference proceedings

  11. Published

    Sparse recovery with Brownian sensing

    Maillard, O-A. 2011 Advances in Neural Information Processing Systems 24. pp. 1782-1790

    Research output: Research - Contribution to conference proceedings