Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

Ronald Ortner, Daniil Ryabko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

28 Citations (Scopus)
Translated title of the contributionOnline Regret Bounds for Undiscounted Continuous Reinforcement Learning
Original languageEnglish
Title of host publicationAdvances in Neural Information Processing Systems 25
PublisherMIT Press
Pages1772-1780
Publication statusPublished - 2012

Cite this