Online Regret Bounds for Undiscounted Continuous Reinforcement Learning

Ronald Ortner, Daniil Ryabko

Research output: Contribution to conference, Poster, Research, peer-review

28 Citations (Scopus)
Translated title of the contribution: Online Regret Bounds for Undiscounted Continuous Reinforcement Learning
Original language: English
Publication status: Published - 2012
Event: Advances in Neural Information Processing Systems - Lake Tahoe, United States
Duration: 6 Dec 2012 to 6 Dec 2012

Conference

Conference: Advances in Neural Information Processing Systems
Abbreviated title: NIPS 2012
Country/Territory: United States
City: Lake Tahoe
Period: 6/12/12 to 6/12/12
