Distributionally Robust Model-based Reinforcement Learning with Large State Spaces

Ramesh, Shyam Sundhar; Sessa, Pier Giuseppe; Hu, Yifan; Krause, Andreas; Bogunovic, Ilija; (2024) Distributionally Robust Model-based Reinforcement Learning with Large State Spaces. In: Dasgupta, S and Mandt, S and Li, Y, (eds.) Proceedings of The 27th International Conference on Artificial Intelligence and Statistics. (pp. 1-42). PMLR. Green open access

Text: sundhar-ramesh24a.pdf - Published Version (1MB)

Abstract

Three major challenges in reinforcement learning are complex dynamical systems with large state spaces, costly data acquisition processes, and the deviation of real-world dynamics at deployment from those of the training environment. To overcome these issues, we study distributionally robust Markov decision processes with continuous state spaces under the widely used Kullback–Leibler, chi-square, and total variation uncertainty sets. We propose a model-based approach that utilizes Gaussian Processes and the maximum variance reduction algorithm to efficiently learn multi-output nominal transition dynamics, leveraging access to a generative model (i.e., simulator). We further demonstrate the statistical sample complexity of the proposed method for different uncertainty sets. These complexity bounds are independent of the number of states and extend beyond linear dynamics, ensuring the effectiveness of our approach in identifying near-optimal distributionally robust policies. The proposed method can be further combined with other model-free distributionally robust reinforcement learning methods to obtain a near-optimal robust policy. Experimental results demonstrate the robustness of our algorithm to distributional shifts and its superior performance in terms of the number of samples needed.
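
As a rough illustration of the maximum variance reduction idea mentioned in the abstract, the sketch below greedily queries the state-action pair at which a Gaussian Process posterior is most uncertain. It is not the authors' implementation: the squared-exponential kernel, the lengthscale, the noise level, the candidate grid, and the names rbf_kernel, posterior_variance, and max_variance_reduction are all assumptions made for this example. It tracks a single posterior variance, which would be shared across output dimensions only if each output were modelled by an independent GP with the same kernel.

```python
import numpy as np

def rbf_kernel(a, b, lengthscale=0.2):
    # Squared-exponential kernel between the rows of a and the rows of b.
    # Kernel choice and lengthscale are assumptions for this sketch.
    sq_dist = ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=-1)
    return np.exp(-0.5 * sq_dist / lengthscale**2)

def posterior_variance(candidates, queried, noise=1e-2):
    # GP posterior variance at each candidate given the queried inputs.
    # It depends only on input locations, not on the observed values.
    prior = np.ones(len(candidates))  # k(x, x) = 1 for this kernel
    if len(queried) == 0:
        return prior
    K = rbf_kernel(queried, queried) + noise * np.eye(len(queried))
    k_star = rbf_kernel(queried, candidates)   # shape (n_queried, n_candidates)
    solved = np.linalg.solve(K, k_star)        # K^{-1} k_star
    return prior - np.einsum("ij,ij->j", k_star, solved)

def max_variance_reduction(candidates, n_queries, noise=1e-2):
    # Greedy loop: at each round, pick the candidate with the largest
    # posterior variance; a simulator would then be queried at that point.
    chosen = []
    for _ in range(n_queries):
        queried = candidates[np.array(chosen, dtype=int)]
        variance = posterior_variance(candidates, queried, noise)
        chosen.append(int(np.argmax(variance)))
    return chosen

# Hypothetical candidate set: a 10 x 10 grid of (state, action) pairs.
grid = np.linspace(0.0, 1.0, 10)
candidates = np.array([[s, a] for s in grid for a in grid])
chosen = max_variance_reduction(candidates, n_queries=10)
```

Because the posterior variance does not depend on observed values, the query locations in this sketch can be computed before any simulator call is made; the simulator outputs are needed only afterwards, to fit the posterior mean of the nominal dynamics.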

Type: Proceedings paper
Title: Distributionally Robust Model-based Reinforcement Learning with Large State Spaces
Event: 27th International Conference on Artificial Intelligence and Statistics (AISTATS)
Location: Valencia, Spain
Dates: 2nd-4th May 2024
Open access status: An open access version is available from UCL Discovery
Publisher version: https://proceedings.mlr.press/v238/sundhar-ramesh2...
Language: English
Additional information: © The Author 2024. Original content in this paper is licensed under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) Licence (https://creativecommons.org/licenses/by/4.0/).
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10198782