UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Adversarial recovery of agent rewards from latent spaces of the limit order book

Roa Vicens, J; Wang, Y; Mison, V; Gal, Y; Silva, R; (2019) Adversarial recovery of agent rewards from latent spaces of the limit order book. In: Proceedings of the Robust AI in Financial Services: Data, Fairness, Explainability, Trustworthiness, and Privacy. NeurIPS 2019: Vancouver, Canada.. Green open access

[thumbnail of Adversarial recovery of agent rewards from latent spaces of the limit order book.pdf]
Preview
Text
Adversarial recovery of agent rewards from latent spaces of the limit order book.pdf - Published Version

Download (667kB) | Preview

Abstract

Inverse reinforcement learning has proved its ability to explain state-action trajectories of expert agents by recovering their underlying reward functions in increasingly challenging environments. Recent advances in adversarial learning have allowed extending inverse RL to applications with non-stationary environment dynamics unknown to the agents, arbitrary structures of reward functions and improved handling of the ambiguities inherent to the ill-posed nature of inverse RL. This is particularly relevant in real time applications on stochastic environments involving risk, like volatile financial markets. Moreover, recent work on simulation of complex environments enable learning algorithms to engage with real market data through simulations of its latent space representations, avoiding a costly exploration of the original environment. In this paper, we explore whether adversarial inverse RL algorithms can be adapted and trained within such latent space simulations from real market data, while maintaining their ability to recover agent rewards robust to variations in the underlying dynamics, and transfer them to new regimes of the original environment.

Type: Proceedings paper
Title: Adversarial recovery of agent rewards from latent spaces of the limit order book
Event: 33rd Conference on Neural Information Processing Systems (NeurIPS 2019)
Location: Vancouver, Canada
Open access status: An open access version is available from UCL Discovery
Publisher version: https://nips.cc/Conferences/2019/Schedule?showEven...
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery.ucl.ac.uk/id/eprint/10141934
Downloads since deposit
9Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item