UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Dungeons and Data: A Large-Scale NetHack Dataset

Hambro, E; Raileanu, R; Rothermel, D; Mella, V; Rocktäschel, T; Küttler, H; Murray, N; (2022) Dungeons and Data: A Large-Scale NetHack Dataset. In: Advances in Neural Information Processing Systems. NeurIPS Green open access

[thumbnail of 187_dungeons_and_data_a_large_scal.pdf]
Preview
Text
187_dungeons_and_data_a_large_scal.pdf - Published Version

Download (1MB) | Preview

Abstract

Recent breakthroughs in the development of agents to solve challenging sequential decision making problems such as Go [50], StarCraft [58], or DOTA [3], have relied on both simulated environments and large-scale datasets. However, progress on this research has been hindered by the scarcity of open-sourced datasets and the prohibitive computational cost to work with them. Here we present the NetHack Learning Dataset (NLD), a large and highly-scalable dataset of trajectories from the popular game of NetHack, which is both extremely challenging for current methods and very fast to run [23]. NLD consists of three parts: 10 billion state transitions from 1.5 million human trajectories collected on the NAO public NetHack server from 2009 to 2020; 3 billion state-action-score transitions from 100,000 trajectories collected from the symbolic bot winner of the NetHack Challenge 2021; and, accompanying code for users to record, load and stream any collection of such trajectories in a highly compressed form. We evaluate a wide range of existing algorithms including online and offline RL, as well as learning from demonstrations, showing that significant research advances are needed to fully leverage large-scale datasets for challenging sequential decision making tasks.

Type: Proceedings paper
Title: Dungeons and Data: A Large-Scale NetHack Dataset
Event: 36th Conference on Neural Information Processing Systems (NeurIPS 2022)
ISBN-13: 9781713871088
Open access status: An open access version is available from UCL Discovery
Publisher version: https://proceedings.neurips.cc/paper_files/paper/2...
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10177322
Downloads since deposit
9Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item