Structure, Learning, & Composition: Multitask Reinforcement Learning in Brains and Machines

Advanced search
Browse by:

Department | Year

UCL Theses | Latest

Deposit your research

Structure, Learning, & Composition: Multitask Reinforcement Learning in Brains and Machines

Moskovitz, Theodore Harris; (2025) Structure, Learning, & Composition: Multitask Reinforcement Learning in Brains and Machines. Doctoral thesis (Ph.D), UCL (University College London). Green open access

[thumbnail of Moskovitz_10211485_Thesis.pdf]

Preview

Text
Moskovitz_10211485_Thesis.pdf
Download (28MB) | Preview

Abstract

This thesis centers around different forms of common structure that can be shared across tasks faced by reinforcement learning agents and how these types of structure can be leveraged to both learn new behavioral policies more efficiently and compose existing policies. Specifically, the first part of this thesis is concerned with how agreement among the optimal policies for some group of tasks constitutes a form of behavioral structure. This structure can be used as the basis for a regularized policy optimization approach to speed up policy learning on new tasks. One such approach proves to be an effective model of a number of animal and human behavioral patterns observed in neuroscientific studies of dual process theories of cognition. The second part of this thesis focuses on how consistent environmental transition dynamics across tasks can be exploited by agents to learn state representations which facilitate efficient policy evaluation and composition. In particular, prior work on this topic is extended to include several forms of biologically-inspired, non-Markovian, non-stationary reward functions, with applications to both machine learning and natural behavior.

Type:	Thesis (Doctoral)
Qualification:	Ph.D
Title:	Structure, Learning, & Composition: Multitask Reinforcement Learning in Brains and Machines
Open access status:	An open access version is available from UCL Discovery
Language:	English
Additional information:	Copyright © The Author 2025. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request.
UCL classification:	UCL UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Life Sciences > Gatsby Computational Neurosci Unit
URI:	https://discovery.ucl.ac.uk/id/eprint/10211485

Downloads since deposit

0Downloads

Download activity - last month

Download activity - last 12 months

Downloads by country - last 12 months

Archive Staff Only

View Item