UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Transformers for Commodity Forecasting

Sharkey, Edward; (2025) Transformers for Commodity Forecasting. Doctoral thesis (Ph.D), UCL (University College London). Green open access

[thumbnail of Edward_Sharkey_Thesis_Submission.pdf]
Preview
Text
Edward_Sharkey_Thesis_Submission.pdf - Accepted Version

Download (22MB) | Preview

Abstract

This thesis investigates the application of Transformer models for commodity forecasting (e.g., Copper), using Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformers (GPT) model architectures. The research has five dimensions: firstly, a comparison of BERT and GPT; secondly, we built fine-tuned copper models (CopBERT, CopGPT); thirdly, we built a method for optimising Large Language Model (LLM) prompts (AgentOptimiser); fourthly, we built a signal detection model to identify price change signals within the metal futures market; finally, we built several mechanisms for mechanistic interpretability in our commodity models attempting to decipher the internal workings. The research highlights the significance of mechanistic interpretability and metaheuristics for prompt optimisation to enhance accuracy and interpretability. Mechanistic interpretability refers to understanding how a machine learning model or algorithm functions at a detailed, structural level. Metaheuristics refers to a broad class of optimisation techniques that are used to find near-optimal solutions to complex optimisation problems, particularly when the problem space is large and complex. Practical testing and trials of the models were conducted with a metal trading desk; feedback was obtained from energy companies and several large global telecommunication companies.

Type: Thesis (Doctoral)
Qualification: Ph.D
Title: Transformers for Commodity Forecasting
Open access status: An open access version is available from UCL Discovery
Language: English
Additional information: Copyright © The Author 2025. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request.
UCL classification: UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
UCL
URI: https://discovery.ucl.ac.uk/id/eprint/10206442
Downloads since deposit
150Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item