Sharkey, Edward;
(2025)
Transformers for Commodity Forecasting.
Doctoral thesis (Ph.D), UCL (University College London).
Preview |
Text
Edward_Sharkey_Thesis_Submission.pdf - Accepted Version Download (22MB) | Preview |
Abstract
This thesis investigates the application of Transformer models for commodity forecasting (e.g., Copper), using Bidirectional Encoder Representations from Transformers (BERT) and Generative Pre-trained Transformers (GPT) model architectures. The research has five dimensions: firstly, a comparison of BERT and GPT; secondly, we built fine-tuned copper models (CopBERT, CopGPT); thirdly, we built a method for optimising Large Language Model (LLM) prompts (AgentOptimiser); fourthly, we built a signal detection model to identify price change signals within the metal futures market; finally, we built several mechanisms for mechanistic interpretability in our commodity models attempting to decipher the internal workings. The research highlights the significance of mechanistic interpretability and metaheuristics for prompt optimisation to enhance accuracy and interpretability. Mechanistic interpretability refers to understanding how a machine learning model or algorithm functions at a detailed, structural level. Metaheuristics refers to a broad class of optimisation techniques that are used to find near-optimal solutions to complex optimisation problems, particularly when the problem space is large and complex. Practical testing and trials of the models were conducted with a metal trading desk; feedback was obtained from energy companies and several large global telecommunication companies.
Type: | Thesis (Doctoral) |
---|---|
Qualification: | Ph.D |
Title: | Transformers for Commodity Forecasting |
Open access status: | An open access version is available from UCL Discovery |
Language: | English |
Additional information: | Copyright © The Author 2025. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request. |
UCL classification: | UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science UCL |
URI: | https://discovery.ucl.ac.uk/id/eprint/10206442 |
Archive Staff Only
![]() |
View Item |