Dietrich, Florian M.;
(2025)
Machine Learning Nucleation Collective Variables using Graph Neural Networks.
Doctoral thesis (Ph.D), UCL (University College London).
|
Text
PhD_Thesis.pdf - Accepted Version Access restricted to UCL open access staff until 1 December 2025. Download (101MB) |
Abstract
One of the main obstacles limiting the use of enhanced sampling techniques to study nucleation events, in particular those of molecular systems, in realistic environments is the computational cost associated with evaluating and differentiating nucleation collective variables. This work addresses this bottleneck by developing a framework for training approximations of nucleation collective variables based on graph neural networks. These approximations are more computationally efficient and scale more favourably with system size than their analytical counterparts. This work further demonstrates how to recover the free energy surfaces corresponding to the analytical variables from model sampling, which enables the construction of free energy profiles in non-differentiable, previously inaccessible, descriptor spaces. The work on constructing, training and deploying graph neural network-based machine-learned collective variables led to studying the theoretical implications of constructing free energy surfaces in the space of variables that are the result of stochastic processes. This has led to the development of guidelines for the interpretation of free energy surfaces in machine-learned collective variable spaces, a novel gradient rescaling method and a novel regularisation technique. The generality and transferability of this framework are finally demonstrated by enhancing the sampling of guanine nucleation from solution. A system for which the nucleation mechanism was elucidated prior, in a joint experimental and computational study.
| Type: | Thesis (Doctoral) |
|---|---|
| Qualification: | Ph.D |
| Title: | Machine Learning Nucleation Collective Variables using Graph Neural Networks |
| Language: | English |
| Additional information: | Copyright © The Author 2025. Original content in this thesis is licensed under the terms of the Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0) Licence (https://creativecommons.org/licenses/by-nc/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request. |
| Keywords: | Enhanced Sampling, Machine Learning, Collective Variable, Nucleation |
| UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > MAPS Faculty Office |
| URI: | https://discovery.ucl.ac.uk/id/eprint/10209068 |
Archive Staff Only
![]() |
View Item |

