Joshi, Jitesh Narendra;
(2025)
Enhancing Out-of-distribution
Generalisation for Robust
Camera-based Remote Physiological Sensing.
Doctoral thesis (Ph.D), UCL (University College London).
Preview |
Text
Joshi_PhD-Thesis.pdf - Accepted Version Download (100MB) | Preview |
Abstract
Colour and thermal infrared imaging enable contactless extraction of physiological signals such as photoplethysmography (PPG) and respiration (RSP), enabling novel applications in human-computer interaction (HCI) and healthcare research. This thesis aims to develop a robust end-to-end learning-based framework for estimating remote photoplethysmography (rPPG) and remote respiration (rRSP) signals with enhanced out-of-distribution generalisation for real-world deployment. To reach this aim, this work makes a set of contributions. First, an open-source physiological computing toolkit, PhysioKit, was developed to acquire iBVP dataset that comprise synchronised RGB and thermal (RGB-T) video with multichannel physiological signals from contact-based sensors, along with real-time signal quality assessment. Second, this thesis introduces a Self-Adversarial Multiscale Contrastive Learning Framework (SAM-CL) for robust segmentation of thermal facial frames under adverse conditions and occlusions. SAM-CL has a thermal image augmentation module (TiAug), that generates adversarial samples and segmentation masks, and a multiscale contrastive loss that optimises the segmentation model. Third, to address out-of-distribution generalisation for physiological signal extraction, this work proposes FactorisePhys, a lightweight 3D-CNN architecture, and the Factorised SelfAttention Module (FSAM), mathematically grounded in non-negative matrix factorisation (NMF). FSAM selectively boosts weaker physiological signals in video frames while remaining invariant to lighting and head movements. FactorisePhys, trained with FSAM, shows superior cross-dataset generalisation for rPPG signal estimation as compared to the existing state-of-the-art (SOTA) methods. Finally, building on FSAM and advances in constrained NMF, the Target Signal Constrained Factorisation Module (TSFM) and MMRPhys, an efficient 3D-CNN architecture, are proposed for robust multitask rPPG and rRSP signal estimation using RGB-T frames. MMRPhys, trained with TSFM, shows superior cross-dataset generalisation as compared to the SOTA multitask models along with achieving the lowest inference time latency, making it suitable for real-time edge deployment. In summary, this thesis makes notable contributions to enhance out-of-distribution robustness in remote physiological sensing research.
| Type: | Thesis (Doctoral) |
|---|---|
| Qualification: | Ph.D |
| Title: | Enhancing Out-of-distribution Generalisation for Robust Camera-based Remote Physiological Sensing |
| Open access status: | An open access version is available from UCL Discovery |
| Language: | English |
| Additional information: | Copyright © The Author 2025. Original content in this thesis is licensed under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) Licence (https://creativecommons.org/licenses/by/4.0/). Any third-party copyright material present remains the property of its respective owner(s) and is licensed under its existing terms. Access may initially be restricted at the author’s request. |
| UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
| URI: | https://discovery.ucl.ac.uk/id/eprint/10213331 |
Archive Staff Only
![]() |
View Item |

