UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

SurgflowNet: Leveraging unannotated video for consistent endoscopic pituitary surgery workflow recognition

Wijekoon, Anjana; Das, Adrito; Mao, Zhehua; Khan, Danyal Z; Hanrahan, John G; Stoyanov, Danail; Marcus, Hani J; (2026) SurgflowNet: Leveraging unannotated video for consistent endoscopic pituitary surgery workflow recognition. Artificial Intelligence in Medicine , 172 , Article 103309. 10.1016/j.artmed.2025.103309. Green open access

[thumbnail of 1-s2.0-S0933365725002441-main.pdf]
Preview
Text
1-s2.0-S0933365725002441-main.pdf - Published Version

Download (2MB) | Preview

Abstract

Surgical workflow recognition has the potential to accelerate training initiatives through the analysis of surgical videos, improve intraoperative efficiency, and support preemptive postoperative care. Unlike well-explored minimally invasive surgeries, where surgical workflows are consistent across patients, automating endoscopic pituitary surgery workflow recognition is challenging. Pituitary surgery involves a large number of steps, diverse sequences, optional steps, and frequent transitions, making it challenging for current state-of-the-art (SOTA) methods, which struggle with transferability. Progress is largely limited by the lack of annotated data that captures the complexity of pituitary surgery, and obtaining such annotations is both time-consuming and resource-intensive. This paper presents SurgflowNet, a novel spatio-temporal model for consistent pituitary workflow recognition leveraging unannotated data. We utilise a limited yet fully annotated dataset to infer quasi-labels for unannotated videos and curate a balanced dataset to train a robust frame encoder using the student–teacher framework. A spatio-temporal network that combines the resulting frame encoder and an LSTM network is trained with a consistency loss to ensure stability in step predictions. With a 5% improvement in macro F1-score and 13.4% in Edit Score over the SOTA, SurgflowNetdemonstrates a significant improvement in workflow recognition for endoscopic pituitary surgery.

Type: Article
Title: SurgflowNet: Leveraging unannotated video for consistent endoscopic pituitary surgery workflow recognition
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.artmed.2025.103309
Publisher version: https://doi.org/10.1016/j.artmed.2025.103309
Language: English
Additional information: Copyright © 2025 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
Keywords: Endoscopic pituitary surgery, Surgical workflow recognition
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Brain Sciences > UCL Queen Square Institute of Neurology
URI: https://discovery.ucl.ac.uk/id/eprint/10218196
Downloads since deposit
4Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item