UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Deep Learning Based Robotic Tool Detection and Articulation Estimation with Spatio-Temporal Layers

Colleoni, E; Moccia, S; Du, X; De Momi, E; Stoyanov, D; (2019) Deep Learning Based Robotic Tool Detection and Articulation Estimation with Spatio-Temporal Layers. IEEE Robotics and Automation Letters , 4 (3) pp. 2714-2721. 10.1109/LRA.2019.2917163. Green open access

[thumbnail of RA_L_Colleoni_2019.pdf]
Preview
Text
RA_L_Colleoni_2019.pdf - Accepted Version

Download (5MB) | Preview

Abstract

Surgical-tool detection from laparoscopic images is an important but challenging task in computer-assisted minimally invasive surgery. Illumination levels, variations in background and the different number of tools in the field of view, all pose difficulties to algorithm and model training. Yet, such challenges could be potentially tackled by exploiting the temporal information in laparoscopic videos to avoid per frame handling of the problem. In this paper, we propose a novel encoderdecoder architecture for surgical instrument detection and articulation joint detection that uses 3D convolutional layers to exploit spatio-temporal features from laparoscopic videos. When tested on benchmark and custom-built datasets, a median Dice similarity coefficient of 85.1% with an interquartile range of 4.6% highlights performance better than the state of the art based on single-frame processing. Alongside novelty of the network architecture, the idea for inclusion of temporal information appears to be particularly useful when processing images with unseen backgrounds during the training phase, which indicates that spatio-temporal features for joint detection help to generalize the solution.

Type: Article
Title: Deep Learning Based Robotic Tool Detection and Articulation Estimation with Spatio-Temporal Layers
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/LRA.2019.2917163
Publisher version: https://doi.org/10.1109/LRA.2019.2917163
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Surgical-tool detection, medical robotics, computer assisted interventions, minimally invasive surgery, surgical vision.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10077809
Downloads since deposit
466Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item