UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Feature Aggregation Decoder for Segmenting Laparoscopic Scenes

Kadkhodamohammadi, A; Luengo, I; Barbarisi, S; Taleb, H; Flouty, E; Stoyanov, D; (2019) Feature Aggregation Decoder for Segmenting Laparoscopic Scenes. In: Zhou, L and Sarikaya, D and Kia, SM and Speidel, S and Malpani, A and Hashimoto, D and Habes, M and Lofstedt, T and Ritter, K and Wang, H, (eds.) OR 2.0 Context-Aware Operating Theaters and Machine Learning in Clinical Neuroimaging. (pp. pp. 3-11). Springer: Cham, Switzerland. Green open access

[thumbnail of Feature Aggregation Decoder for Segmenting Laparoscopic Scenes.pdf]
Preview
Text
Feature Aggregation Decoder for Segmenting Laparoscopic Scenes.pdf - Accepted Version

Download (5MB) | Preview

Abstract

Laparoscopic scene segmentation is one of the key building blocks required for developing advanced computer assisted interventions and robotic automation. Scene segmentation approaches often rely on encoder-decoder architectures that encode a representation of the input to be decoded to semantic pixel labels. In this paper, we propose to use the deep Xception model for the encoder and a simple yet effective decoder that relies on a feature aggregation module. Our feature aggregation module constructs a mapping function that reuses and transfers encoder features and combines information across all feature scales to build a richer representation that keeps both high-level context and low-level boundary information. We argue that this aggregation module enables us to simplify the decoder and reduce the number of parameters in the decoder. We have evaluated our approach on two datasets and our experimental results show that our model outperforms state-of-the-art models on the same experimental setup and significantly improves the previous results, 98.44% vs 89.00% , on the EndoVis’15 dataset.

Type: Proceedings paper
Title: Feature Aggregation Decoder for Segmenting Laparoscopic Scenes
Event: OR 2.0: International Workshop on OR 2.0 Context-Aware Operating Theaters / MLCN: International Workshop on Machine Learning in Clinical Neuroimaging
Location: Shenzhen, PEOPLES R CHINA
Dates: 13 October 2019 - 17 October 2019
ISBN-13: 978-3-030-32694-4
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/978-3-030-32695-1_1
Publisher version: https://doi.org/10.1007/978-3-030-32695-1_1
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10119199
Downloads since deposit
123Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item