UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Improving Generalization of Deep Networks for Estimating Physical Properties of Containers and Fillings

Wang, H; Zhu, C; Ma, Z; Oh, C; (2022) Improving Generalization of Deep Networks for Estimating Physical Properties of Containers and Fillings. In: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings. (pp. pp. 9147-9151). IEEE: Singapore. Green open access

[thumbnail of 2203.01192.pdf]
Preview
Text
2203.01192.pdf - Accepted Version

Download (1MB) | Preview

Abstract

We present methods to estimate the physical properties of household containers and their fillings manipulated by humans. We use a lightweight, pre-trained convolutional neural network with coordinate attention as a backbone model of the pipelines to accurately locate the object of interest and estimate the physical properties in the CORSMAL Containers Manipulation (CCM) dataset. We address the filling type classification with audio data and then combine this information from audio with video modalities to address the filling level classification. For the container capacity, dimension, and mass estimation, we present a data augmentation and consistency measurement to alleviate the over-fitting issue in the CCM dataset caused by the limited number of containers. We augment the training data using an object-of-interest-based re-scaling that increases the variety of physical values of the containers. We then perform the consistency measurement to choose a model with low prediction variance in the same containers under different scenes, which ensures the generalization ability of the model. Our method improves the generalization ability of the models to estimate the property of the containers that were not previously seen in the training.

Type: Proceedings paper
Title: Improving Generalization of Deep Networks for Estimating Physical Properties of Containers and Fillings
Event: ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
Dates: 23 May 2022 - 27 May 2022
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/ICASSP43922.2022.9747349
Publisher version: https://doi.org/10.1109/ICASSP43922.2022.9747349
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Training , Solid modeling , Three-dimensional displays , Transfer learning , Training data , Containers , Predictive models
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10161226
Downloads since deposit
43Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item