Antotsiou, Dafni;
Ciliberto, Carlo;
Kim, Tae-Kyun;
(2021)
Adversarial Imitation Learning with Trajectorial Augmentation and Correction.
In:
Proceedings of the IEEE International Conference on Robotics and Automation (ICRA) 2021.
(pp. pp. 4724-4730).
Institute of Electrical and Electronics Engineers (IEEE)
Preview |
Text
2103.13887.pdf - Accepted Version Download (2MB) | Preview |
Abstract
Deep Imitation Learning requires a large number of expert demonstrations, which are not always easy to obtain, especially for complex tasks. A way to overcome this shortage of labels is through data augmentation. However, this cannot be easily applied to control tasks due to the sequential nature of the problem. In this work, we introduce a novel augmentation method which preserves the success of the augmented trajectories. To achieve this, we introduce a semi-supervised correction network that aims to correct distorted expert actions. To adequately test the abilities of the correction network, we develop an adversarial data augmented imitation architecture to train an imitation agent using synthetic experts. Additionally, we introduce a metric to measure diversity in trajectory datasets. Experiments show that our data augmentation strategy can improve accuracy and convergence time of adversarial imitation while preserving the diversity between the generated and real trajectories.
Type: | Proceedings paper |
---|---|
Title: | Adversarial Imitation Learning with Trajectorial Augmentation and Correction |
Event: | 2021 IEEE International Conference on Robotics and Automation (ICRA) |
Location: | Xi'an, China |
Dates: | 30th May- 5th June 2021 |
ISBN-13: | 978-1-7281-9077-8 |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1109/ICRA48506.2021.9561915 |
Publisher version: | https://doi.org/10.1109/ICRA48506.2021.9561915 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions. |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/10187534 |
Archive Staff Only
![]() |
View Item |