Turkoglu, MO;
Brachmann, E;
Schindler, K;
Brostow, GJ;
Monszpart, A;
(2022)
Visual Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision.
In:
2021 International Conference on 3D Vision (3DV).
(pp. pp. 145-155).
IEEE: London, United Kingdom.
Preview |
PDF
Visual_Camera_Re_Localization.pdf - Accepted Version Download (49MB) | Preview |
Abstract
Visual re-localization means using a single image as input to estimate the camera's location and orientation relative to a pre-recorded environment. The highest-scoring methods are 'structure-based,' and need the query camera's intrinsics as an input to the model,with careful geometric optimization. When intrinsics are absent,methods vie for accuracy by making various other assumptions. This yields fairly good localization scores,but the models are 'narrow' in some way,e.g.,requiring costly test-time computations,or depth sensors,or multiple query frames. In contrast,our proposed method makes few special assumptions,and is fairly lightweight in training and testing.Our pose regression network learns from only relative poses of training scenes. For inference,it builds a graph connecting the query image to training counterparts and uses a graph neural network (GNN) with image representations on nodes and image-pair representations on edges. By efficiently passing messages between them,both representation types are refined to produce a consistent camera pose estimate. We validate the effectiveness of our approach on both standard indoor (7-Scenes) and outdoor (Cambridge Landmarks) camera re-localization benchmarks. Our relative pose regression method matches the accuracy of absolute pose regression networks,while retaining the relative-pose models' test-time speed and ability to generalize to non-training scenes.
Type: | Proceedings paper |
---|---|
Title: | Visual Camera Re-Localization Using Graph Neural Networks and Relative Pose Supervision |
Event: | 2021 International Conference on 3D Vision (3DV) |
Dates: | 1 Dec 2021 - 3 Dec 2021 |
ISBN-13: | 9781665426886 |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1109/3DV53792.2021.00025 |
Publisher version: | http://dx.doi.org/10.1109/3dv53792.2021.00025 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Training, Location awareness, Visualization, Three-dimensional displays, Computational modeling, Cameras, Graph neural networks |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/10194977 |




Archive Staff Only
![]() |
View Item |