<> <http://www.w3.org/2000/01/rdf-schema#comment> "The repository administrator has not yet configured an RDF license."^^<http://www.w3.org/2001/XMLSchema#string> .
<> <http://xmlns.com/foaf/0.1/primaryTopic> <https://discovery.ucl.ac.uk/id/eprint/10203467> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/Thesis> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/Article> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/dc/terms/title> "Vision-Based Spatial Representations for Robot Manipulation"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/ontology/bibo/abstract> "Robotics focuses on how to control complex mechanical machines to perform useful and complex behaviours. The range of behaviours that machines can exhibit is limited not only by their physical capabilities but also by their ability to perceive the environment. Cheap and abundant RGB cameras are readily available, but using their data for robotic applications is still difficult. To confront this challenge, we leverage the progress made in deep learning and computer vision, employing spatial representations. In this thesis we show how to effectively extract information from visual data, in the form of keypoints, and demonstrate how to use it as a compact and meaningful representation. In the first part of this work we focus on detecting a small number of keypoints from dozens of human annotations. Training on this data alone would not yield a robust detection model. Therefore we introduce a novel way to use unlabelled multi-view data. We show that this gives us a representation which is not only useful for human defined motions, but also for learned agents. A downside of this approach is that it requires retraining the model if the desired points change. We propose a solution to this in the second part of this thesis, where we present a method to learn a latent space of point identities without the need for prior human annotations. We build upon this in the concluding section, where we move towards generalisation to novel, unseen objects. We show how using a point tracking model, such as TAPIR, we can extract task information from a few demonstrations and then reproduce the motion autonomously. This enables programming robots to solve long horizon visuo-motor tasks, such as gluing or block insertion. Notably, it works with unseen objects, without annotations, and is robust to background changes."^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/dc/terms/date> "2025-01-28" .
<https://discovery.ucl.ac.uk/id/document/1817900> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/ontology/bibo/Document> .
<https://discovery.ucl.ac.uk/id/org/ext-a64c3df5861c6582807add1abaadf2af> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Organization> .
<https://discovery.ucl.ac.uk/id/org/ext-a64c3df5861c6582807add1abaadf2af> <http://xmlns.com/foaf/0.1/name> "UCL (University College London)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/dc/terms/issuer> <https://discovery.ucl.ac.uk/id/org/ext-a64c3df5861c6582807add1abaadf2af> .
<https://discovery.ucl.ac.uk/id/org/ext-8f7ed5b3450912d77936e05323506a1f> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Organization> .
<https://discovery.ucl.ac.uk/id/org/ext-8f7ed5b3450912d77936e05323506a1f> <http://xmlns.com/foaf/0.1/name> "Computer Science, UCL (University College London)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/org/ext-8f7ed5b3450912d77936e05323506a1f> <http://purl.org/dc/terms/isPartOf> <https://discovery.ucl.ac.uk/id/org/ext-a64c3df5861c6582807add1abaadf2af> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/dc/terms/issuer> <https://discovery.ucl.ac.uk/id/org/ext-8f7ed5b3450912d77936e05323506a1f> .
<https://discovery.ucl.ac.uk/id/org/ext-a64c3df5861c6582807add1abaadf2af> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/org/ext-8f7ed5b3450912d77936e05323506a1f> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/ontology/bibo/status> <http://purl.org/ontology/bibo/status/unpublished> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/dc/terms/creator> <https://discovery.ucl.ac.uk/id/person/ext-c61dfaa166e49cd05684b5648d5779cc> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/ontology/bibo/authorList> <https://discovery.ucl.ac.uk/id/eprint/10203467#authors> .
<https://discovery.ucl.ac.uk/id/eprint/10203467#authors> <http://www.w3.org/1999/02/22-rdf-syntax-ns#_1> <https://discovery.ucl.ac.uk/id/person/ext-c61dfaa166e49cd05684b5648d5779cc> .
<https://discovery.ucl.ac.uk/id/person/ext-c61dfaa166e49cd05684b5648d5779cc> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://xmlns.com/foaf/0.1/Person> .
<https://discovery.ucl.ac.uk/id/person/ext-c61dfaa166e49cd05684b5648d5779cc> <http://xmlns.com/foaf/0.1/givenName> "Mel"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-c61dfaa166e49cd05684b5648d5779cc> <http://xmlns.com/foaf/0.1/familyName> "Vecerik"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/person/ext-c61dfaa166e49cd05684b5648d5779cc> <http://xmlns.com/foaf/0.1/name> "Mel Vecerik"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/EPrint> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/ThesisEPrint> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://purl.org/dc/terms/isPartOf> <https://discovery.ucl.ac.uk/id/repository> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817900> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1817900> <http://www.w3.org/2000/01/rdf-schema#label> "Vision-Based Spatial Representations for Robot Manipulation (Text)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1817900> <http://eprints.org/ontology/hasFile> <https://discovery.ucl.ac.uk/id/eprint/10203467/7/Vecerik_10203467_Thesis.pdf> .
<https://discovery.ucl.ac.uk/id/document/1817900> <http://purl.org/dc/terms/hasPart> <https://discovery.ucl.ac.uk/id/eprint/10203467/7/Vecerik_10203467_Thesis.pdf> .
<https://discovery.ucl.ac.uk/id/eprint/10203467/7/Vecerik_10203467_Thesis.pdf> <http://www.w3.org/2000/01/rdf-schema#label> "Vecerik_10203467_Thesis.pdf"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1817905> .
<https://discovery.ucl.ac.uk/id/document/1817905> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1817905> <http://www.w3.org/2000/01/rdf-schema#label> "Vision-Based Spatial Representations for Robot Manipulation (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1817905> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817905> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817905> <http://eprints.org/relation/isIndexCodesVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1817906> .
<https://discovery.ucl.ac.uk/id/document/1817906> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1817906> <http://www.w3.org/2000/01/rdf-schema#label> "Vision-Based Spatial Representations for Robot Manipulation (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1817906> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817906> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817906> <http://eprints.org/relation/islightboxThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1817907> .
<https://discovery.ucl.ac.uk/id/document/1817907> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1817907> <http://www.w3.org/2000/01/rdf-schema#label> "Vision-Based Spatial Representations for Robot Manipulation (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1817907> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817907> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817907> <http://eprints.org/relation/ispreviewThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1817908> .
<https://discovery.ucl.ac.uk/id/document/1817908> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1817908> <http://www.w3.org/2000/01/rdf-schema#label> "Vision-Based Spatial Representations for Robot Manipulation (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1817908> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817908> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817908> <http://eprints.org/relation/ismediumThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://eprints.org/ontology/hasDocument> <https://discovery.ucl.ac.uk/id/document/1817909> .
<https://discovery.ucl.ac.uk/id/document/1817909> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://eprints.org/ontology/Document> .
<https://discovery.ucl.ac.uk/id/document/1817909> <http://www.w3.org/2000/01/rdf-schema#label> "Vision-Based Spatial Representations for Robot Manipulation (Other)"^^<http://www.w3.org/2001/XMLSchema#string> .
<https://discovery.ucl.ac.uk/id/document/1817909> <http://eprints.org/relation/isVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817909> <http://eprints.org/relation/isVolatileVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/document/1817909> <http://eprints.org/relation/issmallThumbnailVersionOf> <https://discovery.ucl.ac.uk/id/document/1817900> .
<https://discovery.ucl.ac.uk/id/eprint/10203467> <http://www.w3.org/2000/01/rdf-schema#seeAlso> <https://discovery.ucl.ac.uk/id/eprint/10203467/> .
<https://discovery.ucl.ac.uk/id/eprint/10203467/> <http://purl.org/dc/elements/1.1/title> "HTML Summary of #10203467 \n\nVision-Based Spatial Representations for Robot Manipulation\n\n" .
<https://discovery.ucl.ac.uk/id/eprint/10203467/> <http://purl.org/dc/elements/1.1/format> "text/html" .
<https://discovery.ucl.ac.uk/id/eprint/10203467/> <http://xmlns.com/foaf/0.1/primaryTopic> <https://discovery.ucl.ac.uk/id/eprint/10203467> .