Unsupervised image translation with distributional semantics awareness

Peng, Zhexi; Wang, He; Weng, Yanlin; Yang, Yin; Shao, Tianjia (2023) Unsupervised image translation with distributional semantics awareness. Computational Visual Media, 9 (3), pp. 619-631. DOI: 10.1007/s41095-022-0295-3. Green open access.

Abstract

Unsupervised image translation (UIT) studies the mapping between two image domains. Since such mappings are under-constrained, existing research has pursued various desirable properties such as distributional matching or two-way consistency. In this paper, we re-examine UIT from a new perspective: distributional semantics consistency, based on the observation that data variations contain semantics, e.g., shoes varying in color. Further, the semantics can be multi-dimensional, e.g., shoes also vary in style, functionality, etc. Given two image domains, matching these semantic dimensions during UIT produces mappings with explicable correspondences, which has not been investigated previously. We propose distributional semantics mapping (DSM), the first UIT method that explicitly matches semantics between two domains. We show that distributional semantics has rarely been considered within and beyond UIT, even though it is a common problem in deep learning. We evaluate DSM on several benchmark datasets, demonstrating its general ability to capture distributional semantics. Extensive comparisons show that DSM not only produces explicable mappings, but also improves image quality in general.

Type: Article
Title: Unsupervised image translation with distributional semantics awareness
Open access status: An open access version is available from UCL Discovery
DOI: 10.1007/s41095-022-0295-3
Publisher version: https://doi.org/10.1007/s41095-022-0295-3
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Science & Technology, Technology, Computer Science, Software Engineering, generative adversarial networks (GANs), manifold alignment, unsupervised learning, image-to-image translation, distributional semantics
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10215215