Peng, Zhexi;
Wang, He;
Weng, Yanlin;
Yang, Yin;
Shao, Tianjia;
(2023)
Unsupervised image translation with distributional semantics awareness.
Computational Visual Media
, 9
(3)
pp. 619-631.
10.1007/s41095-022-0295-3.
Preview |
Text
Wang_s1-4.pdf Download (22MB) | Preview |
Abstract
Unsupervised image translation (UIT) studies the mapping between two image domains. Since such mappings are under-constrained, existing research has pursued various desirable properties such as distributional matching or two-way consistency. In this paper, we re-examine UIT from a new perspective: distributional semantics consistency, based on the observation that data variations contain semantics, e.g., shoes varying in colors. Further, the semantics can be multi-dimensional, e.g., shoes also varying in style, functionality, etc. Given two image domains, matching these semantic dimensions during UIT will produce mappings with explicable correspondences, which has not been investigated previously. We propose distributional semantics mapping (DSM), the first UIT method which explicitly matches semantics between two domains. We show that distributional semantics has been rarely considered within and beyond UIT, even though it is a common problem in deep learning. We evaluate DSM on several benchmark datasets, demonstrating its general ability to capture distributional semantics. Extensive comparisons show that DSM not only produces explicable mappings, but also improves image quality in general. [Figure not available: see fulltext.]
Type: | Article |
---|---|
Title: | Unsupervised image translation with distributional semantics awareness |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1007/s41095-022-0295-3 |
Publisher version: | https://doi.org/10.1007/s41095-022-0295-3 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Science & Technology, Technology, Computer Science, Software Engineering, Computer Science, generative adversarial networks (GANs), manifold alignment, unsupervised learning, image-to-image translation, distributional semantics |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/10215215 |
Archive Staff Only
![]() |
View Item |