UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation

Lu, Jiawei; Zhang, Yingpeng; Zhao, Zengjun; Wang, He; Zhou, Kun; Shao, Tianjia; (2025) GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation. In: Walsh, T and Shah, J and Kolter, Z, (eds.) Proceedings of the AAAI Conference on Artificial Intelligence. (pp. pp. 5820-5828). Association for the Advancement of Artificial Intelligence: Philadelphia, PA, USA. Green open access

[thumbnail of 2409.18401v1.pdf]
Preview
Text
2409.18401v1.pdf - Accepted Version

Download (21MB) | Preview

Abstract

Large-scale text-guided image diffusion models have demonstrated remarkable results in text-to-image (T2I) generation. However, applying these models to synthesize textures for 3D geometries remains challenging due to the domain gap between 2D images and textures on a 3D surface. Early works that used a projecting-inpainting approach managed to preserve generation diversity, but often resulted in noticeable artifacts and style inconsistencies. While recent methods have attempted to address these inconsistencies, they often introduce other issues, such as blurring, over-saturation, or over-smoothing. To overcome these challenges, we propose a novel text-to-texture synthesis framework that takes advantage of pre-trained diffusion models. We introduce a local attention reweighing mechanism in the self-attention layers to guide the model in focusing on spatial-correlated patches across different views, thereby enhancing local details while preserving cross-view consistency. Additionally, we propose a novel latent space merge pipeline, which further ensures consistency across different viewpoints without sacrificing too much diversity. Our method significantly outperforms existing state-of-the-art techniques in terms of texture consistency and visual quality, while delivering results much faster than distillation-based methods. Importantly, our framework does not require additional training or fine-tuning, making it highly adaptable to a wide range of models available on public platforms.

Type: Proceedings paper
Title: GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation
Event: 39th AAAI Conference on Artificial Intelligence
Location: PA, Philadelphia
Dates: 25 Feb 2025 - 4 Mar 2025
Open access status: An open access version is available from UCL Discovery
DOI: 10.1609/aaai.v39i6.32621
Publisher version: https://doi.org/10.1609/aaai.v39i6.32621
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Science & Technology, Technology, Computer Science, Artificial Intelligence, Computer Science, Interdisciplinary Applications, Computer Science, Theory & Methods, Computer Science
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10215205
Downloads since deposit
1Download
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item