Lu, Jiawei;
Zhang, Yingpeng;
Zhao, Zengjun;
Wang, He;
Zhou, Kun;
Shao, Tianjia;
(2025)
GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation.
In: Walsh, T and Shah, J and Kolter, Z, (eds.)
Proceedings of the AAAI Conference on Artificial Intelligence.
(pp. pp. 5820-5828).
Association for the Advancement of Artificial Intelligence: Philadelphia, PA, USA.
Preview |
Text
2409.18401v1.pdf - Accepted Version Download (21MB) | Preview |
Abstract
Large-scale text-guided image diffusion models have demonstrated remarkable results in text-to-image (T2I) generation. However, applying these models to synthesize textures for 3D geometries remains challenging due to the domain gap between 2D images and textures on a 3D surface. Early works that used a projecting-inpainting approach managed to preserve generation diversity, but often resulted in noticeable artifacts and style inconsistencies. While recent methods have attempted to address these inconsistencies, they often introduce other issues, such as blurring, over-saturation, or over-smoothing. To overcome these challenges, we propose a novel text-to-texture synthesis framework that takes advantage of pre-trained diffusion models. We introduce a local attention reweighing mechanism in the self-attention layers to guide the model in focusing on spatial-correlated patches across different views, thereby enhancing local details while preserving cross-view consistency. Additionally, we propose a novel latent space merge pipeline, which further ensures consistency across different viewpoints without sacrificing too much diversity. Our method significantly outperforms existing state-of-the-art techniques in terms of texture consistency and visual quality, while delivering results much faster than distillation-based methods. Importantly, our framework does not require additional training or fine-tuning, making it highly adaptable to a wide range of models available on public platforms.
Type: | Proceedings paper |
---|---|
Title: | GenesisTex2: Stable, Consistent and High-Quality Text-to-Texture Generation |
Event: | 39th AAAI Conference on Artificial Intelligence |
Location: | PA, Philadelphia |
Dates: | 25 Feb 2025 - 4 Mar 2025 |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1609/aaai.v39i6.32621 |
Publisher version: | https://doi.org/10.1609/aaai.v39i6.32621 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Science & Technology, Technology, Computer Science, Artificial Intelligence, Computer Science, Interdisciplinary Applications, Computer Science, Theory & Methods, Computer Science |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
URI: | https://discovery.ucl.ac.uk/id/eprint/10215205 |
Archive Staff Only
![]() |
View Item |