UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Learned Single-Pass Multitasking Perceptual Graphics for Immersive Displays

Yılmaz, Doğa; Wang, He; Takikawa, Towaki; Ceylan, Duygu; Akşit, Kaan; (2025) Learned Single-Pass Multitasking Perceptual Graphics for Immersive Displays. In: MM '25: Proceedings of the 33rd ACM International Conference on Multimedia. (pp. pp. 10719-10727). ACM Green open access

[thumbnail of Wang_3746027.3754801.pdf]
Preview
Text
Wang_3746027.3754801.pdf

Download (33MB) | Preview

Abstract

Emerging immersive display technologies efficiently utilize resources with perceptual graphics methods such as foveated rendering and denoising. Running multiple perceptual graphics methods challenges devices with limited power and computational resources. We propose a computationally-lightweight learned multitasking perceptual graphics model. Given RGB images and text-prompts, our model performs text-described perceptual tasks in a single inference step. Simply daisy-chaining multiple models or training dedicated models can lead to model management issues and exhaust computational resources. In contrast, our flexible method unlocks consistent high quality perceptual effects with reasonable compute, supporting various permutations at varied intensities using adjectives in text prompts (e.g., ''mildly'', ''lightly''). Text-guidance provides ease of use for dynamic requirements such as creative processes. To train our model, we propose a dataset containing source and perceptually enhanced images with corresponding text prompts. We evaluate our model on desktop and embedded platforms and validate perceptual quality through a user study.

Type: Proceedings paper
Title: Learned Single-Pass Multitasking Perceptual Graphics for Immersive Displays
Event: MM '25: The 33rd ACM International Conference on Multimedia
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/3746027.3754801
Publisher version: https://doi.org/10.1145/3746027.3754801
Language: English
Additional information: © 2025 Copyright held by the owner/author(s). This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: Perceptual Graphics, Immersive Displays, Generative Multimedia
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10216569
Downloads since deposit
57Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item