Yılmaz, Doğa;
Wang, He;
Takikawa, Towaki;
Ceylan, Duygu;
Akşit, Kaan;
(2025)
Learned Single-Pass Multitasking Perceptual Graphics for Immersive Displays.
In:
MM '25: Proceedings of the 33rd ACM International Conference on Multimedia.
(pp. pp. 10719-10727).
ACM
Preview |
Text
Wang_3746027.3754801.pdf Download (33MB) | Preview |
Abstract
Emerging immersive display technologies efficiently utilize resources with perceptual graphics methods such as foveated rendering and denoising. Running multiple perceptual graphics methods challenges devices with limited power and computational resources. We propose a computationally-lightweight learned multitasking perceptual graphics model. Given RGB images and text-prompts, our model performs text-described perceptual tasks in a single inference step. Simply daisy-chaining multiple models or training dedicated models can lead to model management issues and exhaust computational resources. In contrast, our flexible method unlocks consistent high quality perceptual effects with reasonable compute, supporting various permutations at varied intensities using adjectives in text prompts (e.g., ''mildly'', ''lightly''). Text-guidance provides ease of use for dynamic requirements such as creative processes. To train our model, we propose a dataset containing source and perceptually enhanced images with corresponding text prompts. We evaluate our model on desktop and embedded platforms and validate perceptual quality through a user study.
| Type: | Proceedings paper |
|---|---|
| Title: | Learned Single-Pass Multitasking Perceptual Graphics for Immersive Displays |
| Event: | MM '25: The 33rd ACM International Conference on Multimedia |
| Open access status: | An open access version is available from UCL Discovery |
| DOI: | 10.1145/3746027.3754801 |
| Publisher version: | https://doi.org/10.1145/3746027.3754801 |
| Language: | English |
| Additional information: | © 2025 Copyright held by the owner/author(s). This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
| Keywords: | Perceptual Graphics, Immersive Displays, Generative Multimedia |
| UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science |
| URI: | https://discovery.ucl.ac.uk/id/eprint/10216569 |
Archive Staff Only
![]() |
View Item |

