Tian, Y;
Zhang, Z;
Yang, Y;
Chen, Z;
Yang, Z;
Jin, R;
Quek, TQS;
(2024)
An Edge-Cloud Collaboration Framework for Generative AI Service Provision with Synergetic Big Cloud Model and Small Edge Models.
IEEE Network
p. 1.
10.1109/MNET.2024.3420755.
(In press).
Preview |
Text
An_Edge-Cloud_Collaboration_Framework_for_Generative_AI_Service_Provision_with_Synergetic_Big_Cloud_Model_and_Small_Edge_Models.pdf - Accepted Version Download (2MB) | Preview |
Abstract
Generative artificial intelligence (GenAI) offers various services to users through content creation, which is believed to be one of the most important components in future networks. However, training and deploying big artificial intelligence models (BAIMs) introduces substantial computational and communication overhead. This poses a critical challenge to centralized approaches, due to the need of high-performance computing infrastructure and the reliability, secrecy and timeliness issues in long-distance access of cloud services. Therefore, there is an urging need to decentralize the services, partly moving them from the cloud to the edge and establishing native GenAI services to enable private, timely, and personalized experiences. In this paper, we propose a brand-new bottom-up BAIM architecture with synergetic big cloud model and small edge models, and design a distributed training framework and a task-oriented deployment scheme for efficient provision of native GenAI services. The proposed framework can facilitate collaborative intelligence, enhance adaptability, gather edge knowledge and alleviate edge-cloud burden. The effectiveness of the proposed framework is demonstrated through an image generation use case. Finally, we outline fundamental research directions to fully exploit the collaborative potential of edge and cloud for native GenAI and BAIM applications.
Type: | Article |
---|---|
Title: | An Edge-Cloud Collaboration Framework for Generative AI Service Provision with Synergetic Big Cloud Model and Small Edge Models |
Open access status: | An open access version is available from UCL Discovery |
DOI: | 10.1109/MNET.2024.3420755 |
Publisher version: | http://dx.doi.org/10.1109/mnet.2024.3420755 |
Language: | English |
Additional information: | This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions. |
Keywords: | Computational modeling; Data models; Training; Adaptation models; Task analysis; Artificial intelligence; Cloud computing; Generative AI; big AI model; edge-cloud collaboration |
UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng |
URI: | https://discovery.ucl.ac.uk/id/eprint/10194347 |
Archive Staff Only
View Item |