UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Fluid Antenna System Liberating Multiuser MIMO for ISAC via Deep Reinforcement Learning

Wang, C; Li, G; Zhang, H; Wong, KK; Li, Z; Ng, DWK; Chae, CB; (2024) Fluid Antenna System Liberating Multiuser MIMO for ISAC via Deep Reinforcement Learning. IEEE Transactions on Wireless Communications p. 1. 10.1109/TWC.2024.3376800. (In press). Green open access

[thumbnail of Wong_Fluid Antenna System Liberating Multiuser MIMO for ISAC via Deep Reinforcement Learning_AAM.pdf]
Preview
Text
Wong_Fluid Antenna System Liberating Multiuser MIMO for ISAC via Deep Reinforcement Learning_AAM.pdf

Download (9MB) | Preview

Abstract

The aim of this paper is to enhance the performance of an integrated sensing and communications (ISAC) system in the multiuser multiple-input multiple-output (MIMO) downlink in which a two-dimensional (2D) fluid antenna system (FAS) with multiple activated ports is employed at the base station (BS) to maximize the sum-rate of the downlink users subject to a sensing constraint. The unique feature of this setup is that the locations of the antenna ports at the FAS can be optimized jointly with the precoding design to achieve a higher sum-rate. The required optimization problem is however NP-hard. To overcome this, we start by considering the perfect channel state information (CSI) scenario where all the port CSI is available. Deep reinforcement learning is utilized to build an end-to-end learning framework for the joint optimization problem. In particular, by fixing the activated ports, we adopt a primal-dual based learning algorithm to design a constraint-aware neural network for optimizing the ISAC precoder. Then, by using the neural precoding network to calculate the reward, we adopt the deep reinforcement learning algorithm to design the port selection and precoder jointly. An advantage actor and critic (A2C) algorithm is proposed to train the policy, in which the actor network uses the pointer network to learn the stochastic policy and the critic network adopts the Long Short-Term Memory (LSTM) encoder architecture to learn the expected reward from the observations. Afterwards, the partial CSI case is addressed, where we propose a masked autoencoder (MAE) induced channel extrapolation for predicting all the CSI to facilitate the joint design. Simulation results demonstrate the promising performance of using FAS for multiuser MIMO and also validate the proposed learning-based scheme.

Type: Article
Title: Fluid Antenna System Liberating Multiuser MIMO for ISAC via Deep Reinforcement Learning
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/TWC.2024.3376800
Publisher version: http://dx.doi.org/10.1109/twc.2024.3376800
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher's terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10191113
Downloads since deposit
43Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item