UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Spectral MAB for Unknown Graph Processes

Toni, L; Frossard, P; (2018) Spectral MAB for Unknown Graph Processes. In: Proceedings of the 26th European Signal Processing Conference (EUSIPCO 2018). (pp. pp. 116-120). IEEE Green open access

[thumbnail of EUSIPCO_V3.pdf]
Preview
Text
EUSIPCO_V3.pdf - Accepted Version

Download (914kB) | Preview

Abstract

In this work, we study graph-based multi-arms bandit (MAB) problems aimed at optimizing actions on irregular and high-dimensional graphs. More formally, we consider a decision-maker that takes sequential actions over time and observes the experienced reward, defined as a function of a sparse graph signal. The goal is to optimize the action policy, which maximizes the reward experienced over time. The main challenges are represented by the system uncertainty (i.e., unknown parameters of the sparse graph signal model) and the high-dimensional search space. The uncertainty can be faced by online learning strategies that infer the system dynamics while taking the appropriate actions. However, the high-dimensionality makes online learning strategies highly inefficient. To overcome this limitation, we propose a novel graph-based MAB algorithm, which is data-efficient also in high-dimensional systems. The key intuition is to infer the nature of the graph processes by learning in the graph-spectral domain, and exploit this knowledge while optimizing the actions. In particular, we model the graph signal with a sparse dictionary-based representation and we propose an online sequential decision strategy that learns the parameters of the graph processes while optimizing the action strategy.

Type: Proceedings paper
Title: Spectral MAB for Unknown Graph Processes
Event: 26th European Signal Processing Conference (EUSIPCO 2018)
Location: Rome
Dates: 03 September 2018 - 07 September 2018
Open access status: An open access version is available from UCL Discovery
DOI: 10.23919/EUSIPCO.2018.8553372
Publisher version: http://doi.org/10.23919/EUSIPCO.2018.8553372
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Electronic and Electrical Eng
URI: https://discovery.ucl.ac.uk/id/eprint/10050384
Downloads since deposit
114Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item