UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Bayesian model-based clustering for populations of network data

Mantziou, Anastasia; Lunagómez, Simón; Mitra, Robin; (2024) Bayesian model-based clustering for populations of network data. Annals of Applied Statistics , 18 (1) pp. 266-302. 10.1214/23-AOAS1789. Green open access

[thumbnail of AOASpaper.pdf]
Preview
PDF
AOASpaper.pdf - Accepted Version

Download (5MB) | Preview

Abstract

There is increasing appetite for analysing populations of network data due to the fast-growing body of applications demanding such methods. While methods exist to provide readily interpretable summaries of heterogeneous network populations, these are often descriptive or ad hoc, lacking any formal justification. In contrast, principled analysis methods often provide results difficult to relate back to the applied problem of interest. Motivated by two complementary applied examples, we develop a Bayesian framework to appropriately model complex heterogeneous network populations, while also allowing analysts to gain insights from the data and make inferences most relevant to their needs. The first application involves a study in computer science measuring human movements across a university. The second analyses data from neuroscience investigating relationships between different regions of the brain. While both applications entail analysis of a heterogeneous population of networks, network sizes vary considerably. We focus on the problem of clustering the elements of a network population, where each cluster is characterised by a network representative. We take advantage of the Bayesian machinery to simultaneously infer the cluster membership, the representatives, and the community structure of the representatives, thus allowing intuitive inferences to be made. The implementation of our method on the human movement study reveals interesting movement patterns of individuals in clusters, readily characterised by their network representative. For the brain networks application, our model reveals a cluster of individuals with different network properties of particular interest in neuroscience. The performance of our method is additionally validated in extensive simulation studies.

Type: Article
Title: Bayesian model-based clustering for populations of network data
Open access status: An open access version is available from UCL Discovery
DOI: 10.1214/23-AOAS1789
Publisher version: https://doi.org/10.1214/23-AOAS1789
Language: English
Additional information: This version is the author-accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Bayesian models, clustering, Mixture models, object data analysis, populations of network data
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science
URI: https://discovery.ucl.ac.uk/id/eprint/10188777
Downloads since deposit
13Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item