UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Data-Driven Socio-Economic Deprivation Prediction via Dimensionality Reduction: The Power of Diffusion Maps

Goo, June Moh; (2025) Data-Driven Socio-Economic Deprivation Prediction via Dimensionality Reduction: The Power of Diffusion Maps. In: 2024 IEEE International Conference on Big Data (BigData). (pp. pp. 2949-2957). IEEE: Washington, DC, USA. Green open access

[thumbnail of Goo_abcss_camera_ready_2024.pdf]
Preview
Text
Goo_abcss_camera_ready_2024.pdf - Accepted Version

Download (14MB) | Preview

Abstract

This research proposes a model to predict the location of the most deprived areas in a city using data from the census. Census data is very high-dimensional and needs to be simplified. We use the diffusion map algorithm to reduce dimensionality and find patterns. Features are defined by eigenvectors of the Laplacian matrix that defines the diffusion map. The eigenvectors corresponding to the smallest eigenvalues indicate specific characteristics of the population. Previous work has found qualitatively that the second most important dimension for describing the census data in Bristol, UK is linked to deprivation. In this research, we analyse how good this dimension is as a model for predicting deprivation by comparing it with the recognised measures. The Pearson correlation coefficient was found to be greater than 0.7. The top 10 per cent of deprived areas in the UK, which are also located in Bristol, are extracted to test the accuracy of the model. There are 52 of the most deprived areas, and 38 areas are correctly identified by comparing them to the model. The influence of scores of IMD domains that do not correlate with the models and Eigenvector 2 entries of non-deprived Output Areas cause the model to fail the prediction of 14 deprived areas. The model demonstrates strong performance in predicting future deprivation in the project areas, which is expected to assist in government resource allocation and funding greatly. The codes can be accessed here: https://github.com/junegoo94/diffusion maps.

Type: Proceedings paper
Title: Data-Driven Socio-Economic Deprivation Prediction via Dimensionality Reduction: The Power of Diffusion Maps
Event: 2024 IEEE International Conference on Big Data (BigData)
Dates: 15 Dec 2024 - 18 Dec 2024
Open access status: An open access version is available from UCL Discovery
DOI: 10.1109/bigdata62323.2024.10825109
Publisher version: https://doi.org/10.1109/bigdata62323.2024.10825109
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Machine Learning, Diffusion Maps, Dimensionality Reduction, Socio-Economic Deprivation, Census, GIS
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10204144
Downloads since deposit
5Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item