UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

An overview on synthetic administrative data for research

Kokosi, Theodora; De Stavola, Bianca; Mitra, Robin; Frayling, Lora; Doherty, Aiden; Dove, Iain; Sonnenberg, Pam; (2022) An overview on synthetic administrative data for research. International Journal of Population Data Science , 7 (1) , Article 10. 10.23889/ijpds.v7i1.1727. Green open access

[thumbnail of Kokosi_An overview on synthetic administrative data for research.pdf]
Preview
Text
Kokosi_An overview on synthetic administrative data for research.pdf - Published Version

Download (483kB) | Preview

Abstract

Use of administrative data for research and for planning services has increased over recent decades due to the value of the large, rich information available. However, concerns about the release of sensitive or personal data and the associated disclosure risk can lead to lengthy approval processes and restricted data access. This can delay or prevent the production of timely evidence. A promising solution to facilitate more efficient data access is to create synthetic versions of the original datasets which do not hold any confidential information and can minimise disclosure risk. Such data may be used as an interim solution, allowing researchers to develop their analysis plans on non-disclosive data, whilst waiting for access to the real data. We aim to provide an overview of the background and uses of synthetic data, describe common methods used to generate synthetic data in the context of UK administrative research, propose a simplified terminology for categories of synthetic data, and illustrate challenges and future directions for research.

Type: Article
Title: An overview on synthetic administrative data for research
Open access status: An open access version is available from UCL Discovery
DOI: 10.23889/ijpds.v7i1.1727
Publisher version: https://doi.org/10.23889/ijpds.v7i1.1727
Language: English
Additional information: This work is licensed under a Creative Commons Attribution 4.0 International License.
Keywords: Synthetic data, administrative datasets, data linkage, statistical disclosure control, data utility, data confidentiality
UCL classification: UCL
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health > Population, Policy and Practice Dept
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences > Faculty of Population Health Sciences > UCL GOS Institute of Child Health
UCL > Provost and Vice Provost Offices > School of Life and Medical Sciences
URI: https://discovery.ucl.ac.uk/id/eprint/10149571
Downloads since deposit
148Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item