Alexopoulos, Angelos;
Dellaportas, Petros;
Gyoshev, Stanley;
Kotsogiannis, Christos;
Olhede, Sofia C;
Pavkov, Trifon;
(2025)
A network approach to detect Value Added Tax fraud.
Journal of the Royal Statistical Society Series A: Statistics in Society
, Article qnaf205. 10.1093/jrsssa/qnaf205.
(In press).
Preview |
Text
Dellaportas_qnaf205.pdf Download (1MB) | Preview |
Abstract
Value Added Tax (VAT) fraud erodes public revenue and puts legitimate businesses at a disadvantaged position thereby exacerbating inequality. This article develops scalable algorithms to detect fraudulent transactions by leveraging the rich information embedded in the complex, high-dimensional VAT network structure. Supervised methods are not always suitable for VAT fraud detection, as issues in the auditing process—such as selection bias and audit quality—can seriously affect the labelling of businesses as fraudsters or not. Therefore, both supervised and unsupervised techniques in which VAT fraud detection is implemented through a suitably constructed Laplacian matrix informed by business-specific covariates. The developed methods are applied to the universe of Bulgarian VAT data and detect around 50% of the VAT fraud, outperforming well-known techniques that ignore the information provided by the transactional network structure. The proposed methods are automated and can be implemented following taxpayers’ submission of their VAT returns, thus allowing the authorities to prevent large revenue losses.
| Type: | Article |
|---|---|
| Title: | A network approach to detect Value Added Tax fraud |
| Open access status: | An open access version is available from UCL Discovery |
| DOI: | 10.1093/jrsssa/qnaf205 |
| Publisher version: | https://doi.org/10.1093/jrsssa/qnaf205 |
| Language: | English |
| Additional information: | © The Royal Statistical Society 2025. This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
| Keywords: | anomaly detection, big data, heterogeneous data sources, information systems, tax evasion |
| UCL classification: | UCL UCL > Provost and Vice Provost Offices > UCL BEAMS UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Statistical Science |
| URI: | https://discovery.ucl.ac.uk/id/eprint/10220013 |
Archive Staff Only
![]() |
View Item |

