UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Machine-generated text detection prevents language model collapse

Drayson, George; Yilmaz, Emine; Lampos, Vasileios; (2025) Machine-generated text detection prevents language model collapse. In: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. (pp. pp. 29645-29661). Association for Computational Linguistics Green open access

[thumbnail of 2025.emnlp-main.1506.pdf]
Preview
Text
2025.emnlp-main.1506.pdf - Published Version

Download (857kB) | Preview

Abstract

As Large Language Models (LLMs) become increasingly prevalent, their generated outputs are proliferating across the web, risking a future where machine-generated content dilutes human-authored text. Since online data is the primary resource for LLM pre-training, subsequent models could be trained on an unknown portion of synthetic samples. This could lead to model collapse, a degenerative process whereby LLMs reinforce their own errors, reduce output diversity, and ultimately yield declining performance. In this study, we investigate the impact of decoding strategy on model collapse, analysing the text characteristics at each model generation, the similarity to human references, and the resulting model performance. Using the decoding strategies that lead to the most significant degradation, we evaluate model collapse in a more realistic scenario where the origin of the data (human or synthetic) is unknown. We train a machinegenerated text detector and propose an importance resampling approach to prevent model collapse by up-sampling likely human content in the training data. Our method is validated on four LLMs from two model families (GPT-2 and SmolLM2), across a range of model sizes (124M to 1.7B). We demonstrate that it not only prevents model collapse but also improves performance compared to training on purely human data, underscoring the benefit of synthetic samples and the importance of data curation.

Type: Proceedings paper
Title: Machine-generated text detection prevents language model collapse
Event: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Dates: Nov 2025 - Nov 2025
Open access status: An open access version is available from UCL Discovery
DOI: 10.18653/v1/2025.emnlp-main.1506
Publisher version: https://doi.org/10.18653/v1/2025.emnlp-main.1506
Language: English
Additional information: ACL materials are Copyright © 1963–2025 ACL; other materials are copyrighted by their respective copyright holders. Materials prior to 2016 here are licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 International License. Permission is granted to make copies for the purposes of teaching and research. Materials published in or after 2016 are licensed on a Creative Commons Attribution 4.0 International License.
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10217226
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item