UCL Discovery

The Patch Overfitting Problem in Automated Program Repair: Practical Magnitude and a Baseline for Realistic Benchmarking

Petke, Justyna; Martinez, Matias; Kechagia, Maria; Aleti, Aldeida; Sarro, Federica; (2024) The Patch Overfitting Problem in Automated Program Repair: Practical Magnitude and a Baseline for Realistic Benchmarking. In: Companion Proceedings of the 32nd ACM International Conference on the Foundations of Software Engineering (FSE Companion ’24). (pp. 452-456). Association for Computing Machinery (ACM): New York, NY, USA. Green open access

Abstract

Automated program repair (APR) techniques aim to generate patches for software bugs, mainly relying on testing to check their validity. The generation of a large number of plausible yet incorrect patches is widely believed to hinder wider application of APR in practice, which has motivated research in automated patch assessment. We reflect on the validity of this motivation and carry out an empirical study to analyse the extent to which 10 APR tools suffer from the overfitting problem in practice. We observe that the number of plausible patches generated by any of the APR tools analysed for a given bug from the Defects4J dataset is remarkably low, a median of 2, indicating that in most cases a developer only needs to consider 2 patches to be confident of either finding a fix or confirming that none exists. This study suggests that the overfitting problem might not be as bad as previously thought. We reflect on current evaluation strategies of automated patch assessment techniques and propose a Random Selection baseline to assess whether and when using such techniques is beneficial for reducing human effort. We advocate that future work evaluate the benefit of using patch overfitting assessment against this random baseline.
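
The abstract does not spell out how the Random Selection baseline is computed, so the following is only an illustrative sketch under a stated assumption: the developer inspects the plausible patches for a bug in uniformly random order and stops at the first correct one, or inspects all of them if none is correct. The function names `expected_inspections` and `simulate` are hypothetical and are not taken from the paper.

```python
from fractions import Fraction
import random


def expected_inspections(n_plausible: int, n_correct: int) -> Fraction:
    """Expected number of patches inspected under random ordering.

    Assumption (not from the paper): the developer stops at the first
    correct patch, and inspects every patch when none is correct.
    """
    if n_plausible < 0 or not 0 <= n_correct <= n_plausible:
        raise ValueError("need 0 <= n_correct <= n_plausible")
    if n_correct == 0:
        return Fraction(n_plausible)
    # Expected position of the first of k marked items in a random
    # permutation of n items is (n + 1) / (k + 1).
    return Fraction(n_plausible + 1, n_correct + 1)


def simulate(n_plausible: int, n_correct: int,
             trials: int = 20000, seed: int = 0) -> float:
    """Monte Carlo estimate of the same quantity, as a sanity check."""
    rng = random.Random(seed)
    labels = [True] * n_correct + [False] * (n_plausible - n_correct)
    total = 0
    for _ in range(trials):
        rng.shuffle(labels)
        total += labels.index(True) + 1 if n_correct else n_plausible
    return total / trials
```

Under this reading, a bug with 2 plausible patches, one of them correct, costs 1.5 inspections on average, so a patch assessment technique would need to beat that figure to reduce human effort for such a bug.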

Type: Proceedings paper
Title: The Patch Overfitting Problem in Automated Program Repair: Practical Magnitude and a Baseline for Realistic Benchmarking
Event: FSE 2024 Ideas, Visions and Reflections Track (FSE-IVR 2024)
Location: Porto de Galinhas, Brazil
Dates: 15 Jul 2024 - 19 Jul 2024
ISBN-13: 9798400706585
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/3663529.3663776
Publisher version: https://doi.org/10.1145/3663529.3663776
Language: English
Additional information: © The Author(s), 2024. This is an Open Access article distributed under the terms of the Creative Commons Attribution Licence (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. https://creativecommons.org/licenses/by/4.0/
Keywords: Automated Program Repair, Overfitting, Patch Assessment
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/10192372