UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

An Empirical Study on Dependence Clusters for Effort-Aware Fault-Proneness Prediction

Yang, Y; Harman, M; Krinke, J; Islam, S; Binkley, D; Zhou, Y; Xu, B; (2016) An Empirical Study on Dependence Clusters for Effort-Aware Fault-Proneness Prediction. In: Lo, D and Apel, S and Khurshid, S, (eds.) ASE 2016: Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering. (pp. pp. 296-307). IEEE & ACM: Singapore. Green open access

[thumbnail of cluster-ASE2016.pdf]
Preview
Text
cluster-ASE2016.pdf - Accepted Version

Download (775kB) | Preview

Abstract

A dependence cluster is a set of mutually inter-dependent program elements. Prior studies have found that large dependence clusters are prevalent in software systems. It has been suggested that dependence clusters have potentially harmful effects on software quality. However, little empirical evidence has been provided to support this claim. The study presented in this paper investigates the relationship between dependence clusters and software quality at the function-level with a focus on effort-aware fault-proneness prediction. The investigation first analyzes whether or not larger dependence clusters tend to be more fault-prone. Second, it investigates whether the proportion of faulty functions inside dependence clusters is significantly different from the proportion of faulty functions outside dependence clusters. Third, it examines whether or not functions inside dependence clusters playing a more important role than others are more fault-prone. Finally, based on two groups of functions (i.e., functions inside and outside dependence clusters), the investigation considers a segmented fault-proneness prediction model. Our experimental results, based on five well-known open-source systems, show that (1) larger dependence clusters tend to be more fault-prone; (2) the proportion of faulty functions inside dependence clusters is significantly larger than the proportion of faulty functions outside dependence clusters; (3) functions inside dependence clusters that play more important roles are more fault-prone; (4) our segmented prediction model can significantly improve the effectiveness of effort-aware fault-proneness prediction in both ranking and classification scenarios. These findings help us better understand how dependence clusters influence software quality.

Type: Proceedings paper
Title: An Empirical Study on Dependence Clusters for Effort-Aware Fault-Proneness Prediction
Event: 31st IEEE/ACM International Conference on Automated Software Engineering (ASE 2016)
Location: Singapore, SINGAPORE
Dates: 03 September 2016 - 07 September 2016
ISBN-13: 9781450338455
Open access status: An open access version is available from UCL Discovery
DOI: 10.1145/2970276.2970353
Publisher version: https://doi.org/10.1145/2970276.2970353
Language: English
Additional information: Copyright © 2016 ACM.
Keywords: Dependence clusters, fault-proneness, fault prediction, network analysis
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Computer Science
URI: https://discovery.ucl.ac.uk/id/eprint/1507973
Downloads since deposit
158Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item