UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Optimal Piecewise Linear Regression Algorithm for QSAR Modelling

Cardoso-Silva, J; Papadatos, G; Papageorgiou, LG; Tsoka, S; (2019) Optimal Piecewise Linear Regression Algorithm for QSAR Modelling. Molecular Informatics , 38 (3) , Article 1800028. 10.1002/minf.201800028. Green open access

[thumbnail of minf.201800028R1_manuscript.pdf]
Preview
Text
minf.201800028R1_manuscript.pdf - Accepted Version

Download (326kB) | Preview

Abstract

Quantitative Structure‐Activity Relationship (QSAR) models have been successfully applied to lead optimisation, virtual screening and other areas of drug discovery over the years. Recent studies, however, have focused on the development of models that are predictive but often not interpretable. In this article, we propose the application of a piecewise linear regression algorithm, OPLRAreg, to develop both predictive and interpretable QSAR models. The algorithm determines a feature to best separate the data into regions and identifies linear equations to predict the outcome variable in each region. A regularisation term is introduced to prevent overfitting problems and implicitly selects the most informative features. As OPLRAreg is based on mathematical programming, a flexible and transparent representation for optimisation problems, the algorithm also permits customised constraints to be easily added to the model. The proposed algorithm is presented as a more interpretable alternative to other commonly used machine learning algorithms and has shown comparable predictive accuracy to Random Forest, Support Vector Machine and Random Generalised Linear Model on tests with five QSAR data sets compiled from the ChEMBL database.

Type: Article
Title: Optimal Piecewise Linear Regression Algorithm for QSAR Modelling
Open access status: An open access version is available from UCL Discovery
DOI: 10.1002/minf.201800028
Publisher version: https://doi.org/10.1002/minf.201800028
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: qsar, regression, piecewise regression, mathematical programming, integer programming
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Chemical Engineering
URI: https://discovery.ucl.ac.uk/id/eprint/10072986
Downloads since deposit
0Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item