UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

Tree regression models using statistical testing and mixed integer programming

Gkioulekas, I; Papageorgiou, LG; (2021) Tree regression models using statistical testing and mixed integer programming. Computers and Industrial Engineering , 153 , Article 107059. 10.1016/j.cie.2020.107059. Green open access

[thumbnail of CAIE_Tree_Regression_Accepted.pdf]
Preview
Text
CAIE_Tree_Regression_Accepted.pdf - Accepted Version

Download (693kB) | Preview

Abstract

Regression analysis is a statistical procedure that fits a mathematical function to a set of data in order to capture the relationship between dependent and independent variables. In tree regression, tree structures are constructed by repeated splits of the input space into two subsets, creating if-then-else rules. Such models are popular in the literature due to their ability to be computed quickly and their simple interpretations. This work introduces a tree regression algorithm that exploits an optimisation model of an existing literature method called Mathematical Programming Tree (MPtree) to optimally split nodes into subsets and applies a statistical test to assess the quality of the partitioning. Additionally, an approach of splitting nodes using multivariate decision rules is explored in this work and compared in terms of performance and computational efficiency. Finally, a novel mathematical model is introduced that performs subset selection on each node in order to select an optimal set of variables to considered for splitting, that improves the computational performance of the proposed algorithm.

Type: Article
Title: Tree regression models using statistical testing and mixed integer programming
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.cie.2020.107059
Publisher version: https://doi.org/10.1016/j.cie.2020.107059
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Mathematical programming; Regression analysis; Decision trees; Subset selection; Optimisation
UCL classification: UCL
UCL > Provost and Vice Provost Offices
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > Dept of Chemical Engineering
URI: https://discovery.ucl.ac.uk/id/eprint/10123829
Downloads since deposit
92Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item