UCL Discovery
UCL home » Library Services » Electronic resources » UCL Discovery

A machine learning approach to galaxy properties: joint redshift-stellar mass probability distributions with Random Forest

Mucesh, S; Hartley, WG; Palmese, A; Lahav, O; Whiteway, L; Bluck, AFL; Alarcon, A; ... Wilkinson, RD; + view all (2021) A machine learning approach to galaxy properties: joint redshift-stellar mass probability distributions with Random Forest. Monthly Notices of the Royal Astronomical Society , 502 (2) pp. 2770-2786. 10.1093/mnras/stab164. Green open access

[thumbnail of Lahav_stab164.pdf]
Preview
Text
Lahav_stab164.pdf - Published Version

Download (4MB) | Preview

Abstract

We demonstrate that highly accurate joint redshift–stellar mass probability distribution functions (PDFs) can be obtained using the Random Forest (RF) machine learning (ML) algorithm, even with few photometric bands available. As an example, we use the Dark Energy Survey (DES), combined with the COSMOS2015 catalogue for redshifts and stellar masses. We build two ML models: one containing deep photometry in the griz bands, and the second reflecting the photometric scatter present in the main DES survey, with carefully constructed representative training data in each case. We validate our joint PDFs for 10 699 test galaxies by utilizing the copula probability integral transform and the Kendall distribution function, and their univariate counterparts to validate the marginals. Benchmarked against a basic set-up of the template-fitting code BAGPIPES, our ML-based method outperforms template fitting on all of our predefined performance metrics. In addition to accuracy, the RF is extremely fast, able to compute joint PDFs for a million galaxies in just under 6 min with consumer computer hardware. Such speed enables PDFs to be derived in real time within analysis codes, solving potential storage issues. As part of this work we have developed GALPRO 1, a highly intuitive and efficient PYTHON package to rapidly generate multivariate PDFs on-the-fly. GALPRO is documented and available for researchers to use in their cosmology and galaxy evolution studies.

Type: Article
Title: A machine learning approach to galaxy properties: joint redshift-stellar mass probability distributions with Random Forest
Open access status: An open access version is available from UCL Discovery
DOI: 10.1093/mnras/stab164
Publisher version: https://doi.org/10.1093/mnras/stab164
Language: English
Additional information: © 2021 The Author(s). Published by Oxford University Press on behalf of Royal Astronomical Society. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
Keywords: Methods: data analysis, methods: statistical, galaxies: evolution, galaxies: fundamental parameters, software: data analysis, software: public release
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Maths and Physical Sciences > Dept of Physics and Astronomy
URI: https://discovery.ucl.ac.uk/id/eprint/10129194
Downloads since deposit
Loading...
44Downloads
Download activity - last month
Loading...
Download activity - last 12 months
Loading...
Downloads by country - last 12 months
1.United States
3
2.Russian Federation
1

Archive Staff Only

View Item View Item