TY - JOUR
T1 - Euclid preparation : XXII. Selection of quiescent galaxies from mock photometry using machine learning
AU - Humphrey, A.
AU - Bisigello, L.
AU - Cunha, P. A.C.
AU - Bolzonella, M.
AU - Fotopoulou, S.
AU - Caputi, K.
AU - Tortora, C.
AU - Zamorani, G.
AU - Papaderos, P.
AU - Vergani, D.
AU - Brinchmann, J.
AU - Moresco, M.
AU - Amara, A.
AU - Auricchio, N.
AU - Baldi, M.
AU - Bender, R.
AU - Bonino, D.
AU - Branchini, E.
AU - Brescia, M.
AU - Camera, S.
AU - Capobianco, V.
AU - Carbone, C.
AU - Carretero, J.
AU - Castander, F. J.
AU - Castellano, M.
AU - Cavuoti, S.
AU - Cimatti, A.
AU - Cledassou, R.
AU - Congedo, G.
AU - Conselice, C. J.
AU - Conversi, L.
AU - Copin, Y.
AU - Corcione, L.
AU - Courbin, F.
AU - Cropper, M.
AU - Da Silva, A.
AU - Degaudenzi, H.
AU - Douspis, M.
AU - Dubath, F.
AU - Duncan, C. A.J.
AU - Dupac, X.
AU - Dusini, S.
AU - Farrens, S.
AU - Ferriol, S.
AU - Frailis, M.
AU - Niemi, S. M.
AU - Schneider, P.
AU - Wang, Y.
AU - Gozaliasl, G.
AU - Sánchez, A. G.
AU - Euclid Collaboration
N1 - Funding Information:
This work was supported by Fundação para a Ciência e a Tecnologia (FCT) through grants UID/FIS/04434/2019, UIDB/04434/2020, UIDP/04434/2020 and PTDC/FIS-AST/29245/2017, and an FCT-CAPES Transnational Cooperation Project. LB acknowledges financial support by the Agenzia Spaziale Italiana (ASI) under the research contract 2018-31-HH.0. KIC acknowledges funding from the European Research Council through the award of the Consolidator Grant ID 681627-BUILDUP. AH acknowledges support from the NVIDIA Academic Hardware Grant Program. AH also thanks colleagues Jean Gomes, Joâo Pedroso, Catarina Lobo, Tom Scott, Ana Afonso, Patricio Lagos, Israel Matute, Stergios Amarantidis, Jose Afonso, Rodrigo Carvajal, and Ciro Pappalardo for useful discussions or comments. The Euclid Consortium acknowledges the European Space Agency and a number of agencies and institutes that have supported the development of Euclid, in particular the Academy of Finland, the Agenzia Spaziale Italiana, the Belgian Science Policy, the Canadian Euclid Consortium, the Centre National d'Etudes Spatiales, the Deutsches Zentrum für Luft- und Raumfahrt, the Danish Space Research Institute, the Fundação para a Ciência e a Tecnologia, the Ministerio de Economia y Competitividad, the National Aeronautics and Space Administration, the National Astronomical Observatory of Japan, the Netherlandse Onderzoekschool Voor Astronomie, the Norwegian Space Agency, the Romanian Space Agency, the State Secretariat for Education, Research and Innovation (SERI) at the Swiss Space Office (SSO), and the United Kingdom Space Agency. A complete and detailed list is available on the Euclid web site ( http://www.euclid-ec.org ). In the development of our pipeline, we have made use of the Pandas (McKinney 2010), Numpy (Harris et al. 2020), Scipy (Virtanen et al. 2020) and Dask (Rocklin 2015) packages for Python.
Publisher Copyright:
© 2023 The Authors.
PY - 2023/3/1
Y1 - 2023/3/1
N2 - The Euclid Space Telescope will provide deep imaging at optical and near-infrared wavelengths, along with slitless near-infrared spectroscopy, across ~15 000deg2 of the sky. Euclid is expected to detect ~12 billion astronomical sources, facilitating new insights into cosmology, galaxy evolution, and various other topics. In order to optimally exploit the expected very large dataset, appropriate methods and software tools need to be developed. Here we present a novel machine-learning-based methodology for the selection of quiescent galaxies using broadband Euclid IE, YE, JE, and HE photometry, in combination with multi-wavelength photometry from other large surveys (e.g. the Rubin LSST). The ARIADNE pipeline uses meta-learning to fuse decision-tree ensembles, nearest-neighbours, and deep-learning methods into a single classifier that yields significantly higher accuracy than any of the individual learning methods separately. The pipeline has been designed to have 'sparsity awareness', such that missing photometry values are informative for the classification. In addition, our pipeline is able to derive photometric redshifts for galaxies selected as quiescent, aided by the 'pseudo-labelling' semi-supervised method, and using an outlier detection algorithm to identify and reject likely catastrophic outliers. After the application of the outlier filter, our pipeline achieves a normalised mean absolute deviation of âÂ? ²0.03 and a fraction of catastrophic outliers of âÂ? ²0.02 when measured against the COSMOS2015 photometric redshifts. We apply our classification pipeline to mock galaxy photometry catalogues corresponding to three main scenarios: (i) Euclid Deep Survey photometry with ancillary ugriz, WISE, and radio data; (ii) Euclid Wide Survey photometry with ancillary ugriz, WISE, and radio data; and (iii) Euclid Wide Survey photometry only, with no foreknowledge of galaxy redshifts. In a like-for-like comparison, our classification pipeline outperforms UVJ selection, in addition to the Euclid IE YE, JE HE and u IE, IE JE colour-colour methods, with improvements in completeness and the F1-score (the harmonic mean of precision and recall) of up to a factor of 2.
AB - The Euclid Space Telescope will provide deep imaging at optical and near-infrared wavelengths, along with slitless near-infrared spectroscopy, across ~15 000deg2 of the sky. Euclid is expected to detect ~12 billion astronomical sources, facilitating new insights into cosmology, galaxy evolution, and various other topics. In order to optimally exploit the expected very large dataset, appropriate methods and software tools need to be developed. Here we present a novel machine-learning-based methodology for the selection of quiescent galaxies using broadband Euclid IE, YE, JE, and HE photometry, in combination with multi-wavelength photometry from other large surveys (e.g. the Rubin LSST). The ARIADNE pipeline uses meta-learning to fuse decision-tree ensembles, nearest-neighbours, and deep-learning methods into a single classifier that yields significantly higher accuracy than any of the individual learning methods separately. The pipeline has been designed to have 'sparsity awareness', such that missing photometry values are informative for the classification. In addition, our pipeline is able to derive photometric redshifts for galaxies selected as quiescent, aided by the 'pseudo-labelling' semi-supervised method, and using an outlier detection algorithm to identify and reject likely catastrophic outliers. After the application of the outlier filter, our pipeline achieves a normalised mean absolute deviation of âÂ? ²0.03 and a fraction of catastrophic outliers of âÂ? ²0.02 when measured against the COSMOS2015 photometric redshifts. We apply our classification pipeline to mock galaxy photometry catalogues corresponding to three main scenarios: (i) Euclid Deep Survey photometry with ancillary ugriz, WISE, and radio data; (ii) Euclid Wide Survey photometry with ancillary ugriz, WISE, and radio data; and (iii) Euclid Wide Survey photometry only, with no foreknowledge of galaxy redshifts. In a like-for-like comparison, our classification pipeline outperforms UVJ selection, in addition to the Euclid IE YE, JE HE and u IE, IE JE colour-colour methods, with improvements in completeness and the F1-score (the harmonic mean of precision and recall) of up to a factor of 2.
KW - Galaxies: evolution
KW - Galaxies: general
KW - Galaxies: high-redshift
KW - Galaxies: photometry
KW - Methods: statistical
UR - http://www.scopus.com/inward/record.url?scp=85150446861&partnerID=8YFLogxK
U2 - 10.1051/0004-6361/202244307
DO - 10.1051/0004-6361/202244307
M3 - Article
AN - SCOPUS:85150446861
SN - 0004-6361
VL - 671
JO - Astronomy and Astrophysics
JF - Astronomy and Astrophysics
M1 - A99
ER -