Repository logo
  • English
  • Italiano
Log In
Have you forgotten your password?
  1. Home
  2. PRODOTTI RICERCA INAF
  3. 1 CONTRIBUTI IN RIVISTE (Journal articles)
  4. 1.01 Articoli in rivista
  5. The probabilistic random forest applied to the QUBRICS survey: improving the selection of high-redshift quasars with synthetic data
 

The probabilistic random forest applied to the QUBRICS survey: improving the selection of high-redshift quasars with synthetic data

Journal
MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY  
Date Issued
2022
Author(s)
GUARNERI, Francesco  
•
CALDERONE, GIORGIO  
•
CRISTIANI, Stefano  
•
PORRU, Matteo  
•
FONTANOT, Fabio  
•
Boutsia, Konstantina
•
CUPANI, Guido  
•
GRAZIAN, Andrea  
•
D'ODORICO, Valentina  
•
Murphy, Michael T.
•
BONGIORNO, Angela  
•
SACCHEO, IVANO  
•
NICASTRO, LUCIANO  
DOI
10.1093/mnras/stac2733
Abstract
Several recent works have focused on the search for bright, high-z quasars (QSOs) in the South. Among them, the QUasars as BRIght beacons for Cosmology in the Southern hemisphere (QUBRICS) survey has now delivered hundreds of new spectroscopically confirmed QSOs selected by means of machine learning algorithms. Building upon the results obtained by introducing the probabilistic random forest (PRF) for the QUBRICS selection, we explore in this work the feasibility of training the algorithm on synthetic data to improve the completeness in the higher redshift bins. We also compare the performances of the algorithm if colours are used as primary features instead of magnitudes. We generate synthetic data based on a composite QSO spectral energy distribution. We first train the PRF to identify QSOs among stars and galaxies, then separate high-z quasar from low-z contaminants. We apply the algorithm on an updated data set, based on SkyMapper DR3, combined with Gaia eDR3, 2MASS, and WISE magnitudes. We find that employing colours as features slightly improves the results with respect to the algorithm trained on magnitude data. Adding synthetic data to the training set provides significantly better results with respect to the PRF trained only on spectroscopically confirmed QSOs. We estimate, on a testing data set, a completeness of $\sim 86{{\ \rm per\ cent}}$ and a contamination of $\sim 36{{\ \rm per\ cent}}$. Finally, 206 PRF-selected candidates were observed: 149/206 turned out to be genuine QSOs with z > 2.5, 41 with z < 2.5, 3 galaxies and 13 stars. The result confirms the ability of the PRF to select high-z quasars in large data sets.
Volume
517
Issue
2
Start page
2436
Uri
http://hdl.handle.net/20.500.12386/35378
Url
https://academic.oup.com/mnras/article/517/2/2436/6747158
http://arxiv.org/abs/2209.07257v2
Issn Identifier
0035-8711
Ads BibCode
2022MNRAS.517.2436G
Rights
open.access
File(s)
Loading...
Thumbnail Image
Name

guarneri2022.pdf

Description
PDF editoriale
Size

4.69 MB

Format

Adobe PDF

Checksum (MD5)

cc7f2f648db1dcbd8bac5fc2c11438e0

Explore By
  • Communities and Collection
  • Research Outputs
  • Researchers
  • Organizations
  • Projects
Information and guides for authors
  • https://openaccess-info.inaf.it: all about open access in INAF
  • How to enter a product: guides to OA@INAF
  • The INAF Policy on Open Access
  • Downloadable documents and templates

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Privacy policy
  • End User Agreement
  • Send Feedback