Abstract
The paper presents an approach for the application of machine learning methods for cross-matching astronomical catalogues. Related works on the cross-matching are analyzed and machine learning methods applied are briefly discussed. The approach is applied for cross-matching of three catalogues: Gaia, SDSS and ALLWISE. Experimental results of application of several machine learning methods for cross-matching these catalogues are presented. Recommendations for the application of the approach in astronomical information systems are proposed.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Software implementation of the algorithms was taken from sklearn [18], lightgbm, and xgboost Python libraries.
- 2.
- 3.
References
Hubble Space Telescope. https://hubblesite.org/. Accessed 20 Mar 2021
Gaia Archive at ESA. https://gea.esac.esa.int/archive/. Accessed 20 Mar 2021
Infrared Astronomical Satellite (IRAS). https://irsa.ipac.caltech.edu/Missions/iras.html. Accessed 20 Mar 2021
Krakowski, T., et al.: Machine-learning identification of galaxies in the WISE SuperCOSMOS all-sky catalogue. Astron. Astrophy. 596, A39 (2016)
Wide-field Infrared Survey Explorer (WISE) and NEOWISE. https://irsa.ipac.caltech.edu/Missions/wise.html. Accessed 20 Mar 2021
SuperCOSMOS Science Archive. http://ssa.roe.ac.uk/WISExSCOS. Accessed 20 Mar 2021
Marrese, P.M., et al.: Gaia data release 2 - cross-match with external catalogues: algorithms and results. Astron. Astrophys. 621, A144 (2019)
Marrese, P.M., et al.: Gaia data release 1 - cross-match with external catalogues: algorithms and results. Astron. Astrophys. 607, A105 (2017)
The Wide-field Infrared Survey Explorer at IPAC. The AllWISE Data Release November 13, 2013. https://wise2.ipac.caltech.edu/docs/release/allwise/. Accessed 20 Mar 2021
The Ninth SDSS Data Release (DR9). https://www.sdss3.org/dr9/. Accessed 20 Mar 2021
Rohde, D.J., et al.: Applying machine learning to catalogue matching in astrophysics. Mon. Not. R. Astron. Soc. 360(1), 69–75 (2005)
Ogle, P.M., et al: Rule-based cross-matching of very large catalogues in NED. arXiv preprint arXiv:1503.01184 (2015)
Shi, X., Budavari, T., Basu, A.: Probabilistic Cross-identification of Multiple Catalogues in Crowded Fields. Astrophys. J. 870, 51 (2019)
Budavari, T., Basu, A.: Probabilistic cross-identification in crowded fields as an assignment problem. Astrophys. J. 152, 86 (2016)
Munkres, J.: Algorithms for the assignment and transportation problems. J. Soc. Ind. Appl. Math. 5, 32–38 (1957)
Soodla J.: A System for cross-matching all-sky surveys. Master thesis, Uppsala Universitet (2019)
Akhmetov, V., Khlamov, S., Dmytrenko, A.: Fast coordinate cross-match tool for large astronomical catalogue. In: Shakhovska, N., Medykovskyy, M.O. (eds.) CSIT 2018. AISC, vol. 871, pp. 3–16. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-01069-0_1
Scikit-learn: Machine Learning in Python. https://scikit-learn.org/stable/index.html. Accessed 20 Mar 2021
Lindegren, L., et al.: Gaia data release 2 the astrometric solution. Astron. Astrophys. 616, A2 (2018)
Acknowledgements
This work uses data from the European Space Agency (ESA) Gaia mission (https://www.cosmos.esa.int/gaia) processed by the Gaia Data Processing and Analysis Consortium (DPAC, https://www.cosmos.esa.int/web/gaia/dpac/consortium). Funding for DPAC was provided by national institutions, in particular institutions participating in the Gaia Multilateral Agreement.
The work is financially supported by the Russian Foundation for Basic Research, project 19-07-01198.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Kulishova, A., Briukhov, D. (2022). Application of Machine Learning Methods for Cross-Matching Astronomical Catalogues. In: Pozanenko, A., Stupnikov, S., Thalheim, B., Mendez, E., Kiselyova, N. (eds) Data Analytics and Management in Data Intensive Domains. DAMDID/RCDL 2021. Communications in Computer and Information Science, vol 1620. Springer, Cham. https://doi.org/10.1007/978-3-031-12285-9_6
Download citation
DOI: https://doi.org/10.1007/978-3-031-12285-9_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-12284-2
Online ISBN: 978-3-031-12285-9
eBook Packages: Computer ScienceComputer Science (R0)