Abstract
Classification of periodic variable light curves is important for scientific knowledge discovery and efficient use of telescopic resources for source follow-up. In practice, labeled light curves from catalogs with hundreds of flux measurements (the training set) may be used to classify curves from ongoing surveys with tens of flux measurements (the test set). Statistical classifiers generally assume that the probability of class given light curve features is the same for training and test sets. This assumption is unlikely to hold when the number of flux measurements per light curve varies widely between the two sets. We employ two methods to correct the problem—noisification and denoisification. With noisification we alter the training set to mimic the distribution of the test set and then construct a classifier on these altered data. With denoisification we construct a classifier on the well-sampled curves in the training set and probabilistically infer what poorly sampled curves in the test set would look like if we continued obtaining flux measurements. On periodic variable sources from a simulated data set and the OGLE survey, both of these methods outperform making no adjustments for training–test set differences.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Breiman L, Freidman J, Olshen R, Stone C (1984) Classification and regression trees. Wadsworth, Belmont, CA
Butler NR, Bloom JS (2011) Optimal time-series selection of quasars. Astron J 141(3):93
Debosscher J, Sarro L, Aerts C, Cuypers J, Vandenbussche B, Garrido R, Solano E (2007) Automated supervised classification of variable stars. Astron Astrophys 475(3):1159–1183
Eyer L, Blake C (2005) Automated classification of variable stars for All-Sky Automated Survey 1–2 data. Mon Not Roy Astron Soc 358(1):30–38
Eyer L, Cuypers J (2000) Predictions on the number of variable stars for the gaia space mission and for surveys such as the ground-based international liquid mirror telescope. In: IAU Colloq. 176: The impact of large-scale surveys on pulsating star research, vol 203, pp 71–72
Eyer L et al (2010) The variable universe through the eyes of GAIA. arXiv:1011.4527v1
Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: Data mining, inference, and prediction. Springer, New York
Richards J, Starr D, Butler N, Bloom J, Brewer J, Crellin-Quick A, Higgins J, Kennedy R, Rischard M (2011) On machine-learned classification of variable stars with sparse and noisy time-series data. Astrophys J 733:10
Acknowledgements
The authors acknowledge the generous support of a cyber-enabled discovery and innovation (CDI) grant (No. 0941742) from the National Science Foundation. This work was performed in the CDI-sponsored Center for Time Domain Informatics (http://cftd.info).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media New York
About this chapter
Cite this chapter
Long, J.P., Bloom, J.S., Karoui, N.E., Rice, J., Richards, J.W. (2012). Classification of Poorly Time Sampled Light Curves of Periodic Variable Stars. In: Sarro, L., Eyer, L., O'Mullane, W., De Ridder, J. (eds) Astrostatistics and Data Mining. Springer Series in Astrostatistics, vol 2. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-3323-1_16
Download citation
DOI: https://doi.org/10.1007/978-1-4614-3323-1_16
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-3322-4
Online ISBN: 978-1-4614-3323-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)