Skip to main content
Log in

A breakpoint detection in the mean model with heterogeneous variance on fixed time intervals

  • Published:
Statistics and Computing Aims and scope Submit manuscript

Abstract

This work is motivated by an application for the homogenization of global navigation satellite system (GNSS)-derived integrated water vapour series. Indeed, these series are affected by abrupt changes due to equipment changes or environmental effects. The detection and correction of the series from these changes are a crucial step before any use for climate studies. In addition to these abrupt changes, it has been observed in the series a non-stationary of the variability. We propose in this paper a new segmentation model that is a breakpoint detection in the mean model of a Gaussian process with heterogeneous variance on known time intervals. In this segmentation case, the dynamic programming algorithm used classically to infer the breakpoints cannot be applied anymore. We propose a procedure in two steps: we first estimate robustly the variances and then apply the classical inference by plugging these estimators. The performance of our proposed procedure is assessed through simulation experiments. An application to real GNSS data is presented.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  • Arlot, S., Massart, P.: Data-driven calibration of penalties for least-squares regression. J. Mach. Learn. Res. 10, 245–279 (2009); (electronic)

  • Auger, I., Lawrence, C.: Algorithms for the optimal identification of segments neighborhoods. Bull. Math. Biol. 51, 3954 (1989)

    Article  MathSciNet  Google Scholar 

  • Bai, J., Perron, P.: Computation and analysis of multiple structural change models. J. Appl. Econ. 18, 1–22 (2003)

    Article  Google Scholar 

  • Baudry, J.P., Maugis, C., Michel, B.: Slope heuristics: overview and implementation. Stat. Comput. 22(2), 455470 (2011)

    MathSciNet  MATH  Google Scholar 

  • Bellman, R.: The theory of dynamic programming. Bull. Am. Math. Soc. 60(6), 503515 (1954)

    Article  MathSciNet  Google Scholar 

  • Birg, L., Massart, P.: Gaussian model selection. J. Eur. Math. Soc. 3, 203–268 (2001)

    Article  MathSciNet  Google Scholar 

  • Braun, J.V., Braun, R., Müller, H.G.: Multiple changepoint fitting via quasilikelihood, with application to DNA sequence segmentation. Biometrika 87(2), 301–314 (2000)

    Article  MathSciNet  Google Scholar 

  • Caussinus, H., Mestre, O.: Detection and correction of artificial shifts in climate series. Appl. Stat. 53, 405–425 (2004)

    MathSciNet  MATH  Google Scholar 

  • Chakar, S., Lebarbier, E., Levy-Leduc, C., Robin, S.: A robust approach for estimating change-points in the mean of an ar(1) process. Bernoulli (to appear) (2015); (arXiv:14031958)

  • Cleynen, A., Robin, S.: Comparing change-point location in independent series. Stat. Comput. 26(1–2), 263–276 (2016)

    Article  MathSciNet  Google Scholar 

  • Cleynen, A., Dudoit, S., Robin, S.: Comparing segmentation methods for genome annotation based on rna-seq data. J. Agric. Biol. Environ. Stat. 19(1), 101–118 (2014)

    Article  MathSciNet  Google Scholar 

  • Dee, D.P., Uppala, S., Simmons, A., Berrisford, P., Poli, P., Kobayashi, S., Andrae, U., Balmaseda, M., Balsamo, G., Bauer, dP, et al.: The era-interim reanalysis: configuration and performance of the data assimilation system. Q. J. R. Meteorol. Soc. 137(656), 553–597 (2011)

    Article  Google Scholar 

  • Gazeaux, J., Williams, S., King, M., Bos, M., Dach, R., Deo, M., Moore, A.W., Ostini, L., Petrie, E., Roggero, M., Teferle, F.N., Olivares, G., Webb, F.H.: Detecting offsets in GPS time series: first results from the detection of offsets in GPS experiment. J. Geophys. Res. Solid Earth 118, 2397–2407 (2013). https://doi.org/10.1002/jgrb.50152

    Article  Google Scholar 

  • Gazeaux, J., Lebarbier, E., Collilieux, X., Métivier, L.: Joint segmentation of multiple gps coordinate series. Journal de la Société Française de Statistique 156(4), 163–179 (2015)

    MathSciNet  MATH  Google Scholar 

  • Lai, T.L., Liu, H., Xing, H.: Autoregressive models with piecewise constant volatility and regression parameters. Stat. Sin. 15, 279–301 (2005a)

    MathSciNet  MATH  Google Scholar 

  • Lai, W., Johnson, M., Kucherlapati, R., Park, P.J.: Comparative analysis of algorithms for identifying amplifications and deletions in array CGH data. Bioinformatics 21(19), 3763–3770 (2005b)

    Article  Google Scholar 

  • Lavielle, M.: Detection of multiple changes in a sequence of dependent variables. Stoch. Process. Their Appl. 83(1), 79–102 (1999)

    Article  MathSciNet  Google Scholar 

  • Lavielle, M.: Using penalized contrasts for the change-point problem. Signal Process. 85(8), 1501–1510 (2005)

    Article  Google Scholar 

  • Lebarbier, E.: Detecting multiple change-points in the mean of Gaussian process by model selection. Signal Process. 85, 717–736 (2005)

    Article  Google Scholar 

  • Lévy-Leduc, C., Boistard, H., Moulines, E., Taqqu, M.S., Reisen, V.A.: Robust estimation of the scale and of the autocovariance function of Gaussian short-and long-range dependent processes. J. Time Ser. Anal. 32(2), 135–156 (2011)

    Article  MathSciNet  Google Scholar 

  • Lévy-Leduc, C., Delattre, M., Mary-Huard, T., Robin, S.: Two-dimensional segmentation for analyzing hi-c data. Bioinformatics 30(17), i386–i392 (2014)

    Article  Google Scholar 

  • Lindau, R., Venema, V.: On the multiple breakpoint problem and the number of significant breaks in homogenization of climate records. Idojaras Q. J. Hung. Meteorol. Serv. 117(1), 1–34 (2013)

    Google Scholar 

  • Lu, Q., Lund, R., Lee, T.: An MDL approach to the climate segmentation problem. Ann. Appl. Stat. 4(1), 299–319 (2010)

    Article  MathSciNet  Google Scholar 

  • Mestre, O., Domonkos, P., Picard, F., Auer, I., Robin, S., Lebarbier, E., Böhm, R., Aguilar, E., Guijarro, J.A., Vertacnik, G., et al.: HOMER : a homogenization software - methods and applications. IDOJARAS 117(1), 47–67 (2013)

    Google Scholar 

  • Ning, T., Wickert, J., Deng, Z., Heise, S., Dick, G., Vey, S., Schne, T.: Homogenized time series of the atmospheric water vapour content obtained from the gnss reprocessed data. J. Clim. 29(7), 2443–2456 (2016)

    Article  Google Scholar 

  • Parracho, A., Bock, O., Bastin, S.: Global IWV trends and variability in atmospheric reanalyses and GPS observations. Atmos. Chem. Phys. Discuss. (2018). https://doi.org/10.5194/acp-2018-137

  • Picard, F., Robin, S., Lavielle, M., Vaisse, C., Daudin, J.J.: A statistical approach for CGH microarray data analysis. BMC Bioinform. 6, 27 (2005)

    Article  Google Scholar 

  • Rousseeuw, P.J., Croux, C.: Alternatives to the median absolute deviation. J. Am. Stat. Assoc. 88(424), 1273–1283 (1993)

    Article  MathSciNet  Google Scholar 

  • Schroeder, M., Lockhoff, M., Forsythe, J.M., Cronk, H.Q., Vonder Haar, T.H., Bennartz, R.: The gewex water vapour assessment: results from intercomparison, trend, and homogeneity analysis of total column water vapour. J. Appl. Meteorol. Clim. 55(7), 1633–1649 (2016)

    Article  Google Scholar 

  • Schwarz, G.: Estimating the dimension of a model. Ann. Stat. 6, 461464 (1978)

    Article  MathSciNet  Google Scholar 

  • Trenberth, K.E., Fasullo, J., Smith, L.: Trends and variability in column-integrated atmospheric water vapour. Clim. Dyn. 24(7–8), 741–758 (2005)

    Article  Google Scholar 

  • Vey, S., Dietrich, R., Fritsche, M., Rlke, A., Steigenberger, P., Rothacher, M.: On the homogeneity and interpretation of precipitable water time series derived from global gps observations. J. Geophys. Res.: Atmos. 114(D10) (2009)

  • Zhang, N.R., Siegmund, D.O.: A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data. Biometrics 63(1), 22–32 (2007)

    Article  MathSciNet  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Emilie Lebarbier.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bock, O., Collilieux, X., Guillamon, F. et al. A breakpoint detection in the mean model with heterogeneous variance on fixed time intervals. Stat Comput 30, 195–207 (2020). https://doi.org/10.1007/s11222-019-09853-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11222-019-09853-5

Keywords

Navigation