Abstract
We propose a simple method for detecting a change point in the mean vector when the number of samples (the historical data set) is smaller than the dimension. We restrict our attention to monitoring independent individual observations under the normality assumption. The approach is based on the Hotelling statistic, applied to the data set projected onto a randomly chosen subspace of sufficiently small dimension. We propose performing the normal random projection of the data (the historical data set and the new observation) instantaneously, as soon as a new observation appears. Next, we provide a model of changes in the mean vector and derive the distribution of the noncentrality parameter values. Further, we define a non-local power of the Hotelling test performed on projected samples, which serves as the criterion for selecting the dimensionality of the projection subspace. Finally, simulation results are provided.
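The monitoring step described in the abstract can be illustrated with a minimal sketch. It is not the chapter's implementation: the function name `projected_t2`, the parameter names, and the simulated data are hypothetical, and the scaling to an F statistic is the textbook result for a single future observation, assuming the projected covariance is nonsingular (that is, k < n and the true covariance has rank at least k).

```python
import numpy as np

def projected_t2(history, x_new, k, rng):
    """Hotelling-type statistic for x_new after a fresh Gaussian projection to k dims.

    history: (n, d) array of in-control observations, with n possibly smaller than d.
    x_new:   (d,) new observation to be checked.
    """
    n, d = history.shape
    if not (1 <= k < n):
        raise ValueError("need 1 <= k < n so the projected covariance is invertible")
    # Fresh projection matrix with i.i.d. N(0, 1) entries, drawn for this observation.
    R = rng.standard_normal((k, d))
    H = history @ R.T                          # projected historical data, shape (n, k)
    z = R @ x_new                              # projected new observation, shape (k,)
    diff = z - H.mean(axis=0)
    S = np.cov(H, rowvar=False).reshape(k, k)  # sample covariance (divisor n - 1)
    t2 = float(diff @ np.linalg.solve(S, diff))
    # Under normality and no change, this scaled value follows F(k, n - k).
    f_stat = t2 * n * (n - k) / (k * (n + 1) * (n - 1))
    return t2, f_stat

# Hypothetical usage: n = 20 historical samples in d = 100 dimensions.
rng = np.random.default_rng(0)
n, d, k = 20, 100, 5
history = rng.standard_normal((n, d))
x_new = rng.standard_normal(d) + 3.0           # observation with a shifted mean
t2, f_stat = projected_t2(history, x_new, k, rng)
```

The value `f_stat` would then be compared with an upper quantile of the F distribution with k and n - k degrees of freedom. The choice of k is left open here; the chapter selects it using the non-local power criterion.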
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Skubalska-Rafajłowicz, E. (2015). Change-Point Detection of the Mean Vector with Fewer Observations than the Dimension Using Instantaneous Normal Random Projections. In: Steland, A., Rafajłowicz, E., Szajowski, K. (eds) Stochastic Models, Statistics and Their Applications. Springer Proceedings in Mathematics & Statistics, vol 122. Springer, Cham. https://doi.org/10.1007/978-3-319-13881-7_20
DOI: https://doi.org/10.1007/978-3-319-13881-7_20
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13880-0
Online ISBN: 978-3-319-13881-7
eBook Packages: Mathematics and Statistics, Mathematics and Statistics (R0)