Abstract
This paper is concerned with high-dimensional modifications of Akaike information criterion (AIC) for selection of variables in discriminant analysis. The AIC has been proposed as an asymptotically unbiased estimator of the risk function of the candidate model when the dimension is fixed and the sample size tends to infinity. On the other hand, Fujikoshi (2002) attempted to modify the AIC in two-group discriminant analysis when the dimension and the sample size tend to infinity. Such an estimator is called high-dimensional AIC, which is denoted by HAIC. However, its modification was obtained under a restrictive assumption, and furthermore, it was difficult to extend the method to multiple-group case. In this paper, by a new approach we propose HAIC which is an asymptotically unbiased estimator of the risk function in multiple-group discriminant analysis when both the dimension and the sample size tend to infinity, for a general class of candidate models. By simulation experiments it is shown that HAIC is more useful than other AIC type of criteria.
Similar content being viewed by others
References
Akaike, H. (1973). Information theory and an extension of the maximum likelihood principle. In 2nd International Symposium on Information Theory, (B. N. Petrov and F.Csáki,eds.), 267–81, Budapest: Akadémia Kiado.
Fujikoshi, Y. (1982). A test for additional information in canonical correlation analysis. Ann. Inst. Statist. Math., 34, 137–144.
Fujikoshi, Y. (1985). Selection of Variables in Discriminant Analysis and Canonical Correlation Analysis. In Multivariate Analysis-VI (P.R. Krishnaian, ed.). Elsevier Science Publishers B.V., Amsterdam, pp. 219–236.
Fujikoshi, Y. (2002). Selection of variables for discriminant analysis in a high-dimensional case. Sankhya Ser. A, 64, 256–257.
Fujikoshi, Y., Ulyanov, V. and Shimizu, R. (2010). Multivariate statistics: high-dimensional and large-sample approximations. Wiley, Hobeken, N.J.
Fujikoshi, Y. and Seo, T. (1998). Asymptotic approximations for EPMC’s of the linear and the quadratic discriminant functions when the sample sizes and the dimension are large. Random Oper. Stochastic Equations, 6, 269–280.
Kabe, D.G. (1964). A note on the Bartlett decomposition of a Wishart matrix. J. Roy. Statist. Soc. Ser. B, 26, 270–273.
Rao, C.R. (1948). Tests of significance in multivariate analysis. Biometrika 35, 58–79.
Rao, C.R. (1973). Linear statistical inference and its applications, (2nd ed.). Wiley, New York.
Raudys, S. (1972). On the amount of priori information in designing the classification algorithm, Tech. Cybern. 4, 168–174. (in Russian)
Raudys, S. and Young, D.M. (2004). Results in statistical discriminant analysis: a review of the former Soviet Union literature. J. Multivariate Anal., 89, 1–35.
Wakaki, H., Fujikoshi, Y. and Ulyanov, V. (2002). Asymptotic expansions of the distributions of MANOVA test statistics when the dimension is large. Hiroshima Statistical Group Technical Report 10, 97, 1–10.
Wyman, F. J., Young, D. M. and Turner, D.W. (1990). A comparison of asymptotic error rate expansions for the sample linear discriminant function. Pattern Recognition, 23, 775–783.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Sakurai, T., Nakada, T. & Fujikoshi, Y. High-dimensional AICs for selection of variables in discriminant analysis. Sankhya A 75, 1–25 (2013). https://doi.org/10.1007/s13171-013-0025-0
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s13171-013-0025-0