Abstract
A partial linear model with missing response variables and error-prone covariates is considered. The imputation approach is developed to estimate the regression coefficients and the nonparametric function. The proposed parametric estimators are shown to be asymptotically normal, and the estimators for the nonparametric part are proved to converge at an optimal rate. To construct confidence regions for the regression coefficients and the nonparametric function, respectively, the authors also propose the empirical-likelihood-based statistics and investigate the limit distributions of the empirical likelihood ratios. The simulation study is conducted to compare the finite sample behavior for the proposed estimators. An application to an AIDS dataset is illustrated.
Similar content being viewed by others
References
R. F. Engle, C. W. J. Granger, J. Rice, and A. Weiss, Semiparametric estimates of the relation between weather and electricity sales, J. Amer. Statist. Assoc., 1986, 80: 310–319.
H. Chen, Convergence rates for parametric components in a partly linear model, Ann. Statist., 1988, 16: 136–146.
N. Heckman, Spline smoothing in partly linear models, J. Roy. Statist. Soc. B, 1986, 48: 244–248.
P. Speckman, Kernel smoothing in partial linear models, J. Amer. Statist. Soc. Ser. B, 1988, 50: 413–436.
Q. H. Wang and L. X. Zhu, Estimation in partly linear error-in-covariables models with censored data, Commun. Statist — Theory and Method, 2001, 30: 41–54.
L. G. Xue and L. X. Zhu, Empirical likelihood-based inference in a partially linear model for longitudinal data, Science in China, Series A, 2008, 51(1): 115–130.
H. Liang, H. Härdle, and R. J. Carroll, Estimation in a semiparametric partially linear error-invariables model, Ann. Statist., 1999, 27: 1519–1535.
H. J. Cui, Estimation in partial linear EV models with replicated observations, Science in China, Series A, 2004, 47: 144–159.
H. Liang, S. Wang, J. M. Robins, and R. J. Carroll, Estimation in partially linear models with missing covariates, J. Amer. Statist. Assoc., 2004, 99: 357–367.
Q. H. Wang, O. Linton, and W. Härdle, Semiparametric regression analysis with missing response at random, J. Amer. Statist. Assoc., 2004, 99: 334–345.
Q. H. Wang and Z. H. Sun, Estimation in partially linear models with missing responses at random, J. Multi. Anal., 2007, 98: 1470–1493.
R. J. A. Little and D. B. Rubin, Statistical Analysis with Missing Data, Wiley, New York, 1987.
R. J. Carroll, D. Ruppert, and L. A. Stefansk, Measurement Error in Nonlinear Models, Chapman and Hall, New York, 1995.
H. Liang, S. Wang, and R. J. Carroll, Partially linear models with missing response variables and error-prone covariates, Biometrika, 2007, 94: 185–198.
J. Robins, A. Rotnizky, and L. P. Zhao, Estimation of regression coefficients when some regressors are not always observed, J. Amer. Statist. Assoc., 1994, 89: 846–866.
A. B. Owen, Empirical likelihood ratio confidence intervals for a single function, Biometrika, 1988, 75: 237–325.
A. B. Owen, Empirical likelihood ratio confidence regions, Ann. Statist., 1990, 18: 90–120.
W. Stute, L. G. Xue, and L. X. Zhu, Empirical likelihood inference in nonlinear errors-in-covariables models with validation data, J. Amer. Statist. Assoc., 2007, 102: 332–346.
L. G. Xue and L. X. Zhu, Empirical likelihood for a varying coefficient model with longitudinal data, J. Amer. Statist. Assoc., 2007, 102: 642–654.
Q. H. Wang and G. Li, Empirical likelihood semiparametric regression analysis under random censorship, J. Multi. Anal., 2002, 83: 469–486.
C. J. Stone, Optimal rates of convergence for nonparametric estimators, Ann. Statist., 1980, 8: 1348–1360.
J. Rao and A. Scott, The analysis of categorical data from complex sample surveys: Chi-squared tests for goodness-of-fit and independence in two-way tables, J. Amer. Statist. Assoc., 1981, 76: 221–230.
J. Z. Huang, C. O. Wu, and L. Zhou, Varying-coecient models and basis function approximations for the analysis of repeated measurements, Biometrika, 2002, 89: 111–128.
J. Q. Fan and J. Zhang, Two-step estimation of functional linear models with applications to longitudinal data, J. Roy. Statist. Soc. Ser. B, 2000, 62: 303–322.
J. Q. Fan and R. Z. Li, New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis, J. Amer. Statist. Assoc., 2004, 99: 710–723.
C. O. Wu, C. T. Chiang, and D. R. Hoover, Asymptotic confidence regions for kernel smoothing of a varying-coefficient model with longitudinal data, J. Amer. Statist. Assoc., 1998, 93: 1388–1402.
Author information
Authors and Affiliations
Corresponding author
Additional information
This research is supported by the National Social Science Foundation of China under Grant No. 11CTJ004, the National Natural Science Foundation of China under Grant Nos. 10871013 and 10871217, the National Natural Science Foundation of Beijing under Grant No. 1102008, the Research Foundation of Chongqing Municipal Education Commission under Grant Nos. KJ110720 and KJ100726, and the Natural Science Foundation of Guangxi under Grant No. 2010GXNSFB013051.
This paper was recommended for publication by Editor Guohua ZOU.
Rights and permissions
About this article
Cite this article
Yang, Y., Xue, L. & Cheng, W. Two-step estimators in partial linear models with missing response variables and error-prone covariates. J Syst Sci Complex 24, 1165–1182 (2011). https://doi.org/10.1007/s11424-011-8393-9
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11424-011-8393-9