Abstract
In addition to what discussed in the previous chapters, there exist some other issues or topics about the analysis of panel count data that have been investigated in the literature or could occur in practice. In conducting regression analysis, for example, one can always ask which or if all covariate variables are important or significant enough to be included in the final model for the response variable of interest. That is, one faces a variable selection problem. For the problem, two situations usually occur. One is that the number of covariate variables is fixed and smaller than the sample size as in usual linear or nonlinear regression analysis (Johnson and Wichern, 2002). The other is that the number of covariate or predictor variables is much larger than the sample size and could be over several thousands or hundred thousands. The latter has become a huge and important topic in statistical genetic analysis as well as some other related areas (Beebe et al., 1998; Lee, 2004).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aalen, O. O. (1975). Statistical inference for a family of counting processes. Ph.D. Thesis, University of California, Berkeley.
Akaike, H. (1973). Maximum likelihood identification of Gaussian autoregressive moving average models. Biometrika, 60, 255–265.
Andersen, P. K. and Klein, J. P. (2004). Multi-state models for event history analysis. Statistical Methods in Medical Research, 11, 91–115.
Andersen, P. K. and Klein, J. P. (2007). Regression analysis for multistate models based on a pseudo-value approach, with applications to bone marrow transplantation studies. Scandinavian Journal of Statistics, 34, 3–16.
Bacchetti, P., Boylan, R. D., Terrault, N. A., Monto, A. and Berenguer, M. (2010). Non-Markov multistate modeling using time-varying covariates, with application to progression of liver fibrosis due to Hepatitis C following liver transplant. The International Journal of Biostatistics, 6, Article 7.
Bartholomew, D. J. (1983). Some recent developments in social statistics. International Statistical Review, 51, 1–9.
Beebe, K. R., Pell, R. J. and Seasholtz, M. B. (1998). Chemometrics: A practical guide. John Wiley & Sons, Inc.
Breiman, L. (1996). Heuristics of instability and stabilization in model selection. The Annals of Statistics, 24, 2350–2383.
Cai, Z. and Sun, Y. (2003). Local linear estimation for time-dependent coefficients in Cox’s regression models. Scandinavian Journal of Statistics, 30, 93–111.
Carroll, R. J., Ruppert, D. and Stefanski, L. A. (1995). Measurement error in nonlinear models. Chapman & Hall, London.
Chen, B., Yi, G. Y. and Cook, R. J. (2010). Analysis of interval-censored disease progression data via multi-state models under a nonignorable inspection process. Statistics in Medicine, 29, 1175–1189.
Chen, B. E. and Cook, R. J. (2003). Regression modeling with recurrent events and time-dependent interval-censored marker data. Lifetime Data Analysis, 9, 275–291.
Chen, H. Y. and Little, R. J. A. (1999). Proportional hazards regression with missing covariates. Journal of the American Statistical Association, 94, 896–908.
Chen, J. and Li, P. (2009). Hypothesis test for normal mixture models: the EM approach. The Annal of Statistics, 37, 2523–2542.
Chen, J. and Tan, X. (2009). Inference for multivariate normal mixtures. Journal of Multivariate Analysis, 100, 1367–1383.
Cheng, S. C. and Wei, L. J. (2000). Inferences for a semiparametric model with panel data. Biometrika, 87, 89–97.
Cox, D. R. and Miller, H. D. (1965). The theory of stochastic processes. London: Chapman and Hall.
Darlington, G. A. and Dixon, S. N. (2013). Event-weighted proportional hazards modelling for recurrent gap time data. Statstics in Medcine, 32, 124–130.
Dicker, L., Huang, B. and Lin, X. (2012). Variable selection and estimation with the seamless-L 0 penalty. Statistica Sinica, to appear.
Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association, 96, 1348–1360.
Fan, J. and Li, R. (2004). New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. Journal of the American Statistical Association, 99, 710–723.
Fan, J. and Lv, J. (2010). A selective overview of variable selection in high dimensional feature space. Statistica Sinica, 20, 101–148.
Fan, J. and Peng, H. (2004). Nonconcave penalized likelihood with a diverging number of parameters. The Annals of Statistics, 32, 928–961.
Ferguson, T. S. (1973). A Bayesian analysis of some non-parametric problems. The Annals of Statistics, 1, 209–230.
Ferguson, T. S. (1974). Prior distributions on spaces of probability measures. The Annals of Statistics, 2, 615–629.
French, J. L. and Ibrahim, J. G. (2002). Bayesian methods for three-state model for rodent carcinogenicity studies. Biometrics, 58, 906–916.
Gentlemen, R. C., Lawless, J. F. and Lindsey, J. C. (1994). Multi-state Markov models for analysing incomplete diseases history data with illustrations for HIV disease. Statistics in Medicine, 13, 805–821.
Gladman, D. D., Farewell, V. T. and Nadeau, C. (1995). Clinical indicators of progression in psoriatic arthritis (PsA): multivariate relative risk model. Journal of Rheumatology, 22, 675–679.
Gómez, G., Calle, M. L. and Oller, R. (2004).] Frequentist and Bayesian approaches for interval-censored data. Statistics Papers, 2, 139–173.
Gómez, G., Espinal and Lagakos, S. W. (2003).] Inference for a linear regression model with an interval-censored covariate. Statistics in Medicine, 22, 409–425.
Hsieh, H. J., Chen, T. H-H. and Chang, S. H. (2002). Assessing chronic disease progression using non-homogeneous exponential regression Markov models: An illustration using a selective breast cancer screening in Taiwan. Statistics in Medicine, 21, 3369–3382.
Huang, X. and Liu, L. (2007). A joint frailty model for survival and gap times between recurrent events. Biometrics, 63, 389–397.
Ibrahim, J. G., Chen, M.-H. and Sinha, D. (2001). Bayesian survival analysis. Springer-Verlag: New York.
Ishwaran, H. and James, L. F. (2004). Computational methods for multiplicative intensity models using weighted gamma processes: proportional hazards, marked point processes, and panel count data. Journal of the American Statistical Association, 99, 175–190.
James, L. F. (2003). Bayesian calculus for gamma processes with applications to semiparametric intensity models. Sankhya A, 65, 196–223.
Johnson, R. A. and Wichern, D. W. (2002). Applied multivariate statistical analysis. Fifth edition, Prentice Hall, Inc.
Joly, P. and Commenges, D. (1999). A penalized likelihood approach for a progressive three-state model with censored and truncated data: Application to AIDS. Biometrics, 55, 887–890.
Joly, P., Commenges, D., Helmer, C. and Letenneur, L. (2002). A penalized likelihood approach for an illness-death model with interval-censored data: Application to age-specific incidence of dementia. Biostatistics, 3, 433–443.
Joly, P., Durand, C., Helmer, C. and Commenges, D. (2009). Estimating life expectancy of demented and institutionalized subjects from interval-censored observations of a multi-state model. Statistical Modelling, 9, 345–360.
Kalbfleisch, J. D. and Lawless, J. F. (1985). The analysis of panel data under a Markov assumption. Journal of the American Statistical Association, 80, 863–871.
Kay, R. (1986). A Markov model for analyzing cancer markers and diseases states in survival studies. Biometrics, 42, 855–865.
Kim, Y-J. (2007). Analysis of panel count data with measurement errors in the covariates. Journal of Statistical Computation and Simulation, 77, 109–117.
Lagakos, S. W. and Louis, T. (1988). Use of tumour lethality to interpret tumorigenicity experiments lacking cause-of-death data. Applied Statistics, 37, 169–179.
Langohr, K., Gómez, G. and Muga, R. (2004). A parametric survival model with an interval-censored covariate. Statistics in Medicine, 23, 3159–3175.
Lawless, J. F. and Zhan, M. (1998). Analysis of interval-grouped recurrent-event data using piecewise constant rate functions. Canadian Journal of Statistics, 26, 549–565.
Lee, M-L. T. (2004). Analysis of microarray gene expression data. Kluwwe Academis Publishers.
Lin, D. Y., Wei, L. J. and Ying, Z. (1993). Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika, 80, 557–572.
Lin, D. Y. and Ying, Z. (2001). Nonparametric tests for the gap time distributions of serial events based on censored data. Biometrics, 57, 369–375.
Lindsey, J. C. and Ryan, L. M. (1993). A three-state multiplicative model for rodent tumorigenicity experiments. Applied Statistics, 42, 283–300.
Little, R. J. A. and Rubin, D. B. (1987). Statistical analysis with missing data, John Wiley: New York.
Mallows, C. L. (1973). Some comments on Cp. Technometrics, 15, 661–675.
Mclachlan, G. and Peel, D. (2000). Finite mixture models. Wiley: New York.
Nielsen, J. D. and Dean, C. B. (2008). Clustered mixed nonhomogeneous Poisson process spline models for the analysis of recurrent event panel data. Biometrics, 64, 751–761.
Park, D-H. (2005). Semiparametric and nnonparametric methods for the analysis of longitudinal data. Ph.D. Dissertation, University of Missouri, Columbia.
Prentice, R. L. (1982). Covariate measurement errors and parameter estimation in a failure time regression model. Biometrika, 69, 331–342.
Robison, L., Mertens, A., Boice, J., et al. (2002). Study design and cohort characteristics of the childhood cancer survivor study: A multi-institutional collaborative project. Medical and Pediatric Oncology, 38, 229–239.
Rosen, O., Jiang, W. and Tanner, M. A. (2000). Mixtures of marginal models. Biometrika, 87, 391–404.
Schaubel, D. E. and Cai, J. (2004). Regression methods for gap time hazard functions of sequentially ordered multivariate failure time data. Biometrika, 91, 291–303.
Scheike, T. H. and Martinussen, T. (2004). On estimation and tests of time-varying effects in the proportional hazards model. Scandinavian Journal of Statistics, 31, 51–62.
Schwartz, G. (1978). Estimating the dimension of a model. The Annals of Statistics, 6, 461–464.
Singer, B. and Spilerman, S. (1976a). The representation of social processes by Markov models. American Journal of Sociology, 82, 1–54.
Singer, B. and Spilerman, S. (1976b). Some methodological issues in the analysis of longitudinal surveys. Annals Economic and Sociological Measurement, 5, 447–474.
Song, X. and Wang, C. Y. (2008). Semiparametric approaches for joint modeling of longitudinal and survival data with time-varying coefficients. Biometrics, 64, 557–566.
Staniswalls, J. G., Thall, P. F. and Salch, J. (1997). Semiparametric regression analysis for recurrent event interval counts. Biometrics, 53, 1334–1353.
Sun, J. and Matthews, D. E. (1997). A random-effect regression model for medical follow-up studies. Canadian Journal of Statistics, 25, 101–111.
Sun, J., Park, D-H., Sun, L. and Zhao, X (2005). Semiparametric regression analysis of longitudinal data with informative observation times. Journal of the American Statistical Association, 100, 882–889.
Sun, J and Rai, S. N. (2001). Nonparametric tests for the comparison of point processes based on incomplete data. Scand Journal Statistics, 28, 725–732.
Sun, L., Guo, S. and Chen, M. (2009a). Marginal regression model with time-varying coefficients for panel data. Communications in Statistics, Theory and Methods, 38, 1241–1261.
Sun, L., Park, D. and Sun, J. (2006). The additive hazards model for recurrent gap times. Statistica Sinica, 16, 919–932.
Sun, L., Zhu, L. and Sun, J. (2009b). Regression analysis of multivariate recurrent event data with time-varying covariate effects. Journal Multivariate Analysis, 100, 2214–2223.
Sun, Y. and Wu, H. (2005). Semiparametric time-varying coefficients regression model for longitudinal data. Scandinavian Journal of Statistics, 32, 21–47.
Susko, E., Kalbfleisch, J. D. and Chen, J. (1998). Constrained nonparametric maximum-likelihood estimation for mixture models. Canadan Journal of Statistics, 26, 601–617.
Tibshirani, R. J. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B, 58, 267–288.
Tibshirani, R. J. (1997). The lasso method for variable selection in the Cox model. Statstics in Medicine, 16, 385–395.
Titman, A. C. (2011). Flexible nonhomogeneous Markov models for panel observed data. Biometrics, 67, 780–787.
Tong, X., He, X., Sun, L. and Sun, J. (2009). Variable selection for panel count data via nonconcave penalized estimating function. Scandinavian Journal of Statistics, 36, 620–635.
Tsiatis, A. A., DeGruttola, V. and Wulfsohn, M. S. (1995). Modeling the relationship of survival to longitudinal data measured with error. Applications to survival and CD4 counts in patients with AIDS. Journal of the American Statistical Association, 90, 27–37.
Tuma, N. B. and Robins, P. K. (1980). A dynamic model of employment behavior: An application to the Seattle and Denver income maintenance experiments. Econometrica, 48, 1031–1-52.
Wang, M. C. and Chen, Y. Q. (2000). Nonparametric and semiparametric trend analysis of stratified recurrence time data. Biometrics, 56, 789–794.
Wang, P., Puterman, M. L., Cockburn, I. and Le, N. (1996). Mixed Poisson regression models with covariate dependent rates. Biometrics, 52, 381–400.
Wasserman, S. (1980). Analyzing social networks as stochastic processes. Journal of the American Statistical Association, 75, 280–294.
White, H. (1982). Maximum likelihood estimation of misspecified models. Econometrica, 50, 1–25.
Wulfsohn, M. S. and Tsiatis, A. A. (1997). A joint model for survival and longitudinal data measured with error. Biometrics, 53, 330–339.
Yan, J. and Huang, J. (2012). Model selection for Cox models with time-varying coefficients. Biometrics, 68, 419–428.
Yi, G. Y. and Lawless, J. F. (2012). Likelihood-based and marginal inference methods for recurrent event data with covariate measurement error. Canadian Journal of Statistics, 40, 530–549.
Zhang, C. (2010). Nearly unbiased variable selection under minimax concave penalty. The Annals of Statistics, 38, 894–942.
Zhang, H., Sun, J. and Wang, D. (2013a). Variable selection and estimation for multivariate panel count data via the seamless-L 0 penalty. The Canadian Journal of Statistics, in press.
Zhao, Q. and Sun, J. (2006). Semiparametric and nonparametric analysis of recurrent events with observation gaps. Computational Statistics and Data Analysis, 51, 1924–1933.
Zhao, X., Tong, X. and Sun, L. (2012). Joint analysis of longitudinal data with dependent observation times. Statistics Sinica, 22, 317–336.
Zhao, X., Zhou, J. and Sun, L. (2011b). Semiparametric transformation models with time-varying coefficients for recurrent and terminal events. Biometrics, 67, 404–414.
Zhao, X. and Zhou, X. (2012). Modeling gap times between recurrent events by marginal rate function. Computational Statistics and Data Analysis, 56, 370–383.
Zhou, H. and Pepe, M. S. (1995). Auxilliary covariate data in failure time regression analysis. Biometrika, 82, 139–149.
Zhu, L., Tong, X., Zhao, H., Sun, J., Srivastava, D., Leisenring, W. and Robison, L. (2013). Statistical analysis of mixed recurrent event data with application to cancer survivor study. Statistics in Medicine, to appaer.
Zou, H. (2006). The adaptive lasso and its oracle properties. Journal of the American Statistical Association, 101, 1418–1429.
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this chapter
Cite this chapter
Sun, J., Zhao, X. (2013). Other Topics. In: Statistical Analysis of Panel Count Data. Statistics for Biology and Health, vol 80. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8715-9_8
Download citation
DOI: https://doi.org/10.1007/978-1-4614-8715-9_8
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-8714-2
Online ISBN: 978-1-4614-8715-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)