Abstract
Misalignment of functional features in a sample of random curves leads to potentially misleading inference, when variation in timing is ignored. This chapter reviews the use of Bayesian hierarchical curve registration in Biostatistics and Bioinformatics. Several models allowing for unit-specific random time scales are discussed and applied to longitudinal data arising in biomedicine, pharmacokinetics, and time-course genomics. We consider representations of random functionals based on P-spline priors. Under this framework, straightforward posterior simulation strategies are outlined for inference. Beyond curve registration, we discuss joint regression modeling of both random effects and population level functional quantities. Finally, the use of mixture priors is discussed in the setting of differential expression analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
In our development, X ∼ Ga(a; b) is parametrized so that \(E[X] = a/b\).
References
Albert, J. and Chib, S. (1993). Bayesian analysis of binary and polychotomous response data. Journal of the American Statistical Association, 88, 669–679.
Baladandayuthapani, V., Mallick, B. K., and Carroll, R. J. (2005). Spatially adaptive Bayesian penalized regression splines (P-splines). Journal of Computational and Graphical Statistics, 14(2), 378–394.
Benjamini, Y. and Hochberg, Y. (1995). Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society, Series B, 57, 289–300.
Brumback, L. C. and Lindstrom, M. J. (2004). Self modeling with flexible, random time transformations. Biometrics, 60(2), 461–470.
Cheng, W., Dryden, I., and Huang, X. (2013). Bayesian registrations of functions and curves. eprint arXiv:1311.2105.
De Boor, C. (1978). A Practical Guide to Splines. Berlin: Springer-Verlag.
Della Gatta, G., Bansal, M., Ambesi-Impiombato, A., Antonini, D., Missero, C., and di Bernardo, D. (2008). Direct targets of the trp63 transcription factor revealed by a combination of gene expression profiling and reverse engineering. Genome Research, 18(6), 939–948.
Eilers, P. H. C. and Marx, B. D. (1996). Flexible smoothing with B-splines and penalties. Statistical Science, 11, 89–102.
Erosheva, E. A., Matsueda, R. L., and Telesca, D. (2014). Breaking bad: Reviewing two decades of life course data analysis in criminology and beyond. Annual Reviews of Statistics and Its Applications, 1, 301–332.
Gasser, T. and Kneip, A. (1995). Searching for structure in curve samples. JASA, 90, 1179–1188.
Gelman, A., Carlin, J., Stern, H., Dunson, D., Vehtari, A., and Rubin, D. (2013). Bayesian Data Analysis. Chapman & Hall / CRC, 3rd edition.
Gervini, D. and Gasser, T. (2004). Self-modelling warping functions. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 66(4), 959–971.
Guo, W. (2002). Functional mixed effects models. Biometrics, 58(1), 121–128.
Hart, J. D. and Wehrly, T. E. (1986). Kernel regression estimation using repeated measurements data. Journal of the American Statistical Association, 81, 1080–1088.
Hastie, T. and Tibshirani, R. (1993). Varying–coefficient models. Journal of the Royal Statistical Society, 55, 757–796.
Hastie, T., Tibshirani, R., and Friedman, J. H. (2001). The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer–Verlag Inc.
Kneip, A. and Gasser, T. (1988). Convergence and consistency results for self-modeling nonlinear regression. The Annals of Statistics, 16, 82–112.
Kneip, A. and Gasser, T. (1992). Statistical tools to analyze data representing a sample of curves. The Annals of Statistics, 20, 1266–1305.
Kneip, A., Li, X., MacGibbon, K. B., and Ramsay, J. O. (2000). Curve registration by local regression. The Canadian Journal of Statistics / La Revue Canadienne de Statistique, 28(1), 19–29.
Laird, N. M. and Ware, J. H. (1982). Random-effects models for longitudinal data. Biometrics, 38, 963–974.
Lang, S. and Brezger, A. (2004). Bayesian P-splines. Journal of Computational and Graphical Statistics, 13(1), 183–212.
Leng, X. and Müller, H. (2006). Time ordering of gene co-expression. Biostatistics, 7.
Liu, X. and Müller, H. (2004). Functional averaging and synchronization for time-warped random curves. Journal of the American Statistical Association, 99, 687–699.
Morris, J. S. and Carroll, R. J. (2006). Wavelet-based functional mixed models. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 68(2), 179–199.
Müller, P., Parmigiani, G., and Rice, K. (2006). Fdr and Bayesian multiple comparisons rules. Proceedings of the Valencia/ISBA 8th World Meeting on Bayesian Statistics (Oxford University Press).
Parmigiani, G., Garrett, S. E., Anbashgahn, R., and Gabrielson, E. (2002). A statistical framework for expression-based molecular classification in cancer. Journal of The Royal Statistical Society, Series B, 64, 717–736.
Pinheiro, J. C. and Bates, D. M. (2000). Mixed–Effects Models in S and S–Plus. Springer–Verlag: New York.
Pound, C. R., Partin, A. W., Eisenberger, M. A., Chan, D. W., Pearson, J. D., and Walsh, P. C. (1999). Natural history of progression after psa elevation following radical prostatectomy. Journal of the American Medical Association, 281, 1591–1597.
Qian, J., Dolled-Filhart, M., Lin, J., Yu, H., and Gerstein, M. (2001). Beyond synexpression relationships: local clustering of time-shifted and inverted gene expression profiles identifies new, biologically relevant interactions. Journal of Molecular Biology, 314, 1053–1066.
Ramsay, J. O. and Li, X. (1998). Curve registration. Journal of the Royal Statistical Society, Series B: Statistical Methodology, 60, 351–363.
Rice, J. A. and Silverman, B. W. (1991). Estimating the mean and covariance structure nonparametrically when the data are curves. Journal of the Royal Statistical Society, Series B: Methodological, 53, 233–243.
Ruppert, D., Wand, M., and Carroll, R. J. (2003). Semiparametric Regression. Cambridge University Press.
Sakoe, H. and Chiba, S. (1978). Dynamic programming optimization for spoken word recognition. IEEE Transactions of Acoustic, Speech and Signal Processing, ASSP–26(1), 43–49.
Shi, M., Weiss, R. E., and Taylor, J. M. G. (1996). An analysis of paediatric CD4 counts for acquired immune deficiency syndrome using flexible random curves. Applied Statistics, 45, 151–163.
Silverman, B. W. (1995). Incorporating parametric effects into functional principal components analysis. Journal of the Royal Statistical Society, Series B: Methodological, 57, 673–689.
Telesca, D. and Inoue, L. Y. T. (2008). Bayesian hierarchical curve registration. Journal of the American Statistical Association, 103 (481), 328–339.
Telesca, D., Inoue, L. Y. T., Neira, M., Etzioni, R., Gleave, M., and Nelson, C. (2009). Differential expression and network inferences through functional data modeling. Biometrics, 65, 793–804.
Telesca, D., Erosheva, E. A., Kreager, D. A., and Matsueda, R. L. (2012a). Modeling criminal careers as departures from a unimodal popolation age-crime curve: The case of marijuana use. JASA, 107(500), 1427–1440.
Telesca, D. adn Muller, P., Parmigiani, G., and RS, F. (2012b). Modeling dependent gene expression. Annals of Applied Statistics, 6(2), 542–560.
Tuddenham, R. D. and Snyder, M. M. (1954). Physical growth of Califorfornia boys and girls from birth to eighteen years. University of California Publications in Child Development, 1, 183–364.
Verbyla, A. P., Arunas, P., Cullis, B. R., Kenward, M. G., and Welham, S. J. (1999). The analysis of designed experiments and longitudinal data by using smoothing splines. Journal of the Royal Statistical Society, Series C: Applied Statistics, 48, 269–300.
Wakefield, J. (2012). Bayesian and Frequentist regression methods. Springer.
Wang, K. and Gasser, T. (1997). Alignment of curves by dynamic time warping. The Annals of Statistics, 25(3), 1251–1276.
Wang, K. and Gasser, T. (1999). Synchronizing sample curves nonparametrically. The Annals of Statistics, 27(2), 439–460.
Wypij, D., Pugh, M., and Ware, J. H. (1993). Modeling pulmonary function growth with regression splines. Statistica Sinica, 3, 329–350.
Yao, F., Müller, H. J., and Wang, J. L. (2005). Functional data analysis of sparse longitudinal data. JASA, 100(470), 577–590.
Zeger, S. L. and Diggle, P. J. (1994). Semiparametric models for longitudinal data with application to CD4 cell numbers in HIV seroconverters. Biometrics, 50, 689–699.
Zhang, Y. and Telesca, D. (2014). Joint clustering and registration of functional data. Technical report, UCLA.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Telesca, D. (2015). Bayesian Analysis of Curves Shape Variation Through Registration and Regression. In: Mitra, R., Müller, P. (eds) Nonparametric Bayesian Inference in Biostatistics. Frontiers in Probability and the Statistical Sciences. Springer, Cham. https://doi.org/10.1007/978-3-319-19518-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-319-19518-6_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-19517-9
Online ISBN: 978-3-319-19518-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)