A Skew-t-Normal Multi-level Reduced-Rank Functional PCA Model for the Analysis of Replicated Genomics Time Course Data

Berk, Maurice; Montana, Giovanni

doi:10.1007/978-3-642-34156-4_7

Conference paper

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 7619)

Abstract

Modelling replicated genomics time series data sets is challenging for two key reasons. First, they exhibit two distinct levels of variation: between-transcript variation and, nested within it, between-replicate variation. Second, the typical assumption of normality rarely holds. In light of these issues, standard practice is simply to treat each transcript independently, which greatly simplifies the modelling approach, reduces the computational burden and nevertheless appears to yield good results. We set out to improve upon this, and in this article we present a multi-level reduced-rank functional PCA model that more accurately reflects the biological reality of these replicated genomics data sets, retains a degree of computational efficiency and enables dimensionality reduction.
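The two nested levels of variation described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' model: it assumes Gaussian scores and noise (so it ignores the skewness the paper addresses), and the basis functions, ranks, scales and the function names `simulate_two_level_curves` and `variance_by_level` are all hypothetical choices made for illustration. Each curve is built as a shared mean, plus a low-rank transcript-level deviation, plus a low-rank replicate-level deviation nested within the transcript, plus noise.

```python
import numpy as np


def simulate_two_level_curves(n_transcripts=50, n_replicates=4, n_times=10,
                              rank_between=2, rank_within=1,
                              noise_sd=0.1, seed=0):
    """Simulate replicated time courses with two nested levels of variation.

    All settings here are illustrative assumptions, not values from the paper.
    Returns the time grid and an array of shape
    (n_transcripts, n_replicates, n_times).
    """
    rng = np.random.default_rng(seed)
    t = np.linspace(0.0, 1.0, n_times)

    # Hypothetical smooth basis functions standing in for principal components.
    phi = np.stack([np.cos((k + 1) * np.pi * t) for k in range(rank_between)])
    psi = np.stack([np.sin((l + 1) * np.pi * t) for l in range(rank_within)])

    mu = 1.0 + t  # shared mean curve (assumed linear for simplicity)

    # Level 1: one score vector per transcript.
    alpha = rng.normal(scale=1.0, size=(n_transcripts, rank_between))
    # Level 2: one score vector per replicate, nested within each transcript.
    beta = rng.normal(scale=0.3,
                      size=(n_transcripts, n_replicates, rank_within))
    eps = rng.normal(scale=noise_sd,
                     size=(n_transcripts, n_replicates, n_times))

    transcript_part = alpha @ phi   # (n_transcripts, n_times)
    replicate_part = beta @ psi     # (n_transcripts, n_replicates, n_times)
    y = mu + transcript_part[:, None, :] + replicate_part + eps
    return t, y


def variance_by_level(y):
    """Crude moment-based summary of the variance at each level."""
    transcript_means = y.mean(axis=1)
    # Spread of transcript mean curves around the overall mean, averaged over time.
    between_transcript = transcript_means.var(axis=0, ddof=1).mean()
    # Spread of replicates around their own transcript mean, averaged over
    # transcripts and time (this also absorbs the measurement noise).
    between_replicate = y.var(axis=1, ddof=1).mean()
    return between_transcript, between_replicate


if __name__ == "__main__":
    t, y = simulate_two_level_curves()
    between, within = variance_by_level(y)
    print(f"between-transcript variance: {between:.3f}")
    print(f"between-replicate variance:  {within:.3f}")
```

Fitting each transcript independently, the standard practice the abstract mentions, would use only one slice `y[i]` at a time and so could not share information about the replicate-level structure across transcripts.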

Author information

Authors and Affiliations

Department of Medicine, Imperial College London, UK
Maurice Berk
Department of Mathematics, Imperial College London, UK
Giovanni Montana

Editor information

Editors and Affiliations

Department of Information and Computer Science, Aalto University School of Science, P.O. Box 15400, 00076, Aalto, Finland
Jaakko Hollmén
Department of Computer Science, Ostfalia University of Applied Sciences, Salzdahlumer Straße 46/48, 38302, Wolfenbüttel, Germany
Frank Klawonn
School of Information Systems, Computing and Mathematics, Brunel University, UB8 3PH, Uxbridge, Middlesex, UK
Allan Tucker

About this paper

Cite this paper

Berk, M., Montana, G. (2012). A Skew-t-Normal Multi-level Reduced-Rank Functional PCA Model for the Analysis of Replicated Genomics Time Course Data. In: Hollmén, J., Klawonn, F., Tucker, A. (eds) Advances in Intelligent Data Analysis XI. IDA 2012. Lecture Notes in Computer Science, vol 7619. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34156-4_7

DOI: https://doi.org/10.1007/978-3-642-34156-4_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-34155-7
Online ISBN: 978-3-642-34156-4
eBook Packages: Computer Science, Computer Science (R0)