Abstract
A new generation of high-throughput measurement technologies in genomics has opened up new avenues for biomedical research. To make use of this potential, statistical challenges related to the size and complexity of these new types of data sets need to be overcome. In particular, noisy data has led to irreproducible scientific results undermining the credibility of the new technologies. This article reviews recent work by statisticians on visualisation and assessment of the quality of data from gene expression microarrays and related technologies. It then traces the impact this work had on the biomedical research community. An example of the use of the new statistical quality assessment tools is their role in the Microarray Quality Control project, an FDA initiative to establish quality standards for high-throughput gene expression data. Another example is their role in the development of a diagnostic tool for thyroid cancer that has hugely reduced the number of unnecessary surgeries.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
This terminology stems from microarray technology. If another technology is used this collection of reference values can be computed the same way, though technically is not an array.
References
Alexander, E.K., et al.: Preoperative diagnosis of benign thyroid nodules with indeterminate cytology. New Engl. J. Med. 23 August 2012
Allison, D., Cui, X., Page, G., Sabripour, M.: Microarray data analysis: from disarray to consolidation and consensus. Nat. Rev. Genet. 7, 55–65 (2006)
Brettschneider, J.: Quality assessment for high-throughput genomic data in research and clinical practice, Impact Statement REF 2014, Unit of assessment: Mathematical Sciences, Summary impact type: Technological (2014)
Brettschneider, J., Collin, F., Bolstad, B.M., Speed, T.P.: Quality assessment for short oligonucleotide arrays, with 5 commentaries and rejoinder. Technometrics 50, 241–264 (2008)
Irizarry, R., Bolstad, B., Collin, F., Cope, L., Hobbs, B., Speed, T.: Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res. 31, e15 (2003)
Li, H., Robinson, K., Anton, B., Saldanha, I., Ladenson, P.: Cost-effectiveness of a novel molecular test of cytologically indeterminate thyroid nodules. JCEM 96(11), E1719 (2011)
Mullard, A.: Reliability of ’new drug target’ claims called into question, News and analysis. Nat. Rev. Drug Discov. 10, 643–644 (2011)
Acknowledgments
I thank F. Collin (Genomic Health), B. Bolstad (Affymetrix) and T. Speed (UC Berkeley and WEHI Melbourne) for our longstanding collaboration. I am also grateful to G. Kennedy (Veracyte), D. Brewer (ICR) and A. Scherer (Spheromics) for support with demonstrating impact, and D. Firth (University of Warwick) for feedback on drafts of my REF 2014 impact case.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Brettschneider, J. (2016). Practical Uses of Quality Assessment for High-Dimensional Gene Expression Data. In: Aston, P., Mulholland, A., Tant, K. (eds) UK Success Stories in Industrial Mathematics. Springer, Cham. https://doi.org/10.1007/978-3-319-25454-8_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-25454-8_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-25452-4
Online ISBN: 978-3-319-25454-8
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)