Abstract
The development of high-throughput experimental techniques has made measurements for virtually all kinds of cellular components possible. Effective integration and analysis of this diverged information to produce insightful knowledge is central to biological study today. In this chapter, we present a methodology for building integrative analytical workbenches using the workflow technology. We focus on the field of gene discovery through the combined study of transcriptomics, genomics and epigenomics, although the methodology is generally applicable to any omics-data analysis for biomarker discovery. We illustrate the application of the methodology by presenting our study on the identification of aberrant genomic regions, genes and/or their regulatory elements with their implications for breast cancer research. We also discuss the challenges and opportunities brought by the latest development of the next generation sequencing technology.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Velculescu, V. E., Zhang, L., Vogelstein, B., and Kinzler, K. W. (1995) Serial analysis of gene expression. Science 270, 484–487.
Shinawi, M., and Cheung, S. W. (2008) The array CGH and its clinical applications. Drug Discov Today 13, 760–770.
Yue, P., and Moult, J. (2006) Identification and analysis of deleterious human SNPs. J Mol Biol 356, 1263–1274.
Bird, A. (2007) Perceptions of epigenetics. Nature 447, 396–398.
Aparicio, O., and Geisberg, J. V. (2004) Chromatin immunoprecipitation for determining the association of proteins with specific genomic sequences in vivo. In, Chapter 17, Unit 17.7 Current Protocols in Cell Biology. Juan S. Bonifacino, Mary Dasso, Joe B. Harford, Jennifer Lippincott-Schwartz, and Kenneth M. Yamada (Eds.) Wiley, Los Angeles, CA.
Conover, W. J. (1980) Practical Nonparametric Statistics, 3rd ed. Wiley, New York, NY.
Chong, F., Carraro, G., Wolter, R. (2009) Multi-Tenant Data Architecture. http://msdn.microsoft.com/en-us/library/aa479086.aspx
Stafford, P. (2008) Methods In Microarray Normalization. CRC Press, New York, NY.
Kaplan, E. L., and Meier, P. (1958) Nonparametric Estimation from Incomplete Observations. J Am Stat Assoc 53(282), 457–481.
Freedman, D. A. (2007) Statistics. W.W. Norton & Company, New York, NY.
Corder, G. W., and Foreman, D. I. (2009) Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach. Wiley, River Street Hoboken, NJ.
Zuber, V., and Strimmer, K. (2009) Gene ranking and biomarker discovery. Bioinformatics 25, 2700–2707.
Tan, P., Steinbach, M., and Kumar, V. (2002) Clustering analysis: basic concepts and algorithms. In Introduction to Data Mining, Chapter 8, 487–559. Addison-Wesley, Boston, MA.
Klosgen, W., and Zytkow, J. M. (2002) Handbook of Data Mining and Knowledge Discovery. Oxford University Press, Oxford.
Gorlov, I. P., Gallick, G. E., Gorlova, O. Y., Amos, C., and Logothetis, C. J. (2009) GWAS meets microarray: are the results of genome-wide association studies and gene-expression profiling consistent? Prostate cancer as an example. PLoS One 4, e6511.
Neve, R. M., Chin, K., Fridlyand, J., Yeh, J., Baehner, F. L., Fevr, T., Clark, L., Bayani, N., Coppe, J. P., Tong, F., Speed, T., Spellman, P. T., DeVries, S., Lapuk, A., Wang, N. J., Kuo, W. L., Stilwell, J. L., Pinkel, D., Albertson, D. G., Waldman, F. M., McCormick, F., Dickson, R. B., Johnson, M. D., Lippman, M., Ethier, S., Gazdar, A., and Gray, J. W. (2006) A collection of breast cancer cell lines for the study of functionally distinct cancer subtypes. Cancer Cell 10, 515–527.
Wold, B., and Myers, R. M. (2008) Sequence census methods for functional genomics. Nat Methods 5, 19–21.
Mardis, E. R. (2008) The impact of next–generation sequencing technology on genetics. Trends Genet 24, 133–141.
Acknowledgements
The authors would like to acknowledge Andrew J. Tutt for his contribution to the breast cancer study presented in Section 3.3.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer Science+Business Media, LLC
About this protocol
Cite this protocol
Guo, Y., Munro, R.E., Kalaitzopoulos, D., Grigoriadis, A. (2011). The ForeSee (4C) Approach for Integrative Analysis in Gene Discovery. In: Yu, B., Hinchcliffe, M. (eds) In Silico Tools for Gene Discovery. Methods in Molecular Biology, vol 760. Humana Press. https://doi.org/10.1007/978-1-61779-176-5_4
Download citation
DOI: https://doi.org/10.1007/978-1-61779-176-5_4
Published:
Publisher Name: Humana Press
Print ISBN: 978-1-61779-175-8
Online ISBN: 978-1-61779-176-5
eBook Packages: Springer Protocols