Gene Selection with the δ-Sequence Method

Qiu, Xing; Klebanov, Lev

doi:10.1007/978-1-60327-337-4_4

Xing Qiu⁴ &
Lev Klebanov⁵

Part of the book series: Methods in Molecular Biology ((MIMB,volume 972))

2569 Accesses
1 Citations

Abstract

In this chapter, we discuss a method of selecting differentially expressed genes based on a newly discovered structure termed as the δ-sequence. Together with the nonparametric empirical Bayes methodology, it leads to dramatic gains in terms of the mean numbers of true and false discoveries, and in the stability of the results of testing. Furthermore, its outcomes are entirely free from the log-additive array-specific technical noise. The new paradigm offers considerable scope for future developments in this area of methodological research.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 139.00; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Efron B (2003) Robbins, empirical Bayes and microarrays. Ann Stat 31:366–378
Article Google Scholar
Efron B (2004) Large-scale simultaneous hypothesis testing: the choice of a null hypothesis. J Am Stat Assoc 99:96–104
Article Google Scholar
Efron B, Tibshrani R, Storey JD, Tusher V (2001) Empirical Bayes analysis of a microarray experiment. J Am Stat Assoc 96:1151–1160
Article Google Scholar
Allison DB, Gadbury GL, Heo M, Fern’andez JR, Les C-K, Prolla JA, Weindruch R (2002) A mixture model approach for the analysis of microarray gene expression data. Comput Stat Data Anal 39:1–20
Article Google Scholar
Dalmasso C, Broët P, Moreau T (2005) A simple procedure for estimating the false discovery rate. Bioinformatics 21(5):660–668
Article PubMed CAS Google Scholar
Pounds S, Cheng C (2004) Improving false discovery rate estimation. Bioinformatics 20:1737–1745
Article PubMed CAS Google Scholar
Pounds S, Morris SW (2003) Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values. Bioinformatics 19:1236–1242
Article PubMed CAS Google Scholar
Reiner A, Yekutieli D, Benjamini Y (2003) Identifying differentially expressed genes using false discovery rate controlling procedures. Bioinformatics 19:368–375
Article PubMed CAS Google Scholar
Storey JD (2002) A direct approach to false discovery rates. J R Stat Soc Ser B 64:479–498
Article Google Scholar
Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci USA 100(16):9440–9445
Article PubMed CAS Google Scholar
Tsai C-A, Hsueh H-M, Chen JJ (2003) Estimation of false discovery rates in multiple testing: application to gene microarray data. Biometrics 59:1071–1081
Article PubMed Google Scholar
Qiu X, Klebanov L, Yakovlev A (2005) Correlation between gene expression levels and limitations of the empirical Bayes methodology for finding differentially expressed genes. Stat Appl Genet Mol Biol 4:34
Google Scholar
Qiu X, Yakovlev A (2006) Some comments on instability of false discovery rate estimation. J Bioinform Comput Biol 4(5):1057–1068
Article PubMed CAS Google Scholar
Storey JD, Taylor JE, Siegmund D (2003) Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. J R Stat Soc Ser B 66:187–205
Article Google Scholar
Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci USA 100:9440–9445
Article PubMed CAS Google Scholar
Qiu X, Brooks AI, Klebanov L, Yakovlev A (2005) The effects of normalization on the correlation structure of microarray data. BMC Bioinform 6:120
Article Google Scholar
Klebanov L, Jordan C, Yakovlev A (2006) A new type of stochastic dependence revealed in gene expression data. Stat Appl Genet Mol Biol 5 (Article7)
Google Scholar
Almudevar A, Klebanov LB, Qiu X, Salzman P, Yakovlev AY (2006) Utility of correlation measures in analysis of gene expression. NeuroRx 3(3):384–395
Article PubMed CAS Google Scholar
Klebanov L, Yakovlev A (2006) Treating expression levels of different genes as a sample in microarray data analysis: is it worth a risk? Stat Appl Genet Mol Biol 5, Article 9
Google Scholar
Qiu X, Xiao Y, Gordon A, Yakovlev A (2006) Assessing stability of gene selection in microarray data analysis. BMC Bioinform 7:50
Article Google Scholar
Benjamini Y, Hochberg Y (2000) On the adaptive control of the false discovery rate in multiple testing with independent statistics. J Educ Behav Stat 25(1):60
Google Scholar
Yeoh E-J, Ross ME, Shurtleff SA, Williams WK, Patel D, Mahfouz R, Behm FG, Raimondi SC, Relling MV, Patel A, Cheng C, Campana D, Wilkins D, Zhou X, Li J, Liu H, Pui C-H, Evans WE, Naeve C, Wong L, Downing JR (2002) Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. Cancer Cell 1(2):133–143
Article PubMed CAS Google Scholar
Westfall PH, Young S (1993) Resampling-based multiple testing. Wiley, New York, NY
Google Scholar
Yang YH, Dudoit S, Luu P, Lin DM, Peng V, Ngai J, Speed TP (2002) Normalization for cdna microarray data: a robust composite method addressing single and multiple slide systematic variation. Nucleic Acids Res 30(4):e15
Article PubMed Google Scholar
Bolstad BM, Irizarry RA, Astrand M, Speed TP (2003) A comparison of normalization methods for high density oligonucleotide array data based on variance and bias. Bioinformatics 19:185–193
Article PubMed CAS Google Scholar
Gordon A, Glazko G, Qiu X, Yakovlev A (2007) Control of the mean number of false discoveries, Bonferroni, and stability of multiple testing. Ann Appl Stat 1(1):179–190
Article Google Scholar

Download references

Acknowledgments

This research is supported by NIH Grant GM079259 (X. Qiu) and by Theodosius Dobzhansky Center for Genome Bioinformatics (L. Klebanov).

Author information

Authors and Affiliations

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA
Xing Qiu
Department of Probability Statistics Charles University Prague, Prague, Czech Republic
Lev Klebanov

Authors

Xing Qiu
View author publications
You can also search for this author in PubMed Google Scholar
Lev Klebanov
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xing Qiu .

Editor information

Editors and Affiliations

School of Medicine & Dentistry, Dept. Biostatistics & Computational, University of Rochester, Elmwood Ave. 601, Rochester, 14642, New York, USA
Andrei Y. Yakovlev
, Department of Probability and Statistics, Charles University, Sokolovska 83, Prague, 18675, Czech Republic
Lev Klebanov
State University of New York at Buffalo, Main St - 706 Kimball Tower 3435, Buffalo, 14214, New York, USA
Daniel Gaile

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Qiu, X., Klebanov, L. (2013). Gene Selection with the δ-Sequence Method. In: Yakovlev, A., Klebanov, L., Gaile, D. (eds) Statistical Methods for Microarray Data Analysis. Methods in Molecular Biology, vol 972. Humana Press, New York, NY. https://doi.org/10.1007/978-1-60327-337-4_4

Download citation

DOI: https://doi.org/10.1007/978-1-60327-337-4_4
Published: 03 January 2013
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-60327-336-7
Online ISBN: 978-1-60327-337-4
eBook Packages: Springer Protocols

Publish with us

Policies and ethics