Abstract
Microarrays were a trigger to develop new methods which can allow to estimate disturbances in signal cascades, characterized by sets of genes, in various biological conditions. Existing approaches of gene set analysis take information if genes are differentially expressed or are based on some gene ranking. The most commonly used method is Gene Set Enrichment Analysis (GSEA), where an assumption of uniform distribution of genes in some gene set is tested by weighted Kolmogorov-Smirnov test. Many studies present different gene set analysis methods and their comparison, however none of them focus on basic but crucial parameters, like the rank metric. In this paper we compare nine ranking metrics in terms of sensitivity, specificity and prioritization of identification of functional gene sets using a collection of 34 annotated microarray datasets. We show that absolute value of default GSEA measure is the best ranking metric, while the Baumgartner-Weiss-Schindler test statistic is the best statistical-based metrics, which can be used in Gene Set Enrichment Analysis.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Marczyk, M., Jaksik, R., Polanski, A., Polanska, J.: Adaptive filtering of microarray gene expression data based on Gaussian mixture decomposition. BMC Bioinformatics 14(1), 101 (2013)
Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J., Church, G.M.: Systematic determination of genetic network architecture. Nature Genetics 22(3), 281–285 (1999)
Subramanian, A., et al.: Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. PNAS 102(43), 15545–15550 (2005)
Tarca, A.L., Draghici, S., Bhatti, G., Romero, R.: Down-weighting overlapping genes improves gene set analysis. BMC Bioinformatics 13, 136 (2012)
Wu, D., Smyth, G.K.: Camera: a competitive gene set test accounting for inter-gene correlation. Nucleic Acids Research 40(17), e133 (2012)
Rahnenführer, J., Domingues, F. S., Maydt, J., Lengauer, T.: Calculating the statistical significance of changes in pathway activity from gene expression data. Statistical Applications in Genetics and Molecular Biology 3(1) (2004)
Shojaie, A., Michailidis, G.: Network enrichment analysis in complex experiments. Statistical Applications in Genetics and Molecular Biology 9(1) (2010)
Hung, J.-H., et al.: Gene set enrichment analysis: performance evaluation and usage guidelines. Briefings in Bioinformatics 13(3), 281–291 (2012)
Maciejewski, H.: Gene set analysis methods: statistical models and methodological differences. Briefings in Bioinformatics 15(4), 504–518 (2014)
Khatri, P., Sirota, M., Butte, A.J.: Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Computational Biology 8(2), e1002375 (2012)
Tarca, A.L., Bhatti, G., Romero, R.: A comparison of gene set analysis methods in terms of sensitivity, prioritization and specificity. PLoS One 8(11), e79217 (2013)
Kanehisa, M., et al.: KEGG as a reference resource for gene and protein annotation. Nucleic Acids Research 44(D1), D457–D462 (2016)
Bayá, A.E., Larese, M.G., Granitto, P.M., Gómez, J.C., Tapia, E.: Gene set enrichment analysis using non-parametric scores. In: Advances in Bioinformatics and Computational Biology, pp. 12–21. Springer, Heidelberg (2007)
Neuhäuser, M.: An exact two-sample test based on the Baumgartner-Weiß-Schindler statistic and a modification of Lepage’s test. Communications in Statistics-Theory and Methods 29(1), 67–78 (2000)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Zyla, J., Marczyk, M., Polanska, J. (2016). Sensitivity, Specificity and Prioritization of Gene Set Analysis When Applying Different Ranking Metrics. In: Saberi Mohamad, M., Rocha, M., Fdez-Riverola, F., DomÃnguez Mayo, F., De Paz, J. (eds) 10th International Conference on Practical Applications of Computational Biology & Bioinformatics. PACBB 2016. Advances in Intelligent Systems and Computing, vol 477. Springer, Cham. https://doi.org/10.1007/978-3-319-40126-3_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-40126-3_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-40125-6
Online ISBN: 978-3-319-40126-3
eBook Packages: EngineeringEngineering (R0)