Arrow Plot for Selecting Genes in a Microarray Experiment: An Explorative Study

  • Catarina Lemos
  • Gustavo Soutinho
  • Ana Cristina BragaEmail author
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 10405)


Genetic expression analysis is essential for the identification of gene functions and take even more importance when they are directly related with diseases. For the performing of a large-scale study of changes in gene expression it is necessary to find a method to do it with precision and accuracy. Thus, the analysis by the microarray technology is an important tool in the diagnosis of diseases.

An important role of the analysis of microarray data involves the determination of which genes could be differentially expressed (DE) across two or more kind of tissue samples.

The traditional methods to detect DE genes are generally based on simple measures of distances and could failed in this classification.

In this work it is explored a new tool proposed by Silva-Fortes [21] that overcome this difficulty, the arrow plot. This tool is also compared with other methods mostly used to this purpose.

The arrow plot is a graphical tool based on two measures of distances between two probability density functions: the overlapping coefficient (OVL) between two densities and the area under the receiver operating characteristic (ROC) curve (AUC), for each gene of a microarray experience.

For illustrative purpose we will use a dataset of pancreatic adenocarcinoma. All computation will be done in R software.


Microarrays Genes Area under the ROC curve Overlapping Coefficient Arrow plot 



This work was supported by FCT - (Fundação para a Ciência e Tecnologia) within the Project Scope: UID/CEC/00319/2013.


  1. 1.
    Behzadi, P., Behzadi, E., Ranjbar, R.: Microarray data analysis. Albanian Med. J. 4, 84–90 (2014)Google Scholar
  2. 2.
    Breitling, R., Armengaud, P., Amtmann, A., Herzyk, P.: Rank products: a simple, yet powerful, new method to detect differentially regulated genes in replicated microarray experiments. FEBS Lett. 573, 83–92 (2004)CrossRefGoogle Scholar
  3. 3.
    Bumgarner, R.: DNA microarrays: types, applications and their future. Curr. Protoc. Mol. Biol. 6137(206), 1–17 (2013)Google Scholar
  4. 4.
    Clemons, T.E., Bradley, E.L.: Nonparametric measure of the overlapping coefficient. Comput. Stat. Data Anal. 34(1), 51–61 (2000)CrossRefzbMATHGoogle Scholar
  5. 5.
    Cleves, M.A.: Comparative assessment of three common algorithms for estimating the variance of the area under the nonparametric receiver operating characteristic curve. Stata J. 2(3), 280–289 (2002)Google Scholar
  6. 6.
    Cooper, M.G.: The Cell: A Molecular Approach. Sinauer Associates, Sunderland (2011)Google Scholar
  7. 7.
    Dalman, M.R., Deeter, A., Nimishakavi, G., Duan, Z.-H.: Fold change and p-value cutoffs significantly alter microarray interpretations. BMC Bioinform. 13(2), 1–4 (2012)Google Scholar
  8. 8.
    Florkowski, C.M.: Sensitivity, specificity, receiver-operating characteristic (ROC) curves and likelihood ratios: communicating the performance of diagnostic tests. Clin. Biochem. Rev. 29(Suppl 1), S83–S87 (2008)Google Scholar
  9. 9.
    Hariharan, R.: The analysis of microarray data. Pharmacogenomics 4(4), 477–497 (2003)CrossRefGoogle Scholar
  10. 10.
    Jeffery, I.B., Higgins, D.G., Culhane, A.C.: Comparison and evaluation of methods for generating differentially expressed gene lists from microarray data. BMC Bioinform. 7, 359 (2006)CrossRefGoogle Scholar
  11. 11.
    Kadota, K., Nakai, Y., Shimizu, K.: A weighted average difference method for detecting differentially expressed genes from microarray data. Algorithms Mol. Biol. AMB 3, 8 (2008)CrossRefGoogle Scholar
  12. 12.
    Larson, B.: Meet the Overlapping Coefficient: A Measure for Elevator Speeches, June 2014Google Scholar
  13. 13.
    Metz, C.E.: Basic principles of ROC analysis. Semin. Nucl. Med. VIII(4), 283–298 (1978)CrossRefGoogle Scholar
  14. 14.
    National Center for Biotechnology Information:
  15. 15.
    National Human Genome Research Institute:
  16. 16.
    Park, S.H., Goo, J.M., Jo, C.-H.: Receiver operating characteristic (ROC) curve: practical review for radiologists. Korean J. Radiol. 5(1), 11 (2004)CrossRefGoogle Scholar
  17. 17.
    Parodi, S., Izzotti, A., Muselli, M.: Re: the central role of receiver operating characteristic (ROC) curves in evaluating tests for the early detection of cancer. J. Nat. Cancer Inst. 97(7), 511–515 (2003)Google Scholar
  18. 18.
    Parodi, S., Pistoia, V., Muselli, M.: Not proper ROC curves as new tool for the analysis of differentially expressed genes in microarray experiments. BMC Bioinform. 9, 410 (2008)CrossRefGoogle Scholar
  19. 19.
    Pepe, M.S., Longton, G., Anderson, G.L., Schummer, M.: Selecting differentially expressed genes from microarray experiments. Biometrics 59, 133–142 (2003)MathSciNetCrossRefzbMATHGoogle Scholar
  20. 20.
    Silva-Fortes, C.: Aplicação da metodologia ROC na análise de dados de microarrays. Ph.D. thesis, Faculdade de Ciéncias de Lisboa (2012)Google Scholar
  21. 21.
    Silva-Fortes, C., Turkman, M.A.A., Sousa, L.: Arrow plot: a new graphical tool for selecting up and down regulated genes and genes differentially expressed on sample subgroups. BMC Bioinform. 13, 147 (2012)CrossRefGoogle Scholar
  22. 22.
    Tusher, V.G., Tibshirani, R., Chu, G.: Significance analysis of microarrays applied to the ionizing radiation response. Proc. Nat. Acad. Sci. Unit. States Am. 98, 5116–5121 (2001)CrossRefzbMATHGoogle Scholar

Copyright information

© Springer International Publishing AG 2017

Authors and Affiliations

  • Catarina Lemos
    • 1
  • Gustavo Soutinho
    • 2
  • Ana Cristina Braga
    • 2
    Email author
  1. 1.Departamento de Informática, Escola de EngenhariaUniversidade do MinhoBragaPortugal
  2. 2.ALGORITMI CentreUniversity of MinhoBragaPortugal

Personalised recommendations