Skip to main content

Computer Tools to Analyze Microarray Data

  • Protocol
  • First Online:

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1986))

Abstract

Microarrays are broadly used in genomic analyses and find several applications in biology and medicine, providing a significant amount of data from a single experiment. Different kinds of microarrays are available which are identifiable by characteristics such as the type of probes, the surface used as support, and the method used for target detection. Although microarrays have been applied in many biological areas, their management, and investigation require advanced computational tools to speed up data analysis and at the same time make the interpretation of the results easier. To better deal with microarray datasets of large size, the development of analysis tools that are simple to use as well as able to produce accurate predictions, and of comprehensible models is essential. The tools have to provide an exhaustive collection of information to discriminate and identify SNPs, which are associated with the activity of particular genes affecting biological functions (e.g., a particular drug response), or involved in the development of complex diseases. The object of this chapter is to provide a review of software tools that are easy to use even for nonexperts of the domain, and that are able to efficiently deal with microarray data.

This is a preview of subscription content, log in via an institution.

Buying options

Protocol
USD   49.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD   249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Springer Nature is developing a new tool to find and evaluate Protocols. Learn more

References

  1. Watson JD, Crick FH (1953) Molecular structure of nucleic acids: a structure for deoxyribose nucleic acid. Nature 171:737–738

    Article  CAS  PubMed  Google Scholar 

  2. Sanger F, Nicklen S, Coulson AR (1977) DNA sequencing with chain-terminating inhibitors. Proc Natl Acad Sci U S A 74(12):5463

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Schildkraut CL, Marmur J, Doty P (1961) The formation of hybrid DNA molecules and their use in studies of DNA homologies. J Mol Biol 3(5):595

    Article  CAS  PubMed  Google Scholar 

  4. Grunstein M, Hogness DS (1975) Colony hybridization: a method for the isolation of cloned DNAs that contain a specific gene. Proc Natl Acad Sci U S A 72(10):3961

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Wang J, Bai Y, Li T, Lu Z (2003) DNA microarrays with unimolecular hair-pin double-stranded DNA probes: fabrication and exploration of sequence-specific DNA/protein interactions. J Biochem Biophys Methods 55(3):215

    Article  CAS  PubMed  Google Scholar 

  6. Sussman MJ (1999) i, United States patent, Nature biotechnology 17, 974

    Article  CAS  PubMed  Google Scholar 

  7. Gautier L, Cope L, Bolstad BM, Irizarry RA (2004) affy—analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 20(3):307

    Article  CAS  PubMed  Google Scholar 

  8. Irizarry RA, Bolstad BM, Collin F, Cope LM, Hobbs P, Speed TP (2003) Summaries of Affymetrix GeneChip probe level data. Nucleic Acids Res 31(4):e15

    Article  PubMed  PubMed Central  Google Scholar 

  9. Fan JB, Gunderson KL, Bibikova M, Yeakley JM, Chen J, Garcia EW, Lebruska LL, Laurent M, Shen R, Barker D (2006) Illumina universal bead arrays. Methods Enzymol 410:57

    Article  CAS  PubMed  Google Scholar 

  10. Shen R, Fan JB, Campbell D, Chang W, Chen J, Doucet D, Yeakley J, Bibikova M, Garcia EW, McBride C et al (2005) High-throughput snp genotyping on universal bead arrays. Mutat Res 573(1):70

    Article  CAS  PubMed  Google Scholar 

  11. Blin A, Cisse I, Bockelmann U (2014) Electronic hybridization detection in microarray format and DNA genotyping. Sci Rep 4:4194

    Article  PubMed  PubMed Central  Google Scholar 

  12. Sosnowski RG, Tu E, Butler WF, O’Connell JP, Heller MJ (1997) Rapid determination of single base mismatch mutations in DNA hybrids by direct electric field control. Proc Natl Acad Sci 94(4):1119

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  13. Horan PK, Wheeless LL (1977) Quantitative single cell analysis and sorting. Science 198(4313):149

    Article  CAS  PubMed  Google Scholar 

  14. Schwenk JM, Gry M, Rimini R, Uhl’en M, Nilsson P (2008) Antibody suspension bead arrays within serum proteomics. J Proteome Res 7(8):3168

    Article  CAS  PubMed  Google Scholar 

  15. Gleason LU, Burton RS (2012) High-throughput molecular identification of fish eggs using multiplex suspension bead arrays. Mol Ecol Resour 12(1):57

    Article  CAS  PubMed  Google Scholar 

  16. Pelizzola M, Pavelka N, Foti M, Ricciardi-Castagnoli P (2006) AMDA: an R package for the automated microarray data analysis. BMC Bioinformatics 7(1):335

    Article  PubMed  PubMed Central  Google Scholar 

  17. http://cran.r-project.org/

  18. https://sourceforge.net/projects/automicroarray/files/latest/download?source=typ_redirect

  19. Ihaka R, Gentleman R (1996) R: a language for data analysis and graphics. J Comput Graph Stat 5(3):299

    Google Scholar 

  20. Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth F, Tierney L, Yang JY, Zhang J (2004) Bioconductor: open software development for computational biology and bioinformatics. Genome Biol 5(10):R80

    Article  PubMed  PubMed Central  Google Scholar 

  21. https://www.thermofisher.com/us/en/home/life-science/microarray-analysis.html

  22. Saeed A, Sharov V, White J, Li J, Liang W, Bhagabati N, Braisted J, Klapa M, Currier T, Thiagarajan M et al (2003) Tm4: a free, open-source system for microarray data management and analysis. BioTechniques 34(2):374

    Article  CAS  PubMed  Google Scholar 

  23. Dudoit S, Gentleman RC, Quackenbush J et al (2003) Open source software for the analysis of microarray data. BioTechniques 34(13):45

    Article  Google Scholar 

  24. Saeed AI, Bhagabati NK, Braisted JC, Liang W, Sharov V, Howe EA, Li J, Thiagarajan M, White JA, Quackenbush J (2006) tm4 microarray software suite. Methods Enzymol 411:134

    Article  CAS  PubMed  Google Scholar 

  25. http://www.oracle.com/technetwork/java/javase/downloads/index.html

  26. Gan Z, Wang J, Salomonis N, Stowe JC, Haddad GG, McCulloch AD, Altintas I, Zambon AC (2014) MAAMD: a workflow to standardize meta-analyses and comparison of Affymetrix microarray data. BMC Bioinformatics 15(1):69

    Article  PubMed  PubMed Central  Google Scholar 

  27. Altintas I, Wang J, Crawl D, Li W (2012) Workshop on Data analytics in the Cloud (DanaC2012) at EDBT/ICDT 2012 conference. http://www.edbt.org/Proceedings/2012-Berlin/papers/workshops/danac2012/a5-altintas.pdf

  28. Wang J, Crawl D, Altintas I (2012) 1st international workshop on advances in the Kepler scientific workflow system and its applications at ICCS 2012 conference

    Google Scholar 

  29. Wang J, Altintas I (2012) 1st international workshop on advances in the Kepler scientific workflow system and its applications at ICCS 2012 conference

    Google Scholar 

  30. Crawl D, Wang J, Altintas I (2011) Provenance for MapReduce-based data-intensive workflows. In: Supercomputing 2011 (SC2011) conference, ACM 2011, vol Proceedings of the 6th workshop on workflows in support of large-scale science (WORKS11). http://users.sdsc.edu/~jianwu/JianwuWang_files/Provenance_for_MapReduce-based_Data-Intensive_Workflows-2011.pdf

  31. Altintas I (2011) Distributed workflow-driven analysis of large-scale biological data using biokepler. In: Proceedings of the 2nd international workshop on Petascal data analytics: challenges and opportunities (ACM, New York, NY, USA). Doi:https://doi.org/10.1145/2110205.2110215

  32. https://kepler-project.org/users/downloads

  33. http://www.altanalyze.org/

  34. Mootha VK, Lindgren CM, Eriksson KF, Subramanian A, Sihag S, Lehar J, Puigserver P, Carlsson E, Ridderstråle M, Laurila E, Houstis N, Daly MJ, Patterson N, Mesirov JP, Golub TR, Tamayo P, Spiegelman B, Lander ES, Hirschhorn JN, Altshuler D, Groop LC (2003) PGC-1alpha-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes. Nat Genet 34:267–273

    Article  CAS  PubMed  Google Scholar 

  35. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, Mesirov JP (2005) Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 102(43):15545

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  36. Ji H, Jiang H, Ma W, Johnson DS, Myers RM, Wong WH (2008) An integrated software system for analyzing chip-chip and chip-seq data. Nat Biotechnol 26:1293. EP

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  37. Zhou Q, Wong WH (2004) CisModule: de novo discovery of cis-regulatory modules by hierarchical mixture modeling. Proc Natl Acad Sci U S A 101(33):12114

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. http://www.open-std.org/jtc1/sc22/wg14/

  39. http://www.open-std.org/jtc1/sc22/wg21/

  40. Korn JM, Kuruvilla FG, Mc-Carroll SA, Wysoker A, Nemesh J, Cawley S, Hubbell E, Veitch J, Collins PJ, Darvishi K, Lee C, Nizzari MM, Gabriel SB, Purcell S, Daly MJ, Altshuler D (2008) Integrated genotype calling and association analysis of snps, common copy number polymorphisms and rare cnvs. Nat Genet 40:1253–1260

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. McCarroll SA, Kuruvilla FG, Korn JM, Cawley S, Nemesh J, Wysoker A, Shapero MH, de Bakker PIW, Maller JB, Kirby A, Elliott AL, Parkin M, Hubbell E, Webster T, Mei R, Veitch J, Collins PJ, Handsaker R, Lincoln S, Nizzari M, Blume J, Jones KW, Rava R, Daly MJ, Gabriel SB, Altshuler D (2008) Integrated detection and population-genetic analysis of snps and copy number variation. Nat Genet 40:1166. EP

    Article  CAS  PubMed  Google Scholar 

  42. http://www.python.org

  43. http://numpy.scipy.org

  44. http://lib.stat.cmu.edu/R/CRAN/web/packages/mclust/index.html

  45. http://www.affymetrix.com/support/developer/powertools/index.affx

  46. Sharan R, Maron-Katz A, Shamir R (2003) Click and expander: a system for clustering and visualizing gene expression data. Bioinformatics 19(14):1787

    Article  CAS  PubMed  Google Scholar 

  47. Ulitsky I, Maron-Katz A, Shavit S, Sagir D, Linhart C, Elkon R, Tanay A, Sharan R, Shiloh Y, Shamir R (2010) Expander: from expression microarrays to networks and functions. Nat Protoc 5:303. EP

    Article  CAS  PubMed  Google Scholar 

  48. Shamir R, Maron-Katz A, Tanay A, Linhart C, Steinfeld I, Sharan R, Shiloh Y, Elkon R (2005) Expander z an integrative program suite for microarray data analysis. BMC Bioinformatics 6(1):232

    Article  PubMed  PubMed Central  Google Scholar 

  49. https://emea.illumina.com/

  50. Du P, Kibbe WA, Lin SM (2008) lumi: a pipeline for processing illumina microarray. Bioinformatics 24(13):1547

    Article  CAS  PubMed  Google Scholar 

  51. Du P, Zhang X, Huang CC, Jafari N, Kibbe WA, Hou L, Lin SM (2010) Comparison of beta-value and m-value methods for quantifying methylation levels by microarray analysis. BMC bioinformatics 11(1):587

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Lin SM, Du P, Huber W, Kibbe WA (2008) Model-based variance-stabilizing transformation for Illumina microarray data. Nucleic Acids Res 36(2):e11

    Article  PubMed  PubMed Central  Google Scholar 

  53. Saldanha AJ (2004) Java Treeview—extensible visualization of microarray data. Bioinformatics 20(17):3246

    Article  CAS  PubMed  Google Scholar 

  54. Guzzi PH, Agapito G, Di Martino MT, Arbitrio M, Tassone P, Tagliaferri P, Cannataro M (2012) DMET-Analyzer: automatic analysis of Affymetrix DMET data. BMC Bioinformatics 13(1):258

    Article  PubMed  PubMed Central  Google Scholar 

  55. Agapito G, Guzzi PH, Cannataro M (2019) Parallel extraction of association rules from genomics data. Appl Math Comput 350:434–446

    Article  Google Scholar 

  56. Agapito G, Botta C, Guzzi PH, Arbitrio M, Di Martino MT, Tassone P, Tagliaferri P, Cannataro M (2016) OSAnalyzer: a bioinformatics tool for the analysis of gene polymorphisms en-riched with clinical outcomes. Microarrays 5(4). https://doi.org/10.3390/microarrays5040024

    Article  PubMed Central  Google Scholar 

  57. Agapito G, Guzzi PH, Cannataro M (2015) DMET-Miner: Efficient discovery of association rules from pharmacogenomic data. J Biomed Inform 56:273

    Article  PubMed  Google Scholar 

  58. Agapito G, Cannataro M, Guzzi PH, Marozzo F, Talia D, Trunfio P (2013) Proceedings of the international conference on bioinformatics, computational biology and biomedical informatics

    Google Scholar 

  59. Marozzo F, Talia D, Trunfio P (2013) A cloud framework for big data analytics workflows on azure. In: Grandinetti L (ed) Clouds, grids and big data. IOS Press, Chap. Big Data

    Google Scholar 

  60. Marozzo F, Talia D, Trunfio P (2012) European conference on parallel processing. Springer, pp 220–227

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giuseppe Agapito .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Science+Business Media, LLC, part of Springer Nature

About this protocol

Check for updates. Verify currency and authenticity via CrossMark

Cite this protocol

Agapito, G. (2019). Computer Tools to Analyze Microarray Data. In: Bolón-Canedo, V., Alonso-Betanzos, A. (eds) Microarray Bioinformatics. Methods in Molecular Biology, vol 1986. Humana, New York, NY. https://doi.org/10.1007/978-1-4939-9442-7_13

Download citation

  • DOI: https://doi.org/10.1007/978-1-4939-9442-7_13

  • Published:

  • Publisher Name: Humana, New York, NY

  • Print ISBN: 978-1-4939-9441-0

  • Online ISBN: 978-1-4939-9442-7

  • eBook Packages: Springer Protocols

Publish with us

Policies and ethics