Data and Databases

Damian, Daniel

doi:10.1007/978-0-387-92738-1_18

Abstract

Owing to recent advances in technology and to the growth in the number and size of projects tasked with collecting and assembling biological and genomic information, a highly heterogeneous collection of databases has become available to the community in the last two decades. As a consequence of rapid and distributed progress throughout the field, bioinformatics databases are provided in a variety of formats and specifications. This chapter discusses the most frequently encountered data formats in bioinformatics and the tools used to access these data.

Author information

Authors and Affiliations

BioWisdom Ltd., Cambridge, CB22 7GG, UK
Daniel Damian

Corresponding author

Correspondence to Daniel Damian.

Editor information

Editors and Affiliations

Inst. Molecular Bioscience, University of Queensland, St. Lucia, 4072, Australia
David Edwards
Dept. Plant & Microbial Biology, University of California, Berkeley, Koshland Hall 111, Berkeley, 94720, U.S.A.
Jason Stajich
e-Health Research Centre, Adelaide St. 300, Brisbane, 4000, Australia
David Hansen

About this chapter

Cite this chapter

Damian, D. (2009). Data and Databases. In: Edwards, D., Stajich, J., Hansen, D. (eds) Bioinformatics. Springer, New York, NY. https://doi.org/10.1007/978-0-387-92738-1_18

DOI: https://doi.org/10.1007/978-0-387-92738-1_18
Published: 05 August 2009
Publisher Name: Springer, New York, NY
Print ISBN: 978-0-387-92737-4
Online ISBN: 978-0-387-92738-1
eBook Packages: Biomedical and Life Sciences (R0)
