The most important basis for applied bioinformatics is the collection of sequence data and its associated biological information. For example, with genome sequencing projects such data are generated daily in very large quantities worldwide. To use these data appropriately, a structured fi ling system of these data is necessary, yet the data should also be accessible to those interested. Annually, the journal Nucleic Acids Research [nar] dedicates an entire issue (fi rst issue in January) to all available biological databases, which are recorded in tabular form with the respective URLs. Furthermore, for a number of databases, original articles describe their functions. This database issue, which is freely accessible also on the WWW, is a good starting point for working with biological databases. Depending on the kind of data included, different categories of biological databases can be distinguished. Primary databases contain primary sequence information (nucleotide or protein) and accompanying annotation information regarding function, bibliographies, cross-references to other databases, etc. Secondary biological databases, however, summarize the results from analyses of primary protein sequence databases. The aim of these analyses is to derive common features for sequence classes, which in turn can be used for the classifi - cation of unknown sequences (annotation). In addition, all other databases that save biological or medical information, e.g., literature databases, are frequently classifi ed as secondary databases.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Attwood TK, Bradley P, Flower DR, Gaulton A et al (2003) PRINTS and its automatic supplement, prePRINTS. Nucleic Acids Res 31:400–402
Bateman A, Birney E, Cerruti L, Durbin R et al (2002) The Pfam Protein Families Database. Nucleic Acids Res 30:276–280
Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE (2000) The protein data bank. Nucleic Acids Res 28:235–242
Etzold T, Ulyanov A, Argos P (1996) SRS: information retrieval system for molecular biology data banks. Methods Enzymol 266:114–128
Falquet L, Pagni M, Bucher P, Hulo N, Sigrist CJA, Hofmann K, Bairoch A (2002) The PROSITE database, its status in 2002. Nucleic Acids Res 30:235–238
Greene LH, Lewis TE, Addou S, Cuff A et al (2007) The CATH domain structure database: new protocols and classification levels give a more comprehensive resource for exploring evolution. Nucleic Acids Res 35:D291–D297
Kahraman A, Avramov A, Nashev L, Popov D et al (2005) PhenomicDB: a multi-species genotype/phenotype database for comparative phenomics. Bioinformatics 21:418–420
Mulder NJ, Apweiler R, Attwood TK, Bairoch A et al (2007) New developments in the InterPro database. Nucleic Acids Res 35:D224–D228
Murzin AG, Brenner SE, Hubbard T, Chothia C (1995) SCOP: a structural classification of proteins database for the investigation of sequences and structures. J Mol Biol 247:536–540
The UniProt Consortium (2007) The universal protein resource (UniProt). Nucleic Acid Res 35:D193–D197
Wheeler DL, Barrett T, Benson DA, Bryant SH et al (2007) Database resources of the National Center for Biotechnology Information. Nucleic Acids Res 35:D5–D12
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
(2008). Biological Databases. In: Selzer, P.M., Marhöfer, R.J., Rohwer, A. (eds) Applied Bioinformatics. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-72800-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-540-72800-9_3
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-72799-6
Online ISBN: 978-3-540-72800-9
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)