Abstract
In addition to archiving sequence and genome data, the EMBL Outstation—the European Bioinformatics Institute (EBI), provides an ever-expanding number of free network services to external users. For a list of server addresses, see Note 1. This chapter is designed to act as an introduction to these services and help prospective users to get started. It is now possible to access these services via anonymous file transfer protocol (FTP), World Wide Web (WWW), Gopher, and electronic mail (E-mail). The various methods of data retrieval and searching, using all four methods, are covered in this chapter.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Pascarella, S. and Argos, P. (1992) A data bank merging related protein structures and sequences. Prot. Eng. 5, 121–137.
Jurka, J. and Smith, T. (1988) A fundamental division in the Alu family of repeated sequences. Proc. Natl. Acad. Sci. USA 85, 4775–4778.
Specht, T., Wolters, J., and Erdmann, V. A. (1991) Compilation of 5S rRNA and 5S rRNA gene sequences. Nucleic Acids Res. 19(Suppl.), 2189–2191.
Rodriguez-Tomé, P. (1997) The Radiation Hybrid Database. Nucleic Acids Res., submitted for publication.
Henikoff, S. and Henikoff, J. G. (1991) Automated assembly of protein blocks for database searching. Nucleic Acids Res. 19, 6565–6572.
Larsen, F., Gundersen, G., Lopez, L., and Prydz, H. (1992) CpG island as gene markers in the human genome. Genomics 13, 1095–1107.
Wada, K., Wada, Y., Doi, H., Ishibashi, F., Gojobori, T., and Ikemura, T. (1991) Codon usage tabulated from the GenBank genetic sequence data. Nucleic Acids Res. 18, 1981–1986.
Boguski, M. S., Lowe, T. M. J., and Tolstoshev, C. M. (1993) dbEST—database for “expressed sequence tags”. Nature Genetics 4, 332,333.
Olson, M., Hood, L., Cantor, C., and Botstein, D. (1989) A common language for physical mapping of the human genome. Science 254, 1434,1435.
Kabsch, W. and Sander, C. (1983) Dictionary of protein secondary structure-pattern-recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637.
Wahl, R., Rice, P., Rice, C. M., and Kroeger, M. (1994) ECD—a totally integrated database of Escherichia coli K12. Nucleic Acids Res. 22, 3450–3455.
Emmert, D. B., Stoehr, P. J., Stoesser, G., and Cameron, G. N. (1994) The European Bioinformatics Institute (EBI) databases. Nucleic Acids Res. 22, 3445–3449.
Bairoch, A. (1994) The ENZYME data bank. Nucleic Acids Res. 22, 3626,3627.
Bucher, P. and Trifonov, E. N. (1986) Compilation and analysis of eukaryotic POL II promoter sequences. Nucleic Acids Res. 14, 10,009–10,026.
The FlyBase Consortium (1994) FlyBas—the Drosophila database. Nucleic Acids Res. 22, 3456–3458.
Holm, L., Ouzounis, C., Sander, C., Tuparev, G., and Vriend, G. (1992) A database of protein structure families with common folding motifs. Prot. Sci. 1, 1691–1698.
Tuddenham, E. G., Schwaab, R., Seehafer, J., Millar, D. S., Gitschier, F., Higuchi, M., Bidichandani, S., Connor, J. M., Hoyer, L. W., and Yoshioka, A. (1994) Haemophilia A: database of nucleotide substitutions, deletions, insertions and rearrangements of the factor VIII gene, second edition (corrected and republished article originally printed in Nucleic Acids Res. 22, 3511-3533 [19941]). Nucleic Acids Res. 22, 4851–4868.
Gianelli, F., Green, P. M., Sommer, S. S., Lillicrap, D. P., Ludwig, M., Schwaab, R., Reitsma, P. H., Goossens, M., Yoshioka, A., and Brownlee, G. G. (1994) Haemophilia B: database of point mutations and short additions and deletions, fifth edition, 1994. Nucleic Acids Res. 22, 3534–3546.
Bodmer, J. G., Marsh, S. G., Albert, E. D., Bodmer, W. F., DuPont, B., Erlich, H. A., Mach, B., Mayr, W. R., Parham, P., and Sasazuki, T. (1994) Nomenclature for factors of the HLA system, 1994. Tissue Antigens 44, 1–18.
Sander, C., and Schneider, R. (1994) The HSSP database of protein structure-sequence alignments. Nucleic Acids Res. 22, 3597–3599.
Lefranc, M.-P. (1995) An integrated database for immunogenetics. Genome Digest 2(l), 9.
Kabat, E. A., Wu, T. T., Perry, H. M., Goettesman, K. S., and Foeller, C. (1992) Sequences of Proteins of Immunological Interest, 5th ed. NIH Publications, Washington, DC.
Keen, G., Redgrave, G., Lawton, J., Cinkosky, M., Mishra, S., Fickett, J., and Burks, C. (1992) Access to molecular biology databases. Math. Comput. Modelling 16, 93–101.
Doelz, R., Mosse, M. O., Slonimski, P. P., Bairoch, A., and Linder, P. (1994) LISTA, LISTA-HOP and LISTA-HON: a comprehensive compilation of protein encoding sequences and its associated homology databases from the yeast Saccharomyces. Nucleic Acids Res. 22, 3459–3461.
Nelson, N. and McClelland, M. (1991) Site-specific methylation: effect on DNA modification methyltransferases and restriction endonucleases. Nucleic Acids Res. 19(Suppl.), 2045–2071.
Holm, L. and Sander, C. (1992) Evaluation of protein models by atomic solvation preference. J. Mol. Biol. 225, 93–105.
Pattabiraman, N., Namboodiri, K., Lowrey, A., and Gaber, B. P. (1990) NRL-three-dimensional: a sequence-structure database derived from the protein data bank (PDB) and searchable within the PIR environment. Protein Seq. Data Anal. 3, 387–405.
Perriere, G., Gouy, M., and Gojobori, T. (1994) NRSub: a non-redundant data base for the Bacillus subtilis genome. Nucleic Acids Res. 22, 5525–5529.
Isohikhes, I. and Trifonov, E. N. (1993) Nucleosomal DNA sequence database. Nucleic Acids Res. 21, 4857–4859.
Hollstein, M., Rice, K., Greenblatt, M. S., Soussi, T., Fuchs, R., Sorlie, T., Hovig, E., Smith-Sorensen, B., Montesano, R., and Harris, C. C. (1994) Database of p53 gene somatic mutations in human tumors and cell lines. Nucleic Acids Res. 22, 3551–3555.
Bernstein, F. C., Koetzle, T. F., Williams, G. J. B., Meyer, E. F., Jr., Brice, M. D., Rodgers, J. R., Kennard, O., Shimanouchi, T., and Tasumi, M. (1977) The Protein Data Bank: a computer-based archival file for macromolecular structures. J. Mol. Biol. 112, 535–542.
Hobohm, U., Scharf, M., Schneider, R., and Sander, C. (1992) Selection of a representative set of structures from the Brookhaven Protein Data Bank. Prot. Sci. 1, 409–417.
Barker, W. C., George, D. G., Mewes, H. W., Pfeiffer, F., and Tsugita, A. (1993) The PIR-Intertional databases. Nucleic Acids Res. 21, 3089–3092.
Hanks, S. K. and Quinn, A. M. (1991) Protein kinase catalytic domain sequence database: identification of conserved features of primary structure and classification of family members. Methods Enzymol. 200, 38–62.
Attwood, T. K., Beck, M. E., Bleasby, A. J., and Parry-Smith, D. J. (1994) PRINTS—A database of protein motif fingerprints. Nucleic Acids Res. 22, 3590–3596.
Sonnhammer, E. L. L. and Kahn, D. (1994) The modular arrangement of proteins as inferred from analysis of homology. Prot. Sci. 3, 482–492.
Bairoch, A. and Bucher, P. (1994) PROSITE: recent developments. Nucleic Acids Res. 22, 3583–3589.
Holm, L. and Sander, C. (1994) Parser for protein folding units. Proteins 19, 256–268.
Maidak, B. L., Larsen, N., McCaughey, M. J., Overbeek, R., Olsen, G. J., Fogel, K, Blandy, J., and Woese, C. R. (1994) The ribosomal database project. Nucleic Acids Res. 22, 3485–3487.
Roberts, R. J. and Macelis, D. (1994) REBASE: restriction enzymes and methylases. Nucleic Acids Res. 22, 3628–3639.
Raschke, E. (1993) Comprehensive restriction enzyme lists to update any DNA sequence computer program. Gen. Anal. Tech. Appl. 10, 49–60.
Jurka, J., Walichiewicz, J., and Milosavljevic, A. (1992) Prototypic sequences for human repetitive DNA. J. Mol. Evol. 35, 286–291.
Lehrach, H. (1990) Hybridization fingerprinting in genome mapping and sequencing. Genome Anal. 1, 39–81.
Neefs, J. M., Van de Peer, Y., De Rijk, P., Chapelle, S., and De Wachter, R. (1993) Compilation of small ribosomal subunit RNA structures. Nucleic Acids Res. 21, 3025–3049.
Pongor, S., Hatsagi, Z., Degtyarenko, K., Fabian, P., Skerl, V., Hegyo, H., Myrvai, J., and Bevilacqua, V. (1994) The SBASE protein domain library, release 3.0: a collection of annotated protein sequence segments. Nucleic Acids Res. 22, 3610–3615.
Bairoch, A. (1991) SEQANALREF: a sequence analysis bibliographic reference databank. Computer Appl. Biosci. 7, 268.
Shumyatsky, G. and Reddy, R. (1992) Compilation of small RNA sequences. Nucleic Acids Res. 20(Suppl.), 2159–2165.
Larsen, N. and Zwieb, C. (1993) The signal recognition particle database (SRPDB). Nucleic Acids Res. 21, 3019,3020.
Bairoch, A. and Boeckmann, B. (1994) The SWISS-PROT protein sequence data bank: current status. Nucleic Acads Res. 22, 3578–3580.
Ghosh, D. (1992) TFD: the transcription factors database. Nucleic Acids Res. 20(Suppl.), 2091–2093.
Wingender, E. (1988) Compilation of transcription regulating proteins. Nucleic Acids Res. 16, 1879–1902.
Brown, C. M., Stockwell, P. A., Dalphin, M. E., and Tate, W. P. (1994) The translational termination signal database (TransTerm) now also includes initiation contexts. Nucleic Acids Res. 22, 362–3624.
Steinberg, S., Misch, A., and Sprinzl, M. (1993) Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 21, 3011–3015.
Liebl, S. and Sonnhammer, E. (1994) MIPS, Germany and Sanger Centre, UK.
Eztold, T. and Argos, P. (1993) SRS an indexing and retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9, 49–57.
Pearson, W. R. and Lipman, D. J. (1988) Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. USA 85, 2444–2448.
Sturrock, S. S. and Collins, J. F. (1993) MPsrch version 1.3. Biocomputing Research Unit, University of Edinburgh, UK.
Smith, T. F. and Waterman, M. S. (1981) Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Humana Press Inc.
About this protocol
Cite this protocol
Flores, T.P., Harper, R.A. (1997). The European Bioinformatics Institute. In: Swindell, S.R. (eds) Sequence Data Analysis Guidebook. Methods In Molecular Medicine™, vol 70. Springer, Totowa, NJ. https://doi.org/10.1385/0-89603-358-9:155
Download citation
DOI: https://doi.org/10.1385/0-89603-358-9:155
Publisher Name: Springer, Totowa, NJ
Print ISBN: 978-0-89603-358-0
Online ISBN: 978-1-59259-556-3
eBook Packages: Springer Protocols