The European Bioinformatics Institute

Network Services
  • Tomas P. Flores
  • Robert A. Harper
Part of the Methods In Molecular Medicine™ book series (MIMB, volume 70)


In addition to archiving sequence and genome data, the EMBL Outstation—the European Bioinformatics Institute (EBI), provides an ever-expanding number of free network services to external users. For a list of server addresses, see Note 1. This chapter is designed to act as an introduction to these services and help prospective users to get started. It is now possible to access these services via anonymous file transfer protocol (FTP), World Wide Web (WWW), Gopher, and electronic mail (E-mail). The various methods of data retrieval and searching, using all four methods, are covered in this chapter.


European Bioinformatics Institute File Transfer Protocol Gopher Server Mandatory Line File Transfer Protocol Server 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Pascarella, S. and Argos, P. (1992) A data bank merging related protein structures and sequences. Prot. Eng. 5, 121–137.CrossRefGoogle Scholar
  2. 2.
    Jurka, J. and Smith, T. (1988) A fundamental division in the Alu family of repeated sequences. Proc. Natl. Acad. Sci. USA 85, 4775–4778.PubMedCrossRefGoogle Scholar
  3. 3.
    Specht, T., Wolters, J., and Erdmann, V. A. (1991) Compilation of 5S rRNA and 5S rRNA gene sequences. Nucleic Acids Res. 19(Suppl.), 2189–2191.PubMedGoogle Scholar
  4. 4.
    Rodriguez-Tomé, P. (1997) The Radiation Hybrid Database. Nucleic Acids Res., submitted for publication.Google Scholar
  5. 5.
    Henikoff, S. and Henikoff, J. G. (1991) Automated assembly of protein blocks for database searching. Nucleic Acids Res. 19, 6565–6572.PubMedCrossRefGoogle Scholar
  6. 6.
    Larsen, F., Gundersen, G., Lopez, L., and Prydz, H. (1992) CpG island as gene markers in the human genome. Genomics 13, 1095–1107.PubMedCrossRefGoogle Scholar
  7. 7.
    Wada, K., Wada, Y., Doi, H., Ishibashi, F., Gojobori, T., and Ikemura, T. (1991) Codon usage tabulated from the GenBank genetic sequence data. Nucleic Acids Res. 18, 1981–1986.Google Scholar
  8. 8.
    Boguski, M. S., Lowe, T. M. J., and Tolstoshev, C. M. (1993) dbEST—database for “expressed sequence tags”. Nature Genetics 4, 332,333.PubMedCrossRefGoogle Scholar
  9. 9.
    Olson, M., Hood, L., Cantor, C., and Botstein, D. (1989) A common language for physical mapping of the human genome. Science 254, 1434,1435.CrossRefGoogle Scholar
  10. 10.
    Kabsch, W. and Sander, C. (1983) Dictionary of protein secondary structure-pattern-recognition of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637.PubMedCrossRefGoogle Scholar
  11. 11.
    Wahl, R., Rice, P., Rice, C. M., and Kroeger, M. (1994) ECD—a totally integrated database of Escherichia coli K12. Nucleic Acids Res. 22, 3450–3455.PubMedCrossRefGoogle Scholar
  12. 12.
    Emmert, D. B., Stoehr, P. J., Stoesser, G., and Cameron, G. N. (1994) The European Bioinformatics Institute (EBI) databases. Nucleic Acids Res. 22, 3445–3449.PubMedCrossRefGoogle Scholar
  13. 13.
    Bairoch, A. (1994) The ENZYME data bank. Nucleic Acids Res. 22, 3626,3627.PubMedCrossRefGoogle Scholar
  14. 14.
    Bucher, P. and Trifonov, E. N. (1986) Compilation and analysis of eukaryotic POL II promoter sequences. Nucleic Acids Res. 14, 10,009–10,026.PubMedCrossRefGoogle Scholar
  15. 15.
    The FlyBase Consortium (1994) FlyBas—the Drosophila database. Nucleic Acids Res. 22, 3456–3458.CrossRefGoogle Scholar
  16. 16.
    Holm, L., Ouzounis, C., Sander, C., Tuparev, G., and Vriend, G. (1992) A database of protein structure families with common folding motifs. Prot. Sci. 1, 1691–1698.CrossRefGoogle Scholar
  17. 17.
    Tuddenham, E. G., Schwaab, R., Seehafer, J., Millar, D. S., Gitschier, F., Higuchi, M., Bidichandani, S., Connor, J. M., Hoyer, L. W., and Yoshioka, A. (1994) Haemophilia A: database of nucleotide substitutions, deletions, insertions and rearrangements of the factor VIII gene, second edition (corrected and republished article originally printed in Nucleic Acids Res. 22, 3511-3533 [19941]). Nucleic Acids Res. 22, 4851–4868.PubMedCrossRefGoogle Scholar
  18. 18.
    Gianelli, F., Green, P. M., Sommer, S. S., Lillicrap, D. P., Ludwig, M., Schwaab, R., Reitsma, P. H., Goossens, M., Yoshioka, A., and Brownlee, G. G. (1994) Haemophilia B: database of point mutations and short additions and deletions, fifth edition, 1994. Nucleic Acids Res. 22, 3534–3546.CrossRefGoogle Scholar
  19. 19.
    Bodmer, J. G., Marsh, S. G., Albert, E. D., Bodmer, W. F., DuPont, B., Erlich, H. A., Mach, B., Mayr, W. R., Parham, P., and Sasazuki, T. (1994) Nomenclature for factors of the HLA system, 1994. Tissue Antigens 44, 1–18.PubMedCrossRefGoogle Scholar
  20. 20.
    Sander, C., and Schneider, R. (1994) The HSSP database of protein structure-sequence alignments. Nucleic Acids Res. 22, 3597–3599.PubMedGoogle Scholar
  21. 21.
    Lefranc, M.-P. (1995) An integrated database for immunogenetics. Genome Digest 2(l), 9.Google Scholar
  22. 22.
    Kabat, E. A., Wu, T. T., Perry, H. M., Goettesman, K. S., and Foeller, C. (1992) Sequences of Proteins of Immunological Interest, 5th ed. NIH Publications, Washington, DC.Google Scholar
  23. 23.
    Keen, G., Redgrave, G., Lawton, J., Cinkosky, M., Mishra, S., Fickett, J., and Burks, C. (1992) Access to molecular biology databases. Math. Comput. Modelling 16, 93–101.CrossRefGoogle Scholar
  24. 24.
    Doelz, R., Mosse, M. O., Slonimski, P. P., Bairoch, A., and Linder, P. (1994) LISTA, LISTA-HOP and LISTA-HON: a comprehensive compilation of protein encoding sequences and its associated homology databases from the yeast Saccharomyces. Nucleic Acids Res. 22, 3459–3461.CrossRefGoogle Scholar
  25. 25.
    Nelson, N. and McClelland, M. (1991) Site-specific methylation: effect on DNA modification methyltransferases and restriction endonucleases. Nucleic Acids Res. 19(Suppl.), 2045–2071.PubMedGoogle Scholar
  26. 26.
    Holm, L. and Sander, C. (1992) Evaluation of protein models by atomic solvation preference. J. Mol. Biol. 225, 93–105.PubMedCrossRefGoogle Scholar
  27. 27.
    Pattabiraman, N., Namboodiri, K., Lowrey, A., and Gaber, B. P. (1990) NRL-three-dimensional: a sequence-structure database derived from the protein data bank (PDB) and searchable within the PIR environment. Protein Seq. Data Anal. 3, 387–405.PubMedGoogle Scholar
  28. 28.
    Perriere, G., Gouy, M., and Gojobori, T. (1994) NRSub: a non-redundant data base for the Bacillus subtilis genome. Nucleic Acids Res. 22, 5525–5529.PubMedCrossRefGoogle Scholar
  29. 29.
    Isohikhes, I. and Trifonov, E. N. (1993) Nucleosomal DNA sequence database. Nucleic Acids Res. 21, 4857–4859.CrossRefGoogle Scholar
  30. 30.
    Hollstein, M., Rice, K., Greenblatt, M. S., Soussi, T., Fuchs, R., Sorlie, T., Hovig, E., Smith-Sorensen, B., Montesano, R., and Harris, C. C. (1994) Database of p53 gene somatic mutations in human tumors and cell lines. Nucleic Acids Res. 22, 3551–3555.PubMedGoogle Scholar
  31. 31.
    Bernstein, F. C., Koetzle, T. F., Williams, G. J. B., Meyer, E. F., Jr., Brice, M. D., Rodgers, J. R., Kennard, O., Shimanouchi, T., and Tasumi, M. (1977) The Protein Data Bank: a computer-based archival file for macromolecular structures. J. Mol. Biol. 112, 535–542.PubMedCrossRefGoogle Scholar
  32. 32.
    Hobohm, U., Scharf, M., Schneider, R., and Sander, C. (1992) Selection of a representative set of structures from the Brookhaven Protein Data Bank. Prot. Sci. 1, 409–417.CrossRefGoogle Scholar
  33. 33.
    Barker, W. C., George, D. G., Mewes, H. W., Pfeiffer, F., and Tsugita, A. (1993) The PIR-Intertional databases. Nucleic Acids Res. 21, 3089–3092.PubMedCrossRefGoogle Scholar
  34. 34.
    Hanks, S. K. and Quinn, A. M. (1991) Protein kinase catalytic domain sequence database: identification of conserved features of primary structure and classification of family members. Methods Enzymol. 200, 38–62.PubMedCrossRefGoogle Scholar
  35. 35.
    Attwood, T. K., Beck, M. E., Bleasby, A. J., and Parry-Smith, D. J. (1994) PRINTS—A database of protein motif fingerprints. Nucleic Acids Res. 22, 3590–3596.PubMedGoogle Scholar
  36. 36.
    Sonnhammer, E. L. L. and Kahn, D. (1994) The modular arrangement of proteins as inferred from analysis of homology. Prot. Sci. 3, 482–492.CrossRefGoogle Scholar
  37. 37.
    Bairoch, A. and Bucher, P. (1994) PROSITE: recent developments. Nucleic Acids Res. 22, 3583–3589.PubMedCrossRefGoogle Scholar
  38. 38.
    Holm, L. and Sander, C. (1994) Parser for protein folding units. Proteins 19, 256–268.PubMedCrossRefGoogle Scholar
  39. 39.
    Maidak, B. L., Larsen, N., McCaughey, M. J., Overbeek, R., Olsen, G. J., Fogel, K, Blandy, J., and Woese, C. R. (1994) The ribosomal database project. Nucleic Acids Res. 22, 3485–3487.PubMedCrossRefGoogle Scholar
  40. 40.
    Roberts, R. J. and Macelis, D. (1994) REBASE: restriction enzymes and methylases. Nucleic Acids Res. 22, 3628–3639.PubMedCrossRefGoogle Scholar
  41. 41.
    Raschke, E. (1993) Comprehensive restriction enzyme lists to update any DNA sequence computer program. Gen. Anal. Tech. Appl. 10, 49–60.Google Scholar
  42. 42.
    Jurka, J., Walichiewicz, J., and Milosavljevic, A. (1992) Prototypic sequences for human repetitive DNA. J. Mol. Evol. 35, 286–291.PubMedCrossRefGoogle Scholar
  43. 43.
    Lehrach, H. (1990) Hybridization fingerprinting in genome mapping and sequencing. Genome Anal. 1, 39–81.Google Scholar
  44. 44.
    Neefs, J. M., Van de Peer, Y., De Rijk, P., Chapelle, S., and De Wachter, R. (1993) Compilation of small ribosomal subunit RNA structures. Nucleic Acids Res. 21, 3025–3049.PubMedCrossRefGoogle Scholar
  45. 45.
    Pongor, S., Hatsagi, Z., Degtyarenko, K., Fabian, P., Skerl, V., Hegyo, H., Myrvai, J., and Bevilacqua, V. (1994) The SBASE protein domain library, release 3.0: a collection of annotated protein sequence segments. Nucleic Acids Res. 22, 3610–3615.PubMedGoogle Scholar
  46. 46.
    Bairoch, A. (1991) SEQANALREF: a sequence analysis bibliographic reference databank. Computer Appl. Biosci. 7, 268.Google Scholar
  47. 47.
    Shumyatsky, G. and Reddy, R. (1992) Compilation of small RNA sequences. Nucleic Acids Res. 20(Suppl.), 2159–2165.PubMedGoogle Scholar
  48. 48.
    Larsen, N. and Zwieb, C. (1993) The signal recognition particle database (SRPDB). Nucleic Acids Res. 21, 3019,3020.PubMedCrossRefGoogle Scholar
  49. 49.
    Bairoch, A. and Boeckmann, B. (1994) The SWISS-PROT protein sequence data bank: current status. Nucleic Acads Res. 22, 3578–3580.Google Scholar
  50. 50.
    Ghosh, D. (1992) TFD: the transcription factors database. Nucleic Acids Res. 20(Suppl.), 2091–2093.PubMedGoogle Scholar
  51. 51.
    Wingender, E. (1988) Compilation of transcription regulating proteins. Nucleic Acids Res. 16, 1879–1902.PubMedCrossRefGoogle Scholar
  52. 52.
    Brown, C. M., Stockwell, P. A., Dalphin, M. E., and Tate, W. P. (1994) The translational termination signal database (TransTerm) now also includes initiation contexts. Nucleic Acids Res. 22, 362–3624.Google Scholar
  53. 53.
    Steinberg, S., Misch, A., and Sprinzl, M. (1993) Compilation of tRNA sequences and sequences of tRNA genes. Nucleic Acids Res. 21, 3011–3015.PubMedCrossRefGoogle Scholar
  54. 54.
    Liebl, S. and Sonnhammer, E. (1994) MIPS, Germany and Sanger Centre, UK.Google Scholar
  55. 55.
    Eztold, T. and Argos, P. (1993) SRS an indexing and retrieval tool for flat file data libraries. Comput. Appl. Biosci. 9, 49–57.Google Scholar
  56. 56.
    Pearson, W. R. and Lipman, D. J. (1988) Improved tools for biological sequence comparison. Proc. Natl. Acad. Sci. USA 85, 2444–2448.PubMedCrossRefGoogle Scholar
  57. 57.
    Sturrock, S. S. and Collins, J. F. (1993) MPsrch version 1.3. Biocomputing Research Unit, University of Edinburgh, UK.Google Scholar
  58. 58.
    Smith, T. F. and Waterman, M. S. (1981) Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197.PubMedCrossRefGoogle Scholar

Copyright information

© Humana Press Inc. 1997

Authors and Affiliations

  • Tomas P. Flores
    • 1
  • Robert A. Harper
    • 1
  1. 1.EMBL Outstation—The EBICambridgeUK

Personalised recommendations