Abstract
Many microbial genome sequences have been determined, and more new genome projects are ongoing. Shotgun sequencing of randomly cloned short pieces of genomic DNA can provide a simple way of determining whole genome sequences. This process requires sequencing of many fragments, compilation of the separate sequences into one contiguous sequence, and careful editing of the assembled sequence. The genes present on the microbial genome are then predicted using clues derived from typical gene features, such as codon usage, ribosomal binding sequences, and bacterial initiation codons. Function of genes is predicted by homology searches performed against either public or well-established protein databases. This chapter discusses each of these stages in a genome-sequencing project.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ewing, B., Hillier, L., Wendl, M. C., and Green, P. (1998) Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res. 8, 175–185.
Ewing, B. and Green, P. (1998) Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 8, 186–194.
Maidak, B. L., Cole, J. R., Parker, C. T., Jr., Garrity, G. M., Larsen, N., Li, B., et al. (1999) A new version of the RDP (Ribosomal Database Project). Nucleic Acids Res. 27, 171–173.
Gordon, D., Abajian, C., and Green, P. (1998) Consed: a graphical tool for sequence finishing. Genome Res. 8, 195–202.
Lowe, T. M. and Eddy, S. R. (1997) tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25, 955–964.
Nakamura, Y., Gojobori, T., and Ikemura, T. (2000) Codon usage tabulated from international DNA sequence databases: status for the year 2000. Nucleic Acids Res. 28, 292.
Borodovsky, M., McIninch, J. D., Koonin, E. V., Rudd, K. E., Medigue, C., and Danchin, A. (1995) Detection of new genes in a bacterial genome using Markov models for three gene classes. Nucleic Acids Res. 23, 3554–3562.
Delcher, A. L., Harmon, D., Kasif, S., White, O., and Salzberg, S. L. (1999) Improved microbial gene identification with GLIMMER. Nucleic Acids Res. 27, 4636–4641.
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990) Basic local alignment search tool. J. Mol. Biol. 215, 403–410.
Pearson, W. R. (1995) Comparison of methods for searching protein sequence databases. Protein Sci. 4, 1145–1160.
Kyrpides, N. C. and Ouzounis, C. A. (1999) Whole-genome sequence annotation: “Going wrong with confidence.” Mol. Microbiol. 32, 886–887.
Tatusov, R. L., Koonin, E. V., and Lipman, D. J. (1997) A genomic perspective on protein families. Science 278, 631–637.
Higgins, D. G., Thompson, J. D., and Gibson, T. J. (1996) Using CLUSTAL for multiple sequence alignments. Methods Enzymol. 266, 383–402.
Thompson, J. D., Gibson, T. J., Plewniak, F., Jeanmougin, F., and Higgins, D. G. (1997) The CLUSTAL_X Windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 25, 4876–4882.
Apweiler, R., Attwood, T. K., Bairoch, A., Bateman, A., Birney, E., Biswas, M., et al. (2001) The InterPro database, an integrated documentation resource for protein families, domains and functional sites. Nucleic Acids Res. 29, 37–40.
Williams, K. P. (1999) The tmRNA website. Nucleic Acids Res. 27, 165–166.
Falquet, L., Pagni, M., Bucher, P., Hulo, N., Sigrist, C. J., Hofmann, K., et al. (2002) The PROSITE database, its status in 2002. Nucleic Acids Res. 30, 235–238.
Kanehisa, M. and Goto, S. (2000) KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 28, 27–30.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2004 Humana Press Inc.
About this protocol
Cite this protocol
Kuroda, M., Hiramatsu, K. (2004). Genome Sequencing and Annotation. In: Woodford, N., Johnson, A.P. (eds) Genomics, Proteomics, and Clinical Bacteriology. Methods in Molecular Biology™, vol 266. Humana Press. https://doi.org/10.1385/1-59259-763-7:029
Download citation
DOI: https://doi.org/10.1385/1-59259-763-7:029
Publisher Name: Humana Press
Print ISBN: 978-1-58829-218-6
Online ISBN: 978-1-59259-763-5
eBook Packages: Springer Protocols