Abstract
This chapter describes the programs available for searching GCG databases with text-based or sequence-based queries, and the programs that allow users to create their own flat-file databases, using the Genetics Computer Group (GCG) package and the X-windows SeqLab interface. Where appropriate, the equivalent command-line options are listed in the same format as in the GCG Program Manual, for example -WORDsize=2. It is assumed that the reader has access to the GCG help documents, either through the SeqLab or SeqWeb interfaces or through the GCG Program Manuals that come with the software package.
Suggested Reading
Searching by Keyword
Bailey, T. L. and Gribskov, M. (1998) Combining evidence using p-values: application to sequence homology searches, Bioinformatics 14, 48–54.
Lookup
Etzold, T. and Argos, P. (1993) SRS: an indexing and retrieval tool for flat file data libraries, Comp. Appl. Biosci. 9(1), 49–57.
Searching with Query Sequences
Dayhoff, M. O., Schwartz, R. M., and Orcutt, B. C. (1978) Atlas of Protein Sequence and Structure, vol. 5, Suppl. 3 (Dayhoff, M. O., ed.) National Biomedical Research Foundation, Washington, DC, pp. 345–352.
Henikoff, S. and Henikoff, J. G. (1992) Amino acid substitution matrices from protein blocks, Proc. Natl. Acad. Sci. USA 89(22), 10,915–10,919.
BLAST/NetBLAST
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990) Basic local alignment search tool, J. Mol. Biol. 215, 403–410.
Altschul, S. F., Madden, T. L., Schaffer, A. A., Zhang, J., Zhang, Z., Miller, W., and Lipman, D. J. (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs, Nucleic Acids Res. 25, 3389–3402.
Shpaer, E. G., Robinson, M., Yee, D., Candlin, J. D., Mines, R., and Hunkapiller, T. (1996) Sensitivity and selectivity in protein similarity searches: a comparison of Smith-Waterman in hardware to BLAST and FastA, Genomics 38, 179–191.
Wootton, J. C. and Federhen, S. (1996) Analysis of compositionally biased regions in sequence databases, Methods Enzymol. 266, 554–571.
FastA Suite/Ssearch/FrameSearch
Pearson, W. R. and Lipman, D. J. (1988) Improved tools for biological sequence comparison, Proc. Natl. Acad. Sci. USA 85(8), 2444–2448.
Pearson, W. R. (1995) Comparison of methods for searching protein sequence databases, Protein Sci. 4, 1145–1160.
Smith, T. F. and Waterman, M. S. (1981) Comparison of Bio-Sequences, Adv. Appl. Math. 2, 482–489.
Rules for Effective Database Searching
Pearson, W. R. (1996) Effective protein sequence comparison, Methods Enzymol. 266, 227–258.
Pearson, W. R. (2000) Flexible sequence similarity searching with the FastA3 program package, Methods Mol. Biol. 132, 185–219.
Copyright information
© 2003 Springer Science+Business Media New York
About this chapter
Cite this chapter
Heard, D.J. (2003). GCG Database Searching. In: Krawetz, S.A., Womble, D.D. (eds) Introduction to Bioinformatics. Humana Press, Totowa, NJ. https://doi.org/10.1007/978-1-59259-335-4_28
DOI: https://doi.org/10.1007/978-1-59259-335-4_28
Publisher Name: Humana Press, Totowa, NJ
Print ISBN: 978-1-58829-241-4
Online ISBN: 978-1-59259-335-4
eBook Packages: Springer Book Archive