Converting Between Sequence Formats
A “sequence format” is a punctuation style, or defined layout of text, within a computer file that separates a sequence from everything else in the file. It allows computer programs that “understand” the format to distinguish between the sequence and any reference documentation stored alongside it. Some format definitions, such as most database formats, extend to the documentation itself, allowing software to locate specific reference information (e.g., authors, journals, species classification, coding regions).
Keywords: Output File, Sequence Format, Sequence File, Index File, Genetic Computer Group
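As a rough illustration of the idea that a format separates a sequence from its documentation, the sketch below reads a FASTA-style record (lines starting with ">" are documentation, all other lines are sequence) and rewrites it in a simplified tagged layout. Both the choice of FASTA as the input and the two-letter tags in the output are assumptions made for this example; the output layout is only loosely modelled on database flat files and is not a complete or official format. The function names `parse_fasta` and `to_tagged` are hypothetical.

```python
def parse_fasta(text):
    """Split FASTA-style text into (description, sequence) pairs."""
    records = []
    description, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # A ">" line is documentation; it also starts a new record.
            if description is not None:
                records.append((description, "".join(chunks)))
            description, chunks = line[1:].strip(), []
        elif description is not None:
            # Every other line inside a record is sequence data.
            chunks.append(line)
    if description is not None:
        records.append((description, "".join(chunks)))
    return records


def to_tagged(records, width=60):
    """Write records in a simplified tagged layout (illustrative only)."""
    out = []
    for description, sequence in records:
        out.append(f"DE   {description}")                       # documentation line
        out.append(f"SQ   Sequence {len(sequence)} residues")   # marks start of sequence
        for start in range(0, len(sequence), width):
            out.append("     " + sequence[start:start + width])
        out.append("//")                                         # end-of-record marker
    return "\n".join(out) + "\n"


# Hypothetical example record, invented for this sketch.
example = ">demo1 hypothetical example record\nACGTACGT\nTTGA\n"
records = parse_fasta(example)
print(to_tagged(records), end="")
```

The conversion goes through a neutral in-memory form (a description and a sequence string), so each format needs only one reader and one writer rather than a dedicated converter for every pair of formats.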