Content Sensors Based on Codon Structure and DNA Methylation for Gene Finding in Vertebrate Genomes
- 754 Downloads
All vertebrate genomes are heavily methylated at CpG dinucleotide sites, and methylated CpG dinucleotides are prone to CpG→TpG mutations through spontaneous deamination. This leaves different footprints on coding and non-coding sequences. We capture these different fingerprints by five indices that can be used to discriminate between coding and non-coding (intron) sequences. We also show that a linear discriminant function derived from a training set of coding and intron sequences from human chromosome 22 can be successfully used in gene-finding of the zebrafish genome.
Key wordscontent sensor DNA methylation gene finding vertebrate genome codon structure
Unable to display preview. Download preview PDF.