Nonparametric Frequency Detection and Optimal Coding in Molecular Biology

  • David S. Stoffer
Part of the International Series in Operations Research & Management Science book series (ISOR, volume 46)


The spectral envelope, a concept for analyzing periodicities in categorical-valued time series, was introduced in the statistics literature as a computationally simple and general methodology for the harmonic analysis and scaling of non-numeric sequences. One benefit of the technique is that it combines nonparametric statistical analysis with modern computing power to search long sequences quickly for diagnostic patterns. An interesting area of application is the detection of nucleosome positioning signals and optimal alphabets in long DNA sequences. The examples focus on period lengths in nucleosome signals and on optimal alphabets in herpesviruses, and we point out some inconsistencies in established gene segments.
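
As a rough illustration of the idea, the sketch below computes a sample spectral envelope for a DNA string. It is a minimal sketch under stated assumptions, not the chapter's implementation: the sequence is indicator-coded with the last letter as reference, the raw (unsmoothed) periodogram stands in for the spectral density, and at each Fourier frequency the envelope is taken as the largest eigenvalue of V^{-1/2} Re(I(ω)) V^{-1/2}, where V is the sample covariance of the indicators. The function name `spectral_envelope` and the toy sequence with a planted period-3 signal are hypothetical.

```python
import random
import numpy as np

def spectral_envelope(seq, alphabet="ACGT"):
    """Sample spectral envelope of a categorical sequence (illustrative sketch)."""
    n = len(seq)
    # Indicator-code the first k-1 letters; the last letter is the reference
    # category (an assumption that keeps the covariance matrix nonsingular).
    cats = alphabet[:-1]
    Y = np.array([[1.0 if s == c else 0.0 for c in cats] for s in seq])
    Y = Y - Y.mean(axis=0)
    V = Y.T @ Y / n                          # sample covariance of the indicators
    w, U = np.linalg.eigh(V)                 # V assumed positive definite
    V_inv_sqrt = U @ np.diag(w ** -0.5) @ U.T
    D = np.fft.fft(Y, axis=0)                # DFT of each indicator column
    js = np.arange(1, n // 2 + 1)
    freqs = js / n                           # Fourier frequencies, cycles per base
    env = np.empty(len(js))
    for i, j in enumerate(js):
        d = D[j]
        P = np.outer(d, d.conj()) / n        # raw periodogram matrix at j/n
        M = V_inv_sqrt @ P.real @ V_inv_sqrt
        env[i] = np.linalg.eigvalsh(M)[-1]   # largest eigenvalue
    return freqs, env

# Toy sequence (hypothetical data): every third base is forced to 'A',
# which plants a periodicity at frequency 1/3.
rng = random.Random(0)
seq = "".join("A" if t % 3 == 0 else rng.choice("ACGT") for t in range(300))
freqs, env = spectral_envelope(seq)
peak = freqs[np.argmax(env)]                 # frequency with the largest envelope value
```

In practice the periodogram would usually be smoothed before the eigenvalue step, and the eigenvector at a peak frequency gives the scaling (the "alphabet") that emphasizes that periodicity.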


Keywords: Spectral Analysis; Optimal Scaling; Nucleosome Positioning Signals; Herpesviruses; DNA Sequences




Copyright information

© Springer Science + Business Media, Inc. 2002

Authors and Affiliations

  • David S. Stoffer (1)
  1. Department of Statistics, University of Pittsburgh, Pittsburgh