Skip to main content

Conditional Random Fields for Predicting and Analyzing Histone Occupancy, Acetylation and Methylation Areas in DNA Sequences

  • Conference paper
Applications of Evolutionary Computing (EvoWorkshops 2006)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 3907))

Included in the following conference series:

Abstract

Eukaryotic genomes are packaged by the wrapping of DNA around histone octamers to form nucleosomes. Nucleosome occupancies together with their acetylation and methylation are important modification factors on all nuclear processes involving DNA. There have been recently many studies of mapping these modifications in DNA sequences and of relationship between them and various genetic activities, such as transcription, DNA repair, and DNA remodeling. However, most of these studies are experimental approaches. In this paper, we introduce a computational approach to both predicting and analyzing nucleosome occupancy, acetylation, and methylation areas in DNA sequences. Our method employs conditional random fields (CRFs) to discriminate between DNA areas with high and low relative occupancy, acetylation, or methylation; and rank features of DNA sequences based on their weight in the CRFs model trained from the datasets of these DNA modifications. The results from our method on the yeast genome reveal genetic area preferences of nucleosome occupancy, acetylation, and methylation are consistent with previous studies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bernstein, B.E., Humphrey, E.L., Erlich, R.L., Schneider, R., Bouman, P., Liu, J.S., Kouzarides, T., Schreiber, S.L.: Methylation of histone H3 Lys 4 in coding regions of active genes. Proc. Natl. Acad. Sci. USA. 99(13), 8695–8700 (2002)

    Google Scholar 

  2. Bernstein, B.E., Liu, C.L., Humphrey, E.L., Perlstein, E.O., Schreiber, S.L.: Global nucleosome occupancy in yeast. Genome Biol. 5(9), R62 (2004)

    Article  Google Scholar 

  3. Chen, S.F., Rosenfeld, R.: A gaussian prior for smoothing maximum entropy models. Technical report CMU-CS-99 108 (1999)

    Google Scholar 

  4. Kouzarides, T.: Histone methylation in transcriptional control. Curr. Opin. Genet. Dev. 12(2), 198–209 (2002)

    Article  Google Scholar 

  5. Kurdistani, S.K., Tavazoie, S., Grunstein, M.: Mapping global histone acetylation patterns to gene expression. Cell 117(6), 721–733 (2004)

    Article  Google Scholar 

  6. Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. 18th International Conference on Machine Learing (2001)

    Google Scholar 

  7. Lee, C.K., Shibata, Y., Rao, B., Strahl, B.D., Lieb, J.D.: Evidence for nucleosome depletion at active regulatory regions genome-wide. Nat. Genet. 36(8), 900–905 (2004)

    Article  Google Scholar 

  8. Liu, D., Nocedal, J.: On the limited memory bfgs method for large-scale optimization. Mathematical Programming 45, 503–528 (1989)

    Article  MATH  MathSciNet  Google Scholar 

  9. Luger, K., Mader, A.W., Richmond, R.K., Sargent, D.F., Richmond, T.J.: Crystal structure of the nucleosome core particle at 2. 8 A resolution. Nature 389(6648), 251–260 (1997)

    Google Scholar 

  10. Malouf, R.: A comparison of algorithms for maximum entropy parameter estimation. In: Proc. Proceeding CoNLL (2002)

    Google Scholar 

  11. McCallum, A.: Maximum entropy markov models for information extraction and segmentation. In: Proc. 15th International Conference on Machine Learing (2000)

    Google Scholar 

  12. McCallum, A.: Efficiently inducing features of conditional random fields. In: Proc. 19th Conference on Uncertainy in Artificial Intelligence (2003)

    Google Scholar 

  13. Narlikar, G.J., Fan, H.Y., Kingston, R.E.: Cooperation between complexes that regulate chromatin structure and transcription. Cell 108(4), 475–487 (2002)

    Article  Google Scholar 

  14. Peterson, C.L., Laniel, M.A.: Histones and histone modifications. Curr. Biol. 14(14), R546–R551 (2004)

    Article  Google Scholar 

  15. Pham, T.H., Tran, D.H., Ho, T.B., Satou, K., Valiente, G.: Qualitatively predicting acetylation and methylation areas in dna sequences. In: Proc. 16th International Conference on Genome Informatics (2005)

    Google Scholar 

  16. Pokholok, D.K., Harbison, C.T., Levine, S., Cole, M., Hannett, N.M., Lee, T.I., Bell, G.W., Walker, K., Rolfe, P.A., Herbolsheimer, E., Zeitlinger, J., Lewitter, F., Gifford, D.K., Young, R.A.: Genome-wide map of nucleosome acetylation and methylation in yeast. Cell 122(4), 517–527 (2005)

    Article  Google Scholar 

  17. Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. In: Proc. Proceeding of IEEE, pp. 257–286 (1989)

    Google Scholar 

  18. Ren, B., Robert, F., et al.: Genome-wide location and function of DNA binding proteins. Science 290(5500), 2306–2309 (2000)

    Article  Google Scholar 

  19. Robyr, D., Suka, Y., Xenarios, I., Kurdistani, S.K., Wang, A., Suka, N., Grunstein, M.: Microarray deacetylation maps determine genome-wide functions for yeast histone deacetylases. Cell 109(4), 437–446 (2002)

    Article  Google Scholar 

  20. Sha, F., Pereira, F.: Shalow parsing with conditional random fields. In: Proc. 15th Proceeding of Human Language Technology (2003)

    Google Scholar 

  21. Wallach, H.: Efficient Training of Conditional Random Fields. Master thesis (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Tran, D.H., Pham, T.H., Satou, K., Ho, T.B. (2006). Conditional Random Fields for Predicting and Analyzing Histone Occupancy, Acetylation and Methylation Areas in DNA Sequences. In: Rothlauf, F., et al. Applications of Evolutionary Computing. EvoWorkshops 2006. Lecture Notes in Computer Science, vol 3907. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11732242_20

Download citation

  • DOI: https://doi.org/10.1007/11732242_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-33237-4

  • Online ISBN: 978-3-540-33238-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics