Abstract
Eukaryotic genomes are packaged by the wrapping of DNA around histone octamers to form nucleosomes. Nucleosome occupancies together with their acetylation and methylation are important modification factors on all nuclear processes involving DNA. There have been recently many studies of mapping these modifications in DNA sequences and of relationship between them and various genetic activities, such as transcription, DNA repair, and DNA remodeling. However, most of these studies are experimental approaches. In this paper, we introduce a computational approach to both predicting and analyzing nucleosome occupancy, acetylation, and methylation areas in DNA sequences. Our method employs conditional random fields (CRFs) to discriminate between DNA areas with high and low relative occupancy, acetylation, or methylation; and rank features of DNA sequences based on their weight in the CRFs model trained from the datasets of these DNA modifications. The results from our method on the yeast genome reveal genetic area preferences of nucleosome occupancy, acetylation, and methylation are consistent with previous studies.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Bernstein, B.E., Humphrey, E.L., Erlich, R.L., Schneider, R., Bouman, P., Liu, J.S., Kouzarides, T., Schreiber, S.L.: Methylation of histone H3 Lys 4 in coding regions of active genes. Proc. Natl. Acad. Sci. USA. 99(13), 8695–8700 (2002)
Bernstein, B.E., Liu, C.L., Humphrey, E.L., Perlstein, E.O., Schreiber, S.L.: Global nucleosome occupancy in yeast. Genome Biol. 5(9), R62 (2004)
Chen, S.F., Rosenfeld, R.: A gaussian prior for smoothing maximum entropy models. Technical report CMU-CS-99Â 108 (1999)
Kouzarides, T.: Histone methylation in transcriptional control. Curr. Opin. Genet. Dev. 12(2), 198–209 (2002)
Kurdistani, S.K., Tavazoie, S., Grunstein, M.: Mapping global histone acetylation patterns to gene expression. Cell 117(6), 721–733 (2004)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. 18th International Conference on Machine Learing (2001)
Lee, C.K., Shibata, Y., Rao, B., Strahl, B.D., Lieb, J.D.: Evidence for nucleosome depletion at active regulatory regions genome-wide. Nat. Genet. 36(8), 900–905 (2004)
Liu, D., Nocedal, J.: On the limited memory bfgs method for large-scale optimization. Mathematical Programming 45, 503–528 (1989)
Luger, K., Mader, A.W., Richmond, R.K., Sargent, D.F., Richmond, T.J.: Crystal structure of the nucleosome core particle at 2. 8 A resolution. Nature 389(6648), 251–260 (1997)
Malouf, R.: A comparison of algorithms for maximum entropy parameter estimation. In: Proc. Proceeding CoNLL (2002)
McCallum, A.: Maximum entropy markov models for information extraction and segmentation. In: Proc. 15th International Conference on Machine Learing (2000)
McCallum, A.: Efficiently inducing features of conditional random fields. In: Proc. 19th Conference on Uncertainy in Artificial Intelligence (2003)
Narlikar, G.J., Fan, H.Y., Kingston, R.E.: Cooperation between complexes that regulate chromatin structure and transcription. Cell 108(4), 475–487 (2002)
Peterson, C.L., Laniel, M.A.: Histones and histone modifications. Curr. Biol. 14(14), R546–R551 (2004)
Pham, T.H., Tran, D.H., Ho, T.B., Satou, K., Valiente, G.: Qualitatively predicting acetylation and methylation areas in dna sequences. In: Proc. 16th International Conference on Genome Informatics (2005)
Pokholok, D.K., Harbison, C.T., Levine, S., Cole, M., Hannett, N.M., Lee, T.I., Bell, G.W., Walker, K., Rolfe, P.A., Herbolsheimer, E., Zeitlinger, J., Lewitter, F., Gifford, D.K., Young, R.A.: Genome-wide map of nucleosome acetylation and methylation in yeast. Cell 122(4), 517–527 (2005)
Rabiner, L.R.: A tutorial on hidden markov models and selected applications in speech recognition. In: Proc. Proceeding of IEEE, pp. 257–286 (1989)
Ren, B., Robert, F., et al.: Genome-wide location and function of DNA binding proteins. Science 290(5500), 2306–2309 (2000)
Robyr, D., Suka, Y., Xenarios, I., Kurdistani, S.K., Wang, A., Suka, N., Grunstein, M.: Microarray deacetylation maps determine genome-wide functions for yeast histone deacetylases. Cell 109(4), 437–446 (2002)
Sha, F., Pereira, F.: Shalow parsing with conditional random fields. In: Proc. 15th Proceeding of Human Language Technology (2003)
Wallach, H.: Efficient Training of Conditional Random Fields. Master thesis (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tran, D.H., Pham, T.H., Satou, K., Ho, T.B. (2006). Conditional Random Fields for Predicting and Analyzing Histone Occupancy, Acetylation and Methylation Areas in DNA Sequences. In: Rothlauf, F., et al. Applications of Evolutionary Computing. EvoWorkshops 2006. Lecture Notes in Computer Science, vol 3907. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11732242_20
Download citation
DOI: https://doi.org/10.1007/11732242_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-33237-4
Online ISBN: 978-3-540-33238-1
eBook Packages: Computer ScienceComputer Science (R0)