Identification of DNA CpG Islands Using Inter-dinucleotide Distances

  • Vera Afreixo
  • Carlos A. C. Bastos
  • João M. O. S. Rodrigues
  • Raquel M. Silva
Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 499)


In this study we set out to explore the potential of inter-genomic-symbol distances for finding CpG islands in DNA sequences. We examine the distributions of the inter-CpG and inter-SS distances under an independent-nucleotide model, which serves as the reference. We compare the empirical results from the complete human genome, from CpG islands, and from non-CpG-island regions with the corresponding reference results.

We propose a model to discriminate CpG islands based on statistical properties of the inter-dinucleotide distance distributions in DNA sequences. The results of this exploratory study suggest that the inter-SS distance has a high ability to discriminate CpG islands.
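
As a minimal illustration of the quantity studied here, the sketch below computes inter-dinucleotide distances, taken as the gaps between consecutive occurrences of a dinucleotide, and their empirical distribution. It is not the authors' implementation. It assumes that "S" denotes a strong nucleotide (C or G, as in the IUPAC code), so that "SS" covers CC, CG, GC and GG, and that overlapping occurrences are counted. The function names are hypothetical.

```python
from collections import Counter

# Assumption: "S" is the IUPAC code for a strong nucleotide (C or G),
# so an "SS" dinucleotide is any of CC, CG, GC, GG.
STRONG = {"C", "G"}


def dinucleotide_positions(seq, kind="CpG"):
    """Return the start positions of every CpG or SS dinucleotide in seq."""
    seq = seq.upper()
    if kind == "CpG":
        return [i for i in range(len(seq) - 1) if seq[i:i + 2] == "CG"]
    if kind == "SS":
        # Overlapping occurrences are counted (an assumption of this sketch).
        return [i for i in range(len(seq) - 1)
                if seq[i] in STRONG and seq[i + 1] in STRONG]
    raise ValueError("kind must be 'CpG' or 'SS'")


def inter_distances(seq, kind="CpG"):
    """Distances between consecutive occurrences of the chosen dinucleotide."""
    pos = dinucleotide_positions(seq, kind)
    return [b - a for a, b in zip(pos, pos[1:])]


def distance_distribution(distances):
    """Empirical relative frequency of each observed distance."""
    counts = Counter(distances)
    total = sum(counts.values())
    return {d: c / total for d, c in sorted(counts.items())}


if __name__ == "__main__":
    example = "ACGTTCGCGA"  # toy sequence, not real genomic data
    print(inter_distances(example, "CpG"))
    print(inter_distances(example, "SS"))
    print(distance_distribution(inter_distances(example, "SS")))
```

An empirical distribution obtained this way from a candidate region could then be compared with a reference distribution. How that reference is derived under nucleotide independence is not specified in this abstract, so it is left out of the sketch.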


State Diagram, Distance Distribution, Reference Distribution, Symbol Distance, Absorb Markov Chain
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  • Vera Afreixo (1, 2, 4), corresponding author
  • Carlos A. C. Bastos (2, 3)
  • João M. O. S. Rodrigues (2, 3)
  • Raquel M. Silva (1, 2)
  1. CIDMA - Center for Research and Development in Mathematics and Applications, University of Aveiro, Aveiro, Portugal
  2. IEETA - Institute of Electronics and Telematics Engineering of Aveiro, University of Aveiro, Aveiro, Portugal
  3. Department of Electronics Telecommunications and Informatics, University of Aveiro, Aveiro, Portugal
  4. Institute for Research in Biomedicine - iBiMED, University of Aveiro, Aveiro, Portugal
