Advertisement

CRISPR Detection from Short Reads Using Partial Overlap Graphs

  • Ilan Ben-BassatEmail author
  • Benny Chor
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9029)

Abstract

Clustered regularly interspaced short palindromic repeats (CRISPR) are structured regions in bacterial and archaeal genomes, which are part of an adaptive immune system against phages. Most of the automated tools that detect CRISPR loci rely on assembled genomes. However, many assemblers do not successfully handle repetitive regions. The first tool to work directly on raw sequence data is Crass, which requires that reads are long enough to contain two copies of the same repeat. We developed a method to identify CRISPR repeats from a raw sequence data of short reads. The algorithm is based on an observation differentiating CRISPR repeats from other types of repeats, and it involves a series of partial constructions of the overlap graph. A preliminary implementation of the algorithm shows good results and detects CRISPR repeats in cases where other tools fail to do so.

Keywords

CRISPR Detection Overlap graph Partial overlap graph Sampling Filtering \(k\)-mer counting 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Sorek, R., Kunin, V., Hugenholtz, P.: CRISPR - a widespread system that provides acquired resistance against phages in bacteria and archaea. Nat. Rev. Microbiol. 6, 181–186 (2008)CrossRefGoogle Scholar
  2. 2.
    Ishino, Y., Shinagawa, H., Makino, K., Amemura, M., Nakata, A.: Nucleotide sequence of the iap gene, responsible for alkaline phosphatase isozyme conversion in Escherichia coli, and identification of the gene product. J. Bacteriol. 169, 5429–5433 (1987)Google Scholar
  3. 3.
    Mojica, F.J., Diez-Villasenor, C., Garcia-Martinez, J., Soria, E.: Intervening sequences of regularly spaced prokaryotic repeats derive from foreign genetic elements. J. Mol. Evol. 60, 174–182 (2005)CrossRefGoogle Scholar
  4. 4.
    Horvath, P., Barrangou, R.: CRISPR-Cas, the immune system of bacteria and archaea. Science 327, 167–170 (2010)CrossRefGoogle Scholar
  5. 5.
    Stern, A., Mick, E., Tirosh, I., Sagy, O., Sorek, R.: CRISPR targeting reveals a reservoir of common phages associated with the human gut microbiome. Genome Res. 22, 1985–1994 (2012)CrossRefGoogle Scholar
  6. 6.
    Hu, W., et al.: RNA-directed gene editing specifically eradicates latent and prevents new HIV-1 infection. Proc. Natl. Acad Sci. USA 111(31), 11461–11466 (2014)CrossRefGoogle Scholar
  7. 7.
    Edgar, R.C.: PILER-CR: fast and accurate identification of CRISPR repeats. BMC Bioinformatics 8, 18 (2007)CrossRefGoogle Scholar
  8. 8.
    Bland, C., Ramsey, T.L., Sabree, F., Lowe, M., Brown, K., Kyrpides, N.C., Hugenholtz, P.: CRISPR Recognition Tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinformatics 8, 209 (2007)CrossRefGoogle Scholar
  9. 9.
    Grissa, I., Vergnaud, G., Pourcel, C.: CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35, W52–W57 (2007)CrossRefGoogle Scholar
  10. 10.
    Skennerton, C.T., Imelfort, M., Tyson, G.W.: Crass: identification and reconstruction of CRISPR from unassembled metagenomic data. Nucleic Acids Res. 41, e105 (2012)CrossRefGoogle Scholar
  11. 11.
    Myers, E.: Toward Simplifying and Accurately Formulating Fragment Assembly. Jornal of Computational Biology 2, 275–290 (1995)CrossRefGoogle Scholar
  12. 12.
    Roy, R.S., Bhattacharya, D., Schliep, A.: Turtle: Identifying frequent k-mers with cache-efficient algorithms. Bioinformatics (2014). doi: 10.1093/bioinformatics/btu132 Google Scholar
  13. 13.
    CRISPRs web server. http://crispr.u-psud.fr/
  14. 14.
    Zerbino, D.R., Birney, E.: Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 (2008)CrossRefGoogle Scholar

Copyright information

© Springer International Publishing Switzerland 2015

Authors and Affiliations

  1. 1.School of Computer ScienceTel Aviv UniversityTel AvivIsrael

Personalised recommendations