Sequence-based classification of type II polyketide synthase biosynthetic gene clusters for antiSMASH

  • Rasmus Villebro
  • Simon Shaw
  • Kai BlinEmail author
  • Tilmann WeberEmail author
Natural Products - Original Paper


The software antiSMASH examines microbial genome data to identify and analyze biosynthetic gene clusters for a wide range of natural products. So far, type II polyketide synthase (PKS) gene clusters could only be identified, but no detailed predictions for type II PKS gene clusters could be provided. In this study, an antiSMASH module for analyzing type II PKS gene clusters has been developed. The module detects genes/proteins in the type II PKS gene cluster involved with polyketide biosynthesis and is able to make predictions about the aromatic polyketide product. Predictions include the putative starter unit, the number of malonyl elongations during polyketide biosynthesis, the putative class and the molecular weight of the product. Furthermore, putative cyclization patterns are predicted. The accuracy of the predictions generated with the new PKSII antiSMASH module was evaluated using a leave-one-out cross validation. The prediction module is available in antiSMASH version 5 at


Type II polyketide synthases PKS Aromatic polyketides Secondary metabolite Natural product Genome mining 



This work was funded by Grants of the Novo Nordisk Foundation [NNF10CC1016517, NNF16OC0021746] to TW.

Supplementary material

10295_2018_2131_MOESM1_ESM.pdf (938 kb)
Supplementary material 1 (PDF 938 kb)


  1. 1.
    Blin K, Medema MH, Kottmann R et al (2017) The antiSMASH database, a comprehensive database of microbial secondary metabolite biosynthetic gene clusters. Nucleic Acids Res 45:D555–D559. CrossRefGoogle Scholar
  2. 2.
    Blin K, Medema MH, Kazempour D et al (2013) antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers. Nucleic Acids Res 41:W204–W212. CrossRefGoogle Scholar
  3. 3.
    Blin K, Pascal Andreu V, de los Santos EC et al (2018) The antiSMASH database version 2: a comprehensive resource on secondary metabolite biosynthetic gene clusters. Nucleic Acids Res. Google Scholar
  4. 4.
    Blin K, Wolf T, Chevrette MG et al (2017) antiSMASH 4.0—improvements in chemistry prediction and gene cluster boundary identification. Nucleic Acids Res 45:W36–W41. CrossRefGoogle Scholar
  5. 5.
    Camacho C, Coulouris G, Avagyan V et al (2009) BLAST+: architecture and applications. BMC Bioinform 10:421. CrossRefGoogle Scholar
  6. 6.
    Cane DE, Walsh CT (1999) The parallel and convergent universes of polyketide synthases and nonribosomal peptide synthetases. Chem Biol 6:319–325. CrossRefGoogle Scholar
  7. 7.
    Eddy SR (2011) Accelerated profile HMM searches. PLoS Comput Biol 7:e1002195. CrossRefGoogle Scholar
  8. 8.
    Edgar RC (2004) MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res 32:1792–1797. CrossRefGoogle Scholar
  9. 9.
    Feng Z, Kallifidas D, Brady SF (2011) Functional analysis of environmental DNA-derived type II polyketide synthases reveals structurally diverse secondary metabolites. Proc Natl Acad Sci 108:12629–12634. CrossRefGoogle Scholar
  10. 10.
    Fernandez-Moreno MA, Martinez E, Boto L et al (1992) Nucleotide sequence and deduced functions of a set of cotranscribed genes of Streptomyces coelicolor A3(2) including the polyketide synthase for the antibiotic actinorhodin. J Biol Chem 267:19278–19290Google Scholar
  11. 11.
    Hadjithomas M, Chen IMA, Chu K et al (2017) IMG-ABC: new features for bacterial secondary metabolism analysis and targeted biosynthetic gene cluster discovery in thousands of microbial genomes. Nucleic Acids Res 45:D560–D565. CrossRefGoogle Scholar
  12. 12.
    Hertweck C, Luzhetskyy A, Rebets Y, Bechthold A (2007) Type II polyketide synthases: gaining a deeper insight into enzymatic teamwork. Nat Prod Rep 24:162–190. CrossRefGoogle Scholar
  13. 13.
    Hofeditz T, Unsin C, Wiese J et al (2018) Lysoquinone-TH1, a new polyphenolic tridecaketide produced by expressing the lysolipin minimal PKS II in Streptomyces albus. Antibiotics 7:53. CrossRefGoogle Scholar
  14. 14.
    Ichikawa N, Sasagawa M, Yamamoto M et al (2013) DoBISCUIT: a database of secondary metabolite biosynthetic gene clusters. Nucleic Acids Res 41:408–414. CrossRefGoogle Scholar
  15. 15.
    Katz L, Baltz RH (2016) Natural product discovery: past, present, and future. J Ind Microbiol Biotechnol 43:155–176. CrossRefGoogle Scholar
  16. 16.
    Kawasaki T, Moriyama A, Nakagawa K, Imamura N (2016) Cloning and identification of saprolmycin biosynthetic gene cluster from Streptomyces sp. TK08046. Biosci Biotechnol Biochem 80:2144–2150. CrossRefGoogle Scholar
  17. 17.
    Kim J, Yi G-SS (2012) PKMiner: a database for exploring type II polyketide synthases. BMC Microbiol 12:169. CrossRefGoogle Scholar
  18. 18.
    Lopez P, Hornung A, Welzel K et al (2010) Isolation of the lysolipin gene cluster of Streptomyces tendae Tu 4042. Gene 461:5–14. CrossRefGoogle Scholar
  19. 19.
    Lukežič T, Lešnik U, Podgoršek A et al (2013) Identification of the chelocardin biosynthetic gene cluster from Amycolatopsis sulphurea: a platform for producing novel tetracycline antibiotics. Microbiol (United Kingdom) 159:2524–2532. Google Scholar
  20. 20.
    Medema MH, Blin K, Cimermancic P et al (2011) antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. Google Scholar
  21. 21.
    Medema MH, Kottmann R, Yilmaz P et al (2015) Minimum information about a biosynthetic gene cluster. Nat Chem Biol 11:625–631. CrossRefGoogle Scholar
  22. 22.
    Medema MH, Fischbach MA (2015) Computational approaches to natural product discovery. Nat Chem Biol 11:639–648. CrossRefGoogle Scholar
  23. 23.
    Newman DJ, Cragg GM (2016) Natural products as sources of new drugs from 1981 to 2014. J Nat Prod 79:629–661CrossRefGoogle Scholar
  24. 24.
    Otten SL, Stutzman-Engwall KJ, Hutchinson CR (1990) Cloning and expression of daunorubicin biosynthesis genes from Streptomyces peucetius and S. peucetius subsp. caesius. J Bacteriol 172:3427–3434CrossRefGoogle Scholar
  25. 25.
    Pickens LB, Tang Y (2009) Decoding and engineering tetracycline biosynthesis. Metab Eng 11:69–75CrossRefGoogle Scholar
  26. 26.
    Pickens LB, Tang Y (2010) Oxytetracycline biosynthesis. J Biol Chem 285:27509–27515. CrossRefGoogle Scholar
  27. 27.
    Price MN, Dehal PS, Arkin AP (2010) FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One 5:e9490. CrossRefGoogle Scholar
  28. 28.
    Sandmann A, Dickschat J, Jenke-Kodama H et al (2007) A type II polyketide synthase from the gram-negative bacterium Stigmatella aurantiaca is involved in aurachin alkaloid biosynthesis. Angew Chemie (Int Ed) 46:2712–2716. CrossRefGoogle Scholar
  29. 29.
    Skinnider MA, Dejong CA, Rees PN et al (2015) Genomes to natural products prediction informatics for secondary metabolomes (PRISM). Nucleic Acids Res 43:9645–9662. Google Scholar
  30. 30.
    Skinnider MA, Merwin NJ, Johnston CW, Magarvey NA (2017) PRISM 3: expanded prediction of natural product chemical structures from microbial genomes. Nucleic Acids Res 45:W49–W54. CrossRefGoogle Scholar
  31. 31.
    Tang Y, Tsai SC, Khosla C (2003) Polyketide chain length control by chain length factor. J Am Chem Soc 125:12708–12709CrossRefGoogle Scholar
  32. 32.
    Weber T, Blin K, Duddela S et al (2015) antiSMASH 3.0-a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res 43:W237–W243. CrossRefGoogle Scholar
  33. 33.
    Zhang M, Hou X-F, Qi L-H et al (2015) Biosynthesis of trioxacarcin revealing a different starter unit and complex tailoring steps for type II polyketide synthase. Chem Sci 6:3440–3447. CrossRefGoogle Scholar
  34. 34.
    Zhang Z, Pan H-X, Tang G-L (2017) New insights into bacterial type II polyketide biosynthesis. F1000Research 6:172. CrossRefGoogle Scholar
  35. 35.
    Zhou H, Li Y, Tang Y (2010) Cyclization of aromatic polyketides from bacteria and fungi. Nat Prod Rep 27:839–868. CrossRefGoogle Scholar
  36. 36.
    Zhu T, Cheng X, Liu Y et al (2013) Deciphering and engineering of the final step halogenase for improved chlortetracycline biosynthesis in industrial Streptomyces aureofaciens. Metab Eng 19:69–78. CrossRefGoogle Scholar
  37. 37.
    Ziemert N, Alanjary M, Weber T (2016) The evolution of genome mining in microbes—a review. Nat Prod Rep 33:988–1005. CrossRefGoogle Scholar

Copyright information

© Society for Industrial Microbiology and Biotechnology 2019

Authors and Affiliations

  1. 1.The Novo Nordisk Foundation Center for Biosustainability, Technical University of DenmarkKongens LyngbyDenmark

Personalised recommendations