Skip to main content

Feature Reduction Using a Topic Model for the Prediction of Type III Secreted Effectors

  • Conference paper
Neural Information Processing (ICONIP 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7062))

Included in the following conference series:

  • 3785 Accesses

Abstract

The type III secretion system (T3SS) is a specialized protein delivery system that plays a key role in pathogenic bacteria. Until now, the secretion mechanism has not been fully understood yet. Recently, a lot of emphasis has been put on identifying type III secreted effectors (T3SE) in order to uncover the signal and principle that guide the secretion process. However, the amino acid sequences of T3SEs have great sequence diversity through fast evolution and many T3SEs have no homolog in the public databases at all. Therefore, it is notoriously challenging to recognize T3SEs. In this paper, we use amino acid sequence features to predict T3SEs, and conduct feature reduction using a topic model. The experimental results on Pseudomonas syringae data set demonstrate that the proposed method can effectively reduce the features and improve the prediction accuracy at the same time.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. He, S.Y., Nomura, K., Whittam, T.S.: Type III protein secretion mechanism in mammalian and plant pathogens. BBA-Molecular Cell Research 1694(1-3), 181–206 (2004)

    Google Scholar 

  2. Guttman, D.S., Vinatzer, B.A., Sarkar, S.F., Ranall, M.V., Kettler, G., Greenberg, J.T.: A functional screen for the type III (Hrp) secretome of the plant pathogen Pseudomonas syringae. Science 295(5560), 1722–1726 (2002)

    Article  Google Scholar 

  3. Vencato, M., Tian, F., Alfano, J.R., Buell, C.R., Cartinhour, S., DeClerck, G.A., Guttman, D.S., Stavrinides, J., Joardar, V., Lindeberg, M., et al.: Bioinformatics-enabled identification of the HrpL regulon and type III secretion system effector proteins of Pseudomonas syringae pv. phaseolicola 1448A. Molecular Plant-Microbe Interactions 19(11), 1193–1206 (2006)

    Article  Google Scholar 

  4. Arnold, R., Brandmaier, S., Kleine, F., Tischler, P., Heinz, E., Behrens, S., Niinikoski, A., Mewes, H., Horn, M., Rattei, T.: Sequence-based prediction of type III secreted proteins. PLoS Pathogens 5(4), e1000376 (2009)

    Google Scholar 

  5. Löwer, M., Schneider, G.: Prediction of Type III Secretion Signals in Genomes of Gram-Negative Bacteria. PloS One 4(6), e5917 (2009)

    Article  Google Scholar 

  6. Yang, Y., Zhao, J., Morgan, R., Ma, W., Jiang, T.: Computational prediction of type III secreted proteins from gram-negative bacteria. BMC Bioinformatics 11(suppl. 1), S47 (2010)

    Google Scholar 

  7. Wang, Y., Zhang, Q., Sun, M., Guo, D.: High-accuracy prediction of bacterial type iii secreted effectors based on position-specific amino acid composition profiles. Bioinformatics 27(6), 777–784 (2011)

    Article  Google Scholar 

  8. Yang, Y., Lu, B.L.: Extracting features from protein sequences using Chinese segmentation techniques for subcellular localization. In: Proceedings of the 2005 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, pp. 288–295 (2005)

    Google Scholar 

  9. Yang, Y., Lu, B., Yang, W.: Classification of protein sequences based on word segmentation methods. In: Proceedings of the 6th Asia-Pacific Bioinformatics Conference, vol. 6, pp. 177–186 (2008)

    Google Scholar 

  10. Liu, B., Wang, X., Lin, L., Dong, Q., Wang, X.: A discriminative method for protein remote homology detection and fold recognition combining top-n-grams and latent semantic analysis. BMC Bioinformatics 9(1), 510 (2008)

    Article  Google Scholar 

  11. Yeh, J., Chen, C.: Protein remote homology detection based on latent topic vector model. In: Proceedings of 2010 International Conference on Networking and Information Technology, pp. 456–460 (2010)

    Google Scholar 

  12. Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. The Journal of Machine Learning Research 3, 993–1022 (2003)

    MATH  Google Scholar 

  13. Griffiths, T., Steyvers, M., Blei, D., Tenenbaum, J.: Integrating topics and syntax. In: Advances in Neural Information Processing Systems, vol. 17, pp. 537–544 (2005)

    Google Scholar 

  14. Steyvers, M., Griffiths, T.: Matlab Topic Modeling Toolbox 1.4 (2011), Software available at http://psiexp.ss.uci.edu/research/programs_data/toolbox.htm

  15. Chang, C.C., Lin, C.J.: LIBSVM: a library for support vector machines (2001), Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2011 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Qi, S., Yang, Y., Song, A. (2011). Feature Reduction Using a Topic Model for the Prediction of Type III Secreted Effectors. In: Lu, BL., Zhang, L., Kwok, J. (eds) Neural Information Processing. ICONIP 2011. Lecture Notes in Computer Science, vol 7062. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24955-6_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-24955-6_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-24954-9

  • Online ISBN: 978-3-642-24955-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics