PFMFind: A System for Discovery of Peptide Homology and Function

Stojmirović, Aleksandar; Andreae, Peter; Boland, Mike; Jordan, Thomas William; Pestov, Vladimir G.

doi:10.1007/978-3-642-41062-8_32

Aleksandar Stojmirović¹⁸,
Peter Andreae¹⁹,
Mike Boland²⁰,
Thomas William Jordan²¹ &
…
Vladimir G. Pestov²²

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8199))

Included in the following conference series:

International Conference on Similarity Search and Applications

1650 Accesses
1 Citations

Abstract

Protein Fragment Motif Finder (PFMFind) is a system that enables efficient discovery of relationships between short fragments of protein sequences using similarity search. It supports queries based on amino acid similarity matrices and position specific score matrices (PSSMs) obtained through an iterative procedure. PSSM construction is customisable through plugins written in Python. PFMFind consists of a GUI client, an index for fast similarity search and a relational database for storing search results and sequence annotations. It is written mostly in Python. The components of PFMFind communicate through TCP/IP sockets and can be located on different physical machines. PFMFind is freely available for download (under a GPL licence) from http://pfmfind.stojmirovic.org .

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Altschul, S.F., Madden, T.L., Schaffer, A.A., Zhang, J., Zhang, Z., Miller, W., Lipman, D.J.: Gapped BLAST and PSI–BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389–3402 (1997)
Article Google Scholar
Bairoch, A., Apweiler, R., Wu, C.H., Barker, W.C., Boeckmann, B., Ferro, S., Gasteiger, E., Huang, H., Lopez, R., Magrane, M., Martin, M.J., Natale, D.A., O’Donovan, C., Redaschi, N., Yeh, L.S.L.: The Universal Protein Resource (UniProt). Nucleic Acids Res 33 Database Issue, 154–159 (2005)
Google Scholar
Dayhoff, M.O., Schwartz, R.M., Orcutt, B.C.: A model of evolutionary change in proteins. In: Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, vol. 5, ch.22, pp. 345–352. National Biomedical Research Foundation (1978)
Google Scholar
Gribskov, M., McLachlan, A.D., Eisenberg, D.: Profile analysis: detection of distantly related proteins. Proc Natl. Acad. Sci. USA 84, 4355–4358 (1987)
Article Google Scholar
Henikoff, S., Henikoff, J.G.: Position-based sequence weights. J. Mol. Biol. 243(4), 574–578 (1994)
Article Google Scholar
Henikoff, S., Henikoff, J.G.: Amino acid substitution matrices from protein blocks. Proc. Natl. Acad. Sci. USA 89, 10915–10919 (1992)
Article Google Scholar
Hunter, S., Jones, P., Mitchell, A., Apweiler, R., Attwood, T.K., Bateman, A., Bernard, T., Binns, D., Bork, P., Burge, S., de Castro, E., Coggill, P., Corbett, M., Das, U., Daugherty, L., Duquenne, L., Finn, R.D., Fraser, M., Gough, J., Haft, D., Hulo, N., Kahn, D., Kelly, E., Letunic, I., Lonsdale, D., Lopez, R., Madera, M., Maslen, J., McAnulla, C., McDowall, J., McMenamin, C., Mi, H., Mutowo-Muellenet, P., Mulder, N., Natale, D., Orengo, C., Pesseat, S., Punta, M., Quinn, A.F., Rivoire, C., Sangrador-Vegas, A., Selengut, J.D., Sigrist, C.J.A., Scheremetjew, M., Tate, J., Thimmajanarthanan, M., Thomas, P.D., Wu, C.H., Yeats, C., Yong, S.Y.: Interpro in 2011: New developments in the family and domain prediction database. Nucleic Acids Res. 40(Database issue), D306–D312 (2012)
Google Scholar
Pestov, V., Stojmirović, A.: Indexing schemes for similarity search: an illustrated paradigm. Fundam. Inform. 70(4), 367–385 (2006)
MATH Google Scholar
Sjölander, K., Karplus, K., Brown, M., Hughey, R., Krogh, A., Mian, I., Haussler, D.: Dirichlet mixtures: A method for improving detection of weak but significant protein sequence homology. Comput. Appl. Biosci. 12(4), 327–345 (1996)
Google Scholar
Stojmirović, A., Pestov, V.: Indexing schemes for similarity search in datasets of short protein fragments. Inf. Syst. 32(8), 1145–1165 (2007)
Article Google Scholar
Watt, T.J., Doyle, D.F.: ESPSearch: a program for finding exact sequences and patterns in DNA, RNA, or protein. Biotechniques 38(1), 109–115 (2005)
Article Google Scholar

Download references

Author information

Authors and Affiliations

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD, 20894, United States
Aleksandar Stojmirović
School of Engineering and Computer Science, Victoria University of Wellington, PO Box 600, Wellington, 6140, New Zealand
Peter Andreae
Riddet Institute, Massey University, PB 11 222, Palmerston North, 4442, New Zealand
Mike Boland
School of Biological Sciences, Victoria University of Wellington, PO Box 600, Wellington, 6140, New Zealand
Thomas William Jordan
Department of Mathematics and Statistics, University of Ottawa, 585 King Edward Ave., Ottawa, ON K1N 6N5, Canada
Vladimir G. Pestov

Authors

Aleksandar Stojmirović
View author publications
You can also search for this author in PubMed Google Scholar
Peter Andreae
View author publications
You can also search for this author in PubMed Google Scholar
Mike Boland
View author publications
You can also search for this author in PubMed Google Scholar
Thomas William Jordan
View author publications
You can also search for this author in PubMed Google Scholar
Vladimir G. Pestov
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Database Laboratory, Universidade da Coruña, Spain
Nieves Brisaboa & Oscar Pedreira &
Faculty of Informatics, Masaryk University, Brno, Czech Republic
Pavel Zezula

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Stojmirović, A., Andreae, P., Boland, M., Jordan, T.W., Pestov, V.G. (2013). PFMFind: A System for Discovery of Peptide Homology and Function. In: Brisaboa, N., Pedreira, O., Zezula, P. (eds) Similarity Search and Applications. SISAP 2013. Lecture Notes in Computer Science, vol 8199. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-41062-8_32

Download citation

DOI: https://doi.org/10.1007/978-3-642-41062-8_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-41061-1
Online ISBN: 978-3-642-41062-8
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics