Automated Computational Inference of Multi-protein Assemblies from Biochemical Co-purification Data

Goebels, Florian; Hu, Lucas; Bader, Gary; Emili, Andrew

doi:10.1007/978-1-4939-7759-8_25

Florian Goebels³,
Lucas Hu³,
Gary Bader³ &
…
Andrew Emili³

Part of the book series: Methods in Molecular Biology ((MIMB,volume 1764))

4120 Accesses
1 Citations
1 Altmetric

Abstract

Biology has amassed a wealth of information about the function of a multitude of protein-coding genes across species. The challenge now is to understand how all these proteins work together to form a living organism, and a crucial step for gaining this knowledge is a complete description of the molecular “wiring circuits” that underlie cellular processes. In this chapter, we describe a general computational framework for predicting multi-protein assemblies from biochemical co-fractionation data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Protocol: USD 49.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 139.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Lucas Hu Ming FG, Cuihong Wan, Gary Bader, Andrew Emili (2018) EPIC: elution profile-based inference of protein complex membership. Under revision.
Google Scholar
Havugimana PC et al (2012) A census of human soluble protein complexes. Cell 150(5):1068–1081
Article CAS PubMed PubMed Central Google Scholar
Wan C et al (2015) Panorama of ancient metazoan macromolecular complexes. Nature 525(7569):339–344
Article CAS PubMed PubMed Central Google Scholar
Shannon P et al (2003) Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res 13(11):2498–2504
Article CAS PubMed PubMed Central Google Scholar
Ruepp A et al (2010) CORUM: the comprehensive resource of mammalian protein complexes—2009. Nucleic Acids Res 38(suppl 1):D497–D501
Article CAS PubMed Google Scholar
Kerrien S et al (2012) The IntAct molecular interaction database in 2012. Nucleic Acids Res 40(D1):D841–D846
Article CAS PubMed Google Scholar
Gene Ontology C (2015) Gene ontology consortium: going forward. Nucleic Acids Res 43(Database issue):D1049–D1056
Article CAS Google Scholar
Wehrens, R. and M.R. Wehrens, Package ‘wccsom’. 2015
Google Scholar
Sánchez-Taltavull D et al (2016) Bayesian correlation analysis for sequence count data. PLoS One 11(10):e0163595
Article CAS PubMed PubMed Central Google Scholar
Suykens JA, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9(3):293–300
Article Google Scholar
Breiman L (2001) Random forests. Mach Learn 45(1):5–32
Article Google Scholar
Szklarczyk D et al (2017) The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible. Nucleic Acids Res 45(D1):D362–D368
Article CAS PubMed Google Scholar
Warde-Farley D et al (2010) The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res 38(suppl_2):W214–W220
Article CAS PubMed PubMed Central Google Scholar
Davis J and Goadrich M 2006. The relationship between precision-recall and ROC curves. In Proceedings of the 23rd international conference on Machine learning. ACM
Google Scholar
Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1):29–36
Article CAS Google Scholar
Lee I et al (2011) Prioritizing candidate disease genes by network-based boosting of genome-wide association data. Genome Res 21(7):1109–1121
Article CAS PubMed PubMed Central Google Scholar
Lee I et al (2010) Predicting genetic modifier loci using functional gene networks. Genome Res 20(8):1143–1153
Article CAS PubMed PubMed Central Google Scholar
Kim WK, Krumpelman C, Marcotte EM (2008) Inferring mouse gene functions from genomic-scale data using a combined functional network/classification strategy. Genome Biol 9(1):S5
Article CAS PubMed PubMed Central Google Scholar

Download references

Author information

Authors and Affiliations

Donnelly Centre for Cellular and Biomolecular Research, University of Toronto, Toronto, ON, Canada
Florian Goebels, Lucas Hu, Gary Bader & Andrew Emili

Authors

Florian Goebels
View author publications
You can also search for this author in PubMed Google Scholar
Lucas Hu
View author publications
You can also search for this author in PubMed Google Scholar
Gary Bader
View author publications
You can also search for this author in PubMed Google Scholar
Andrew Emili
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Andrew Emili .

Editor information

Editors and Affiliations

MRC Human Genetics Unit, Institute of Genetics & Molecular Medicine, University of Edinburgh, Edinburgh, United Kingdom
Joseph A. Marsh

Rights and permissions

Reprints and permissions

Copyright information

About this protocol

Cite this protocol

Goebels, F., Hu, L., Bader, G., Emili, A. (2018). Automated Computational Inference of Multi-protein Assemblies from Biochemical Co-purification Data. In: Marsh, J. (eds) Protein Complex Assembly. Methods in Molecular Biology, vol 1764. Humana Press, New York, NY. https://doi.org/10.1007/978-1-4939-7759-8_25

Download citation

DOI: https://doi.org/10.1007/978-1-4939-7759-8_25
Published: 01 April 2018
Publisher Name: Humana Press, New York, NY
Print ISBN: 978-1-4939-7758-1
Online ISBN: 978-1-4939-7759-8
eBook Packages: Springer Protocols

Publish with us

Policies and ethics