Fitting in the Age of Single-Molecule Experiments: A Guide to Maximum-Likelihood Estimation and Its Advantages

Eslami-Mosallam, Behrouz; Katechis, Iason; Depken, Martin

doi:10.1007/978-1-4939-9726-8_5

Behrouz Eslami-Mosallam³⁰,
Iason Katechis³⁰ &
Martin Depken³⁰

Part of the book series: Biological and Medical Physics, Biomedical Engineering ((BIOMEDICAL))

794 Accesses

Abstract

Biological function often springs from the intricate synchronization of individual proteins, rather than from bulk interactions. High-throughput single-molecule techniques now allow us to move beyond bulk rates to record distributions of reaction times. Such distributions can greatly help mechanistic modeling efforts, as they often contain signatures of the underlying reaction path. With a tentative model at hand, correctly judging its predictive power is predicated on correctly estimating its parameters from the available data. For complex models, such parameter estimation can be far from trivial, and the choice of method can significantly influence the result. We here provide a self-contained introduction to maximum-likelihood estimation aimed at single-molecule experimenters. By considering relevant examples, we explain how to use maximum-likelihood estimation and we compare its performance to that of popular least-squares methods. Considering single-molecule data, we argue that maximum-likelihood estimation is generally the superior choice and conclude with a discussion of how to estimate the spread in parameter estimates through bootstrapping.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 139.00; Price excludes VAT (USA)

Hardcover Book: USD 179.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
Note that we do not need to know the actual constant value of \( \sigma_{b} \), as it will not affect the position of the minimum of \( R^{\text{uwLS}} \).
2.
A more intuitive way of writing this might be in the form \( p\left( {{\text{model}}|{\text{data}}} \right) = \frac{{p\left( {\text{model}} \right)}}{{p\left( {\text{data}} \right)}}p\left( {{\text{data}}|{\text{model}}} \right). \)
3.
There are subtleties here relating to variable changes [18], but these lie outside our present scope.
4.
It should be noted that as the logarithm takes a unit-less argument, while the PDF has units (inverse time in case of the unbinding experiments). Strictly, we therefore need to multiply the PDF with some constant that renders the argument of the logarithm unit less in the definition of \( L^{\text{ML}} \left( {\left\{ \tau \right\}_{M} } \right) \). As the value of this constant does not affect the position of the minimum, we drop it for notational convenience.

References

Aartsen, M. G., Abraham, K., Ackermann, M., Adams, J., Aguilar, J. A., Ahlers, M., et al. (2015). A combined maximum-likelihood analysis of the high-energy astrophysical neutrino flux measured with icecube. Astrophysical Journal, 809(1), 1–15.
Article Google Scholar
Avdis, E., & Wachter, J. A. (2017). Maximum likelihood estimation of the equity premium. Journal of Financial Economics, 125(3), 589–609.
Article Google Scholar
Bahl, L. R., Jelinek, F., & Mercer, R. L. (1983). A maximum likelihood approach to continuous speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(2), 179–190.
Article Google Scholar
Dulin, D., Berghuis, B. A., Depken, M., & Dekker, N. H. (2015). Untangling reaction pathways through modern approaches to high-throughput single-molecule force-spectroscopy experiments. Current Opinion in Structural Biology.
Google Scholar
Efron, B., & Tibshirani, R. (1994). An introduction to the bootstrap. Chapman & Hall.
Google Scholar
Fareh, M., van Lopik, J., Katechis, I., Bronkhorst, A. W., Haagsma, A. C., van Rij, R. P., & Joo, C. (2018). Viral suppressors of RNAi employ a rapid screening mode to discriminate viral RNA from cellular small RNA. Nucleic Acids Research (March), 1–11.
Google Scholar
Felsenstein, J. (1981). Evolutionary trees from DNA sequences: A maximum likelihood approach. Journal of Molecular Evolution, 17(6), 368–376.
Article ADS Google Scholar
Forney, G. D. (1972). Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference. IEEE Transactions on Information Theory, 18(3), 363–378.
Article MathSciNet Google Scholar
Hastie, T., Tibsharani, R., & Friedman, J. (2009). The elements of statistical learning. The mathematical intelligencer (2nd ed.). New York: Springer.
Book Google Scholar
Hauschild, T., & Jentschel, M. (2001). Comparison of maximum likelihood estimation and chi-square statistics applied to counting experiments. Nuclear Instruments and Methods in Physics Research A, 457(1–2), 384–401.
Article ADS Google Scholar
Humphrey, P. J., Liu, W., & Buote, D. A. (2009). χ2 and Poissonian data: BIASES even in the high-count regime and how to avoid them. The Astrophysical Journal, 693(1), 822–829.
Article ADS Google Scholar
Jaynes, E. T. (2003). Probability theory: The logic of science. Cambridge: Cambridge University Press.
Book Google Scholar
Johansen, S., & Juselius, K. (1990). Maximum likelihood estimation and inference on cointegration—With applications to the demand for money. Oxford Bulletin of Economics and Statistics, 52(2), 169–210.
Article Google Scholar
Joo, C., Balci, H., Ishitsuka, Y., Buranachai, C., & Ha, T. (2008). Advances in single-molecule fluorescence methods for molecular biology. Annual Review of Biochemistry, 77, 51–76.
Article Google Scholar
Joo, C., & Ha, T. (2012). Single-molecule FRET with total internal reflection microscopy. Cold Spring Harbor Protocols, 7(12), 1223–1237.
Google Scholar
Leggetter, C. J., & Woodland, P. C. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech & Language, 9(2), 171–185.
Article Google Scholar
Murshudov, G. N., Vagin, A. A., & Dodson, E. J. (1997). Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallographica. Section D, Biological Crystallography, 53(3), 240–255.
Article Google Scholar
Nelson, P. (2015). Physical models of living systems. New York: W. H. Freeman.
Google Scholar
Nishimura, G., & Tamura, M. (2005). Artefacts in the analysis of temporal response functions measured by photon counting. Physics in Medicine & Biology, 50(6), 1327–1342.
Article ADS Google Scholar
Nørrelykke, S. F., & Flyvbjerg, H. (2010). Power spectrum analysis with least-squares fitting: Amplitude bias and its elimination, with application to optical tweezers and atomic force microscope cantilevers. Review of Scientific Instruments, 81(7).
Google Scholar
Nousek, J. A., & Shue, D. R. (1989). Chi-squared and C statistic minimization for low count per bin data. Astrophysical Journal, 342, 1207–1211.
Article ADS Google Scholar
Press, W., Teukolsky, S., Vetterling, W., Flannery, B., Ziegel, E., Press, W., et al. (2007). Numerical recipes: The art of scientific computing (3rd ed.). Cambridge: Cambridge University Press.
MATH Google Scholar
Santra, K., Zhan, J., Song, X., Smith, E. A., Vaswani, N., & Petrich, J. W. (2016). What is the best method to fit time-resolved data? A comparison of the residual minimization and the maximum likelihood techniques as applied to experimental time-correlated, single-photon counting data. Journal of Physical Chemistry B, 120(9), 2484–2490.
Article Google Scholar
Scholten, T. L., & Blume-Kohout, R. (2018). Behavior of the maximum likelihood in quantum state tomography. New Journal of Physics, 20, 023050.
Article ADS Google Scholar
Stamatakis, A. (2006). RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics, 22(21), 2688–2690.
Article Google Scholar
Trifinopoulos, J., Nguyen, L. T., von Haeseler, A., & Minh, B. Q. (2016). W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Research, 44(W1), W232–W235.
Article Google Scholar
Turton, D. A., Reid, G. D., & Beddard, G. S. (2003). Accurate analysis of fluorescence decays from single molecules in photon counting experiments. Analytical Chemistry, 75(16), 4182–4187.
Article Google Scholar
Whelan, S., & Goldman, N. (2001). A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Molecular Biology and Evolution, 18(5), 691–699.
Article Google Scholar

Download references

Acknowledgements

We thank Tao Ju (Thijs) Cui, Misha Klein, and Olivera Rakic for careful reading of the manuscript and thoughtful feedback. B. Eslami-Mosallam acknowledges financial support through the research program Crowd management: the physics of genome processing in complex environments, which is financed by the Netherlands Organisation for Scientific Research. I. Katechis acknowledges financial support from the Netherlands Organisation for Scientific Research, as part of the Frontiers in Nanoscience program.

Author information

Authors and Affiliations

Department of BioNanoScience, Kavli Institute of NanoScience, Delft University of Technology, 2629 HZ, Delft, The Netherlands
Behrouz Eslami-Mosallam, Iason Katechis & Martin Depken

Authors

Behrouz Eslami-Mosallam
View author publications
You can also search for this author in PubMed Google Scholar
Iason Katechis
View author publications
You can also search for this author in PubMed Google Scholar
Martin Depken
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Martin Depken .

Editor information

Editors and Affiliations

Kavli Institute of NanoScience, Delft University of Technology, Delft, Zuid-Holland, The Netherlands
Chirlmin Joo
Department of Medicine, Imperial College London, London, UK
David Rueda

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Eslami-Mosallam, B., Katechis, I., Depken, M. (2019). Fitting in the Age of Single-Molecule Experiments: A Guide to Maximum-Likelihood Estimation and Its Advantages. In: Joo, C., Rueda, D. (eds) Biophysics of RNA-Protein Interactions. Biological and Medical Physics, Biomedical Engineering. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-9726-8_5

Download citation

DOI: https://doi.org/10.1007/978-1-4939-9726-8_5
Published: 20 September 2019
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-9724-4
Online ISBN: 978-1-4939-9726-8
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)

Publish with us

Policies and ethics