Abstract
Biological function often springs from the intricate synchronization of individual proteins, rather than from bulk interactions. High-throughput single-molecule techniques now allow us to move beyond bulk rates to record distributions of reaction times. Such distributions can greatly help mechanistic modeling efforts, as they often contain signatures of the underlying reaction path. With a tentative model at hand, correctly judging its predictive power is predicated on correctly estimating its parameters from the available data. For complex models, such parameter estimation can be far from trivial, and the choice of method can significantly influence the result. We here provide a self-contained introduction to maximum-likelihood estimation aimed at single-molecule experimenters. By considering relevant examples, we explain how to use maximum-likelihood estimation and we compare its performance to that of popular least-squares methods. Considering single-molecule data, we argue that maximum-likelihood estimation is generally the superior choice and conclude with a discussion of how to estimate the spread in parameter estimates through bootstrapping.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note that we do not need to know the actual constant value of \( \sigma_{b} \), as it will not affect the position of the minimum of \( R^{\text{uwLS}} \).
- 2.
A more intuitive way of writing this might be in the form \( p\left( {{\text{model}}|{\text{data}}} \right) = \frac{{p\left( {\text{model}} \right)}}{{p\left( {\text{data}} \right)}}p\left( {{\text{data}}|{\text{model}}} \right). \)
- 3.
There are subtleties here relating to variable changes [18], but these lie outside our present scope.
- 4.
It should be noted that as the logarithm takes a unit-less argument, while the PDF has units (inverse time in case of the unbinding experiments). Strictly, we therefore need to multiply the PDF with some constant that renders the argument of the logarithm unit less in the definition of \( L^{\text{ML}} \left( {\left\{ \tau \right\}_{M} } \right) \). As the value of this constant does not affect the position of the minimum, we drop it for notational convenience.
References
Aartsen, M. G., Abraham, K., Ackermann, M., Adams, J., Aguilar, J. A., Ahlers, M., et al. (2015). A combined maximum-likelihood analysis of the high-energy astrophysical neutrino flux measured with icecube. Astrophysical Journal, 809(1), 1–15.
Avdis, E., & Wachter, J. A. (2017). Maximum likelihood estimation of the equity premium. Journal of Financial Economics, 125(3), 589–609.
Bahl, L. R., Jelinek, F., & Mercer, R. L. (1983). A maximum likelihood approach to continuous speech recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 5(2), 179–190.
Dulin, D., Berghuis, B. A., Depken, M., & Dekker, N. H. (2015). Untangling reaction pathways through modern approaches to high-throughput single-molecule force-spectroscopy experiments. Current Opinion in Structural Biology.
Efron, B., & Tibshirani, R. (1994). An introduction to the bootstrap. Chapman & Hall.
Fareh, M., van Lopik, J., Katechis, I., Bronkhorst, A. W., Haagsma, A. C., van Rij, R. P., & Joo, C. (2018). Viral suppressors of RNAi employ a rapid screening mode to discriminate viral RNA from cellular small RNA. Nucleic Acids Research (March), 1–11.
Felsenstein, J. (1981). Evolutionary trees from DNA sequences: A maximum likelihood approach. Journal of Molecular Evolution, 17(6), 368–376.
Forney, G. D. (1972). Maximum-likelihood sequence estimation of digital sequences in the presence of intersymbol interference. IEEE Transactions on Information Theory, 18(3), 363–378.
Hastie, T., Tibsharani, R., & Friedman, J. (2009). The elements of statistical learning. The mathematical intelligencer (2nd ed.). New York: Springer.
Hauschild, T., & Jentschel, M. (2001). Comparison of maximum likelihood estimation and chi-square statistics applied to counting experiments. Nuclear Instruments and Methods in Physics Research A, 457(1–2), 384–401.
Humphrey, P. J., Liu, W., & Buote, D. A. (2009). χ2 and Poissonian data: BIASES even in the high-count regime and how to avoid them. The Astrophysical Journal, 693(1), 822–829.
Jaynes, E. T. (2003). Probability theory: The logic of science. Cambridge: Cambridge University Press.
Johansen, S., & Juselius, K. (1990). Maximum likelihood estimation and inference on cointegration—With applications to the demand for money. Oxford Bulletin of Economics and Statistics, 52(2), 169–210.
Joo, C., Balci, H., Ishitsuka, Y., Buranachai, C., & Ha, T. (2008). Advances in single-molecule fluorescence methods for molecular biology. Annual Review of Biochemistry, 77, 51–76.
Joo, C., & Ha, T. (2012). Single-molecule FRET with total internal reflection microscopy. Cold Spring Harbor Protocols, 7(12), 1223–1237.
Leggetter, C. J., & Woodland, P. C. (1995). Maximum likelihood linear regression for speaker adaptation of continuous density hidden markov models. Computer Speech & Language, 9(2), 171–185.
Murshudov, G. N., Vagin, A. A., & Dodson, E. J. (1997). Refinement of macromolecular structures by the maximum-likelihood method. Acta Crystallographica. Section D, Biological Crystallography, 53(3), 240–255.
Nelson, P. (2015). Physical models of living systems. New York: W. H. Freeman.
Nishimura, G., & Tamura, M. (2005). Artefacts in the analysis of temporal response functions measured by photon counting. Physics in Medicine & Biology, 50(6), 1327–1342.
Nørrelykke, S. F., & Flyvbjerg, H. (2010). Power spectrum analysis with least-squares fitting: Amplitude bias and its elimination, with application to optical tweezers and atomic force microscope cantilevers. Review of Scientific Instruments, 81(7).
Nousek, J. A., & Shue, D. R. (1989). Chi-squared and C statistic minimization for low count per bin data. Astrophysical Journal, 342, 1207–1211.
Press, W., Teukolsky, S., Vetterling, W., Flannery, B., Ziegel, E., Press, W., et al. (2007). Numerical recipes: The art of scientific computing (3rd ed.). Cambridge: Cambridge University Press.
Santra, K., Zhan, J., Song, X., Smith, E. A., Vaswani, N., & Petrich, J. W. (2016). What is the best method to fit time-resolved data? A comparison of the residual minimization and the maximum likelihood techniques as applied to experimental time-correlated, single-photon counting data. Journal of Physical Chemistry B, 120(9), 2484–2490.
Scholten, T. L., & Blume-Kohout, R. (2018). Behavior of the maximum likelihood in quantum state tomography. New Journal of Physics, 20, 023050.
Stamatakis, A. (2006). RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics, 22(21), 2688–2690.
Trifinopoulos, J., Nguyen, L. T., von Haeseler, A., & Minh, B. Q. (2016). W-IQ-TREE: A fast online phylogenetic tool for maximum likelihood analysis. Nucleic Acids Research, 44(W1), W232–W235.
Turton, D. A., Reid, G. D., & Beddard, G. S. (2003). Accurate analysis of fluorescence decays from single molecules in photon counting experiments. Analytical Chemistry, 75(16), 4182–4187.
Whelan, S., & Goldman, N. (2001). A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Molecular Biology and Evolution, 18(5), 691–699.
Acknowledgements
We thank Tao Ju (Thijs) Cui, Misha Klein, and Olivera Rakic for careful reading of the manuscript and thoughtful feedback. B. Eslami-Mosallam acknowledges financial support through the research program Crowd management: the physics of genome processing in complex environments, which is financed by the Netherlands Organisation for Scientific Research. I. Katechis acknowledges financial support from the Netherlands Organisation for Scientific Research, as part of the Frontiers in Nanoscience program.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Science+Business Media, LLC, part of Springer Nature
About this chapter
Cite this chapter
Eslami-Mosallam, B., Katechis, I., Depken, M. (2019). Fitting in the Age of Single-Molecule Experiments: A Guide to Maximum-Likelihood Estimation and Its Advantages. In: Joo, C., Rueda, D. (eds) Biophysics of RNA-Protein Interactions. Biological and Medical Physics, Biomedical Engineering. Springer, New York, NY. https://doi.org/10.1007/978-1-4939-9726-8_5
Download citation
DOI: https://doi.org/10.1007/978-1-4939-9726-8_5
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4939-9724-4
Online ISBN: 978-1-4939-9726-8
eBook Packages: Physics and AstronomyPhysics and Astronomy (R0)