Abstract
The standard approach for learning Markov models with hidden state uses the Expectation-Maximization framework. While this approach has had a significant impact on several practical applications (e.g. speech recognition, biological sequence alignment), it has two major limitations: it requires a known model topology, and learning is only locally optimal. We propose a new PAC framework for learning both the topology and the parameters in partially observable Markov models. Our algorithm learns a Probabilistic Deterministic Finite Automaton (PDFA) which approximates a Hidden Markov Model (HMM) up to some desired degree of accuracy. We discuss theoretical conditions under which the algorithm produces an optimal solution (in the PAC sense) and demonstrate promising performance on simple dynamical systems.
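The paper's learning algorithm itself is not reproduced on this page, but the target representation it learns can be illustrated concretely. The sketch below is a minimal PDFA over a finite alphabet: each state carries a distribution over next symbols (plus a stop event), and each symbol deterministically selects the successor state, which is what makes the model "deterministic" despite being probabilistic. All class and state names here are hypothetical illustrations, not the authors' code.

```python
import random


class PDFA:
    """Minimal probabilistic deterministic finite automaton (sketch).

    transitions: state -> {symbol: (prob, next_state)}.
    The special symbol None denotes the stop event, with value (prob, None).
    In each state, the probabilities over symbols (including stop) sum to 1.
    """

    def __init__(self, transitions, start):
        self.transitions = transitions
        self.start = start

    def sample(self, rng=random):
        """Generate one string by following the state distributions."""
        state, out = self.start, []
        while True:
            symbols, weights = zip(
                *[(s, p) for s, (p, _) in self.transitions[state].items()]
            )
            sym = rng.choices(symbols, weights=weights)[0]
            if sym is None:  # stop event reached
                return "".join(out)
            out.append(sym)
            state = self.transitions[state][sym][1]  # deterministic successor

    def string_prob(self, s):
        """Probability the PDFA generates exactly the string s."""
        state, p = self.start, 1.0
        for ch in s:
            if ch not in self.transitions[state]:
                return 0.0
            prob, state = self.transitions[state][ch]
            p *= prob
        # Multiply by the stop probability in the final state.
        return p * self.transitions[state].get(None, (0.0, None))[0]


# Example: a two-state PDFA over {a, b} (numbers are arbitrary).
machine = PDFA(
    transitions={
        "q0": {"a": (0.5, "q0"), "b": (0.3, "q1"), None: (0.2, None)},
        "q1": {"a": (0.6, "q0"), None: (0.4, None)},
    },
    start="q0",
)
```

Because transitions are deterministic per symbol, `string_prob` is a single pass over the string; this tractability of exact string probabilities is one reason PDFAs are an attractive learning target compared with general HMMs.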
© 2006 Springer-Verlag Berlin Heidelberg
Cite this paper
Gavaldà, R., Keller, P.W., Pineau, J., Precup, D. (2006). PAC-Learning of Markov Models with Hidden State. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) Machine Learning: ECML 2006. Lecture Notes in Computer Science, vol. 4212. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11871842_18
Print ISBN: 978-3-540-45375-8
Online ISBN: 978-3-540-46056-5