Hyper-parameter Optimization of Sticky HDP-HMM Through an Enhanced Particle Swarm Optimization

  • Jiaxi LiEmail author
  • Junfu Yin
  • Yuk Ying Chung
  • Feng Sha
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 9949)


Faced with the problem of uncertainties in object trajectory and pattern recognition in terms of the non-parametric Bayesian approach, we have derived that 2 major methods of optimizing hierarchical Dirichlet process hidden Markov model (HDP-HMM) for the task. HDP-HMM suffers from poor performance not only on moderate dimensional data, but also sensitivity to its parameter settings. For the purpose of optimizing HDP-HMM on dimensional data, test for optimized results will be carried on the Tum Kitchen dataset [7], which was provided for the purpose of research the motion and activity recognitions. The optimization techniques capture the best hyper-parameters which then produce optimal solution to the task given in a certain search space.


Non-parametric Bayes HDP-HMM Pattern recognition Model selection Optimization Hyper-parameters 


  1. 1.
    Bergstra, J., Bengio, Y.: Random search for hyper-parameter optimization. J. Mach. Learn. Res. 13, 281–305 (2012)MathSciNetzbMATHGoogle Scholar
  2. 2.
    Fox, E.B.: Bayesian nonparametric learning of complex dynamical phenomena. PhD thesis, Massachusetts Institute of Technology, September 2009Google Scholar
  3. 3.
    Fox, E.B., Sudderth, E.B., Jordan, M.I., Willsky, A.S.: The sticky HDP-HMM: Bayesian nonparametric hidden Markov models with persistent states. Arxiv preprint (2007)Google Scholar
  4. 4.
    Fox, E.B., Sudderth, E.B., Jordan, M.I., Willsky, A.S.: An HDP-HMM for systems with state persistence. In: Proceedings of the 25th International Conference on Machine Learning, pp. 312–319. ACM (2008)Google Scholar
  5. 5.
    Fox, E.B., Sudderth, E.B., Jordan, M.I., Willsky, A.S.: A sticky HDP-HMM with application to speaker diarization. ArXiv e-prints, May 2011Google Scholar
  6. 6.
    Kivinen, J.J., Sudderth, E.B., Jordan, M.I.: Learning multiscale representations of natural scenes using Dirichlet processes. In: IEEE 11th International Conference on Computer Vision, 2007. ICCV 2007, pp. 1–8. IEEE (2007)Google Scholar
  7. 7.
    Tenorth, M., Bandouch, J., Beetz, M.: The TUM kitchen data set of everyday manipulation activities for motion tracking and action recognition. In: IEEE International Workshop on Tracking Humans for the Evaluation of their Motion in Image Sequences (THEMIS), in conjunction with ICCV 2009 (2009)Google Scholar
  8. 8.
    Xing, E.P., Sohn, K.-A., et al.: Hidden Markov Dirichlet process: modeling genetic inference in open ancestral space. Bayesian Anal. 2(3), 501–527 (2007)MathSciNetCrossRefzbMATHGoogle Scholar
  9. 9.
    Xue, L., Yin, J., Ji, Z., Jiang, L.: A particle swarm optimization for hidden Markov model training. In: 8th International Conference on Signal Processing, vol. 1. IEEE (2006)Google Scholar

Copyright information

© Springer International Publishing AG 2016

Authors and Affiliations

  • Jiaxi Li
    • 1
    Email author
  • Junfu Yin
    • 1
  • Yuk Ying Chung
    • 1
  • Feng Sha
    • 1
  1. 1.School of Information TechnologiesUniversity of SydneySydneyAustralia

Personalised recommendations