Abstract
Can we model the temporal evolution of topics in Web image collections? If so, can we exploit this understanding of dynamics to solve novel visual problems or improve recognition performance? These two challenging questions motivate this work. We propose a nonparametric approach to modeling and analyzing topical evolution in image sets. We develop a scalable and parallelizable method based on sequential Monte Carlo to construct the similarity network of a large-scale dataset, which provides a base representation for a wide range of dynamics analyses. We report several experimental results that support the usefulness of image dynamics, using datasets of 47 topics gathered from Flickr. First, we present observations such as the tracking of subtopic evolution and outbreak detection, which cannot be obtained from conventional image sets. Second, we show the complementary benefits that images provide over analysis of the associated text. Finally, we show that training with temporal association significantly improves recognition performance.
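The abstract names sequential Monte Carlo as the basis of the method but gives no algorithmic detail. As a generic illustration only, the sketch below shows the propagate, weight, and resample loop that sequential Monte Carlo methods share, applied to a toy one-dimensional tracking problem. It is not the authors' network-construction procedure; the random-walk motion model, the Gaussian likelihood, and all names and parameters (`particle_filter`, `systematic_resample`, `process_std`, `obs_std`) are assumptions made for this example.

```python
import math
import random


def systematic_resample(weights, rng):
    """Draw len(weights) indices from normalized weights by systematic resampling."""
    n = len(weights)
    start = rng.random() / n          # single random offset in [0, 1/n)
    indices = []
    i, cumulative = 0, weights[0]
    for k in range(n):
        u = start + k / n             # evenly spaced points through [0, 1)
        while u >= cumulative and i < n - 1:
            i += 1
            cumulative += weights[i]
        indices.append(i)
    return indices


def particle_filter(observations, n_particles=500,
                    process_std=1.0, obs_std=1.0, seed=0):
    """Toy bootstrap particle filter; returns one state estimate per observation."""
    rng = random.Random(seed)
    particles = [rng.gauss(0.0, 1.0) for _ in range(n_particles)]
    estimates = []
    for y in observations:
        # 1. Propagate each particle through the (assumed) random-walk model.
        particles = [x + rng.gauss(0.0, process_std) for x in particles]
        # 2. Weight each particle by the (assumed) Gaussian likelihood of y.
        weights = [math.exp(-0.5 * ((y - x) / obs_std) ** 2) for x in particles]
        total = sum(weights)
        if total == 0.0:
            # All likelihoods underflowed; fall back to uniform weights.
            weights = [1.0 / n_particles] * n_particles
        else:
            weights = [w / total for w in weights]
        # Weighted mean of the particles is the state estimate at this step.
        estimates.append(sum(w * x for w, x in zip(weights, particles)))
        # 3. Resample so that high-weight particles are duplicated.
        chosen = systematic_resample(weights, rng)
        particles = [particles[j] for j in chosen]
    return estimates
```

Because each particle is propagated and weighted independently of the others, those two steps parallelize naturally, which is one common reason such methods are described as scalable.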
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Kim, G., Xing, E.P., Torralba, A. (2010). Modeling and Analysis of Dynamic Behaviors of Web Image Collections. In: Daniilidis, K., Maragos, P., Paragios, N. (eds) Computer Vision – ECCV 2010. ECCV 2010. Lecture Notes in Computer Science, vol 6315. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15555-0_7
DOI: https://doi.org/10.1007/978-3-642-15555-0_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15554-3
Online ISBN: 978-3-642-15555-0
eBook Packages: Computer Science, Computer Science (R0)