Abstract
One category of videos usually contains the same thematic pattern, e.g., the spin action in skating videos. The discovery of the thematic pattern is essential to understand and summarize the video contents. This article addresses two critical issues in mining thematic video patterns: (1) automatic discovery of thematic patterns without any training or supervision information, and (2) accurate localization of the occurrences of all thematic patterns in videos. The major contributions are twofold. First, we formulate the thematic video pattern discovery as a cohesive subgraph selection problem by finding a subset of visual words that are spatio-temporally collocated. Then spatio-temporal branch-and-bound search can locate all instances accurately. Second, a novel method is proposed to efficiently find the cohesive subgraph of maximum overall mutual information scores. Our experimental results on challenging commercial and action videos show that our approach can discover different types of thematic patterns despite variations in scale, view-point, color, and lighting conditions, or partial occlusions. Our approach is also robust to the videos with cluttered and dynamic backgrounds.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Blei, D.M., Ng, A.Y., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003). doi:10.1162/jmlr.2003.3.4-5.993
Chen, C., Mangasarian, O.L.: A class of smoothing functions for nonlinear and mixed complementarity problems. Comput. Optim. Appl. 5, 97–138 (1996). doi:10.1007/BF00249052
Chum, O., Matas, J.: Large-scale discovery of spatially related images. IEEE Trans. Pattern Anal. Mach. Intell. 32(2), 371–377 (2010). doi:10.1109/TPAMI.2009.166
Du, L., Buntine, W.L., Jin, H.: Sequential latent dirichlet allocation: Discover underlying topic structures within a document. In: Proceedings of the 2010 IEEE International Conference on Data Mining, ICDM ’10, pp. 148–157. IEEE Computer Society (2010). doi:10.1109/ICDM.2010.51
Gao, J., Hu, Y., Liu, J., Yang, R.: Unsupervised learning of high-order structural semantics from images. In: IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27–October 4, 2009, pp. 2122–2129 (2009). doi:10.1109/ICCV.2009.5459465
Han, J., Cheng, H., Xin, D., Yan, X.: Frequent pattern mining: current status and future directions. Data Min. Knowl. Discov. 15, 55–86 (2007). doi:10.1007/s10618-006-0059-1. http://portal.acm.org/citation.cfm?id=1275092.1275097
Hong, P., Huang, T.S.: Spatial pattern discovery by learning a probabilistic parametric model from multiple attributed relational graphs. Discrete Appl. Math. 139(1–3), 113–135 (2004). doi:10.1016/j.dam.2002.11.007
Kim, G., Xing, E.P.: Jointly aligning and segmenting multiple web photo streams for the inference of collective photo storylines. In: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR ’13, pp. 620–627. IEEE Computer Society, Washington, DC (2013). doi:10.1109/CVPR.2013.86
Lampert, C.H., Blaschko, M.B., Hofmann, T.: Efficient subwindow search: a branch and bound framework for object localization. TPAMI 31(12), 2129–2142 (2009). doi:10.1109/TPAMI.2009.144
Laptev, I.: On space-time interest points. Int. J. Comput. Vision 64(2–3), 107–123 (2005). doi:10.1007/s11263-005-1838-7
Lee, Y.J., Grauman, K.: Shape discovery from unlabeled image collections. In: 2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2009), 20–25 June 2009, Miami, Florida, USA, pp. 2254–2261 (2009). doi:10.1109/CVPRW.2009.5206698
Li, Q., Wu, J., Tu, Z.: Harvesting mid-level visual concepts from large-scale internet images. In: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR ’13, pp. 851–858. IEEE Computer Society, Washington, DC (2013). doi:10.1109/CVPR.2013.115
Liu, D., Hua, G., Chen, T.: A hierarchical visual model for video object summarization. TPAMI (2010). http://doi.ieeecomputersociety.org/10.1109/TPAMI.2010.31
Liu, J., Liu, Y.: Grasp recurring patterns from a single view. In: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR ’13, pp. 2003–2010. IEEE Computer Society, Washington, DC (2013). doi:10.1109/CVPR.2013.261
Lowe, D.G.: Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vision 60(2), 91–110 (2004). doi:10.1023/B:VISI.0000029664.99615.94
Luo, Y., Zhao, G., Yuan, J.: Thematic saliency detection using spatial-temporal context. In: Proceedings of the 2013 IEEE International Conference on Computer Vision Workshops, ICCVW ’13, pp. 347–353. IEEE Computer Society, Washington, DC (2013). doi:10.1109/ICCVW.2013.53
Ng, K.M.: A continuation approach for solving nonlinear optimization problems with discrete variables. Ph.d. Dissertation, Stanford University (2002)
Pavan, M., Pelillo, M.: Dominant sets and pairwise clustering. TPAMI 29, 167–172 (2007). doi:10.1109/TPAMI.2007.10
Rodriguez, M.D., Ahmed, J., Shah, M.: Action MACH a spatio-temporal maximum average correlation height filter for action recognition. In: 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 24–26 June 2008, Anchorage, Alaska (2008). doi:10.1109/CVPR.2008.4587727
Rubinstein, M., Joulin, A., Kopf, J., Liu, C.: Unsupervised joint object discovery and segmentation in internet images. In: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR ’13, pp. 1939–1946. IEEE Computer Society, Washington, DC (2013). doi:10.1109/CVPR.2013.253
Russell, B.C., Freeman, W.T., Efros, A.A., Sivic, J., Zisserman, A.: Using multiple segmentations to discover objects and their extent in image collections. In: Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR ’06, vol. 2, pp. 1605–1614. IEEE Computer Society (2006). doi:|DOIurl10.1109/CVPR.2006.326
Shan, M.K., Wei, L.Y.: Algorithms for discovery of spatial co-orientation patterns from images. Expert Syst. Appl. 37, 5795–5802 (2010). doi:10.1016/j.eswa.2010.02.028
Tang, K., Sukthankar, R., Yagnik, J., Fei-Fei, L.: Discriminative segment annotation in weakly labeled video. In: Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, CVPR ’13, pp. 2483–2490. IEEE Computer Society, Washington, DC (2013). doi:10.1109/CVPR.2013.321
Wang, D., Li, T., Ding, C.: Weighted feature subset non-negative matrix factorization and its applications to document understanding. In: ICDM10, pp. 541–550. IEEE Computer Society (2010). http://dx.doi.org/10.1109/ICDM.2010.47
Wang, H., Zhao, G., Yuan, J.: Visual pattern discovery in image and video data: a brief survey. Wiley Interdiscip. Rev.: Data Min. Knowl. Discov. 4(1), 24–37 (2014). doi:10.1002/widm.1110
Wang, L., Hua, G., Sukthankar, R., Xue, J., Zheng, N.: Video object discovery and co-segmentation with extremely weak supervision. In: Proceeding of the European Conference on Computer Vision (2014)
Todorovic, S., Ahuja, N.: Unsupervised category modeling, recognition, and segmentation in images. IEEE Trans. Pattern Anal. Mach. Intell. 30(12), 2158–2174 (2008). doi:10.1109/TPAMI.2008.24
Xie, Y., Yu, P.S.: Max-clique: A top-down graph-based approach to frequent pattern mining. In: ICDM10, pp. 1139–1144. IEEE Computer Society (2010). http://dx.doi.org/10.1109/ICDM.2010.73
Xu, J., Yuan, J., Wu, Y.: Learning spatio-temporal dependency of local patches for complex motion segmentation. Comput. Vis. Image Underst. 115, 334–351 (2011). doi:10.1016/j.cviu.2010.11.010
Yuan, J., Wu, Y., Yang, M.: From frequent itemsets to semantically meaningful visual patterns. ACM SIGKDD (2007). http://doi.acm.org/10.1145/1281192.1281284
Yuan, J., Liu, Z., Wu, Y.: Discriminative video pattern search for efficient action detection. IEEE Trans. Pattern Anal. Mach. Intell. 33(9), 1728–1743 (2011). doi:10.1109/TPAMI.2011.38
Zhang, D., Javed, O., Shah, M.: Video object co-segmentation by regulated maximum weight cliques. In: Proceeding of the European Conference on Computer Vision (2014)
Zhao, G., Yuan, J.: Discovering thematic patterns in videos via cohesive sub-graph mining. In: Proceedings of the 2011 IEEE 11th International Conference on Data Mining, ICDM ’11, pp. 1260–1265. IEEE Computer Society, Washington, DC (2011). doi:10.1109/ICDM.2011.55
Zhao, G., Yuan, J.: Mining and cropping common objects from images. In: Proceedings of the International Conference on Multimedia, MM ’10, pp. 975–978. ACM, New York (2010). doi:10.1145/1873951.1874127
Acknowledgments
This project is supported in part by MoE Tier-1 grant “Exploring Visual Relevance to Construct a Holistic Picture of Online News”.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Zhao, G., Yuan, J. (2015). Automatic Visual Pattern Discovery via Cohesive Subgraph Mining. In: Hua, G., Hua, XS. (eds) Mobile Cloud Visual Media Computing. Springer, Cham. https://doi.org/10.1007/978-3-319-24702-1_13
Download citation
DOI: https://doi.org/10.1007/978-3-319-24702-1_13
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-24700-7
Online ISBN: 978-3-319-24702-1
eBook Packages: Computer ScienceComputer Science (R0)