Extremist YouTube Videos
Abstract
With the emergence of Web 2.0, sharing personal content, communicating ideas, and interacting with other members of online communities have become daily routines for online users. User-generated data from Web 2.0 sites provide rich personal information, such as personal preferences and interests, and can be used to gain insight into cyber communities and their social networks. Many studies have leveraged user-generated information to analyze blogs and forums, but few have applied this approach to video-sharing web sites. In this chapter, we propose a text-based framework for classifying the content of videos on online video-sharing web sites. Different types of user-generated data (e.g., titles, descriptions, and comments) were used as proxies for online videos, and three types of text features (lexical, syntactic, and content-specific features) were extracted. Three feature-based classification techniques (C4.5, Naïve Bayes, and SVM) were used to classify videos. To evaluate the proposed framework, we first collected user-generated data from candidate videos, which were identified by searching YouTube with user-given keywords. A subset of the collected data was then randomly selected and manually tagged by users to form our experimental data set. The experimental results showed that the proposed approach classified online videos according to users’ interests with accuracy rates of up to 87.2%, and that all three types of text features contributed to discriminating videos. SVM outperformed C4.5 and Naïve Bayes in our experiments. In addition, our case study demonstrated that accurate video classification results are useful for identifying implicit cyber communities on video-sharing web sites.
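As a rough illustration of the pipeline the abstract describes, the sketch below turns a video's title, description, and comments into lexical, syntactic, and content-specific features and then trains a small multinomial Naïve Bayes classifier on them. It is a minimal sketch under stated assumptions, not the chapter's implementation: the function-word list, the feature names, the toy training videos, and the labels are all hypothetical, and the classifier here uses only the count-valued features because the lexical ratios would need a learner that accepts real-valued inputs (such as an SVM).

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny function-word list (an assumption for illustration).
FUNCTION_WORDS = ("the", "of", "and", "a", "to", "in", "is", "this", "with", "for")


def extract_features(title, description, comments):
    """Build one feature dictionary from the text that accompanies a video."""
    text = " ".join([title, description, *comments])
    tokens = re.findall(r"[a-z']+", text.lower())
    n = len(tokens)
    feats = Counter()
    # Lexical features: simple word-level statistics (real-valued).
    feats["lex:avg_word_len"] = sum(map(len, tokens)) / n if n else 0.0
    feats["lex:vocab_richness"] = len(set(tokens)) / n if n else 0.0
    # Syntactic features: function-word and punctuation counts.
    for word in FUNCTION_WORDS:
        feats["syn:fw:" + word] = tokens.count(word)
    for mark in "!?.,":
        feats["syn:punct:" + mark] = text.count(mark)
    # Content-specific features: counts of the remaining words.
    for tok in tokens:
        if tok not in FUNCTION_WORDS:
            feats["con:" + tok] += 1
    return feats


class NaiveBayes:
    """Multinomial Naive Bayes with add-one smoothing over count features."""

    def fit(self, feature_dicts, labels):
        self.class_counts = Counter(labels)
        self.feat_counts = {c: Counter() for c in self.class_counts}
        self.vocab = set()
        for feats, label in zip(feature_dicts, labels):
            for name, value in feats.items():
                if name.startswith("lex:"):
                    continue  # real-valued ratios are skipped by this classifier
                self.feat_counts[label][name] += value
                self.vocab.add(name)
        return self

    def predict(self, feats):
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            score = math.log(n_docs / total_docs)  # log prior
            denom = sum(self.feat_counts[label].values()) + len(self.vocab)
            for name, value in feats.items():
                if name.startswith("lex:") or name not in self.vocab:
                    continue  # ignore unseen and real-valued features
                score += value * math.log((self.feat_counts[label][name] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy, made-up training data: (title, description, comments) and a manual tag.
TRAIN = [
    (("Cute cat compilation", "Funny cats playing", ["so cute!", "love this cat"]), "pets"),
    (("Kitten plays with dog", "A kitten and a dog", ["adorable kitten"]), "pets"),
    (("Python tutorial for beginners", "Learn python programming", ["great tutorial", "thanks!"]), "programming"),
    (("Sorting algorithms in python", "Code walkthrough", ["nice code"]), "programming"),
]

model = NaiveBayes().fit(
    [extract_features(*video) for video, _ in TRAIN],
    [label for _, label in TRAIN],
)
new_video = extract_features("Funny kitten video", "cute cat", ["love it"])
prediction = model.predict(new_video)
```

Keeping the three feature families under separate name prefixes makes it easy to drop one family and retrain, which is one simple way to check whether each family contributes to discriminating videos.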
Keywords
Support Vector Machine, Text Feature, Gaussian Mixture Model, Semantic Concept, Online Video
Acknowledgments
This work was supported by the NSF Computer and Network Systems (CNS) Program, “(CRI: CRD) Developing a Dark Web Collection and Infrastructure for Computational and Social Sciences” (CNS-0709338), September 2007–August 2010.