Dark Web pp 295-318 | Cite as

Extremist YouTube Videos

  • Hsinchun ChenEmail author
Part of the Integrated Series in Information Systems book series (ISIS, volume 30)


With the emergence of Web 2.0, sharing personal content, communicating ideas, and interacting with other online users in Web 2.0 communities have become daily routines for online users. User-generated data from Web 2.0 sites provide rich personal information, such as personal preferences and interests, and can be utilized to obtain insight about cyber communities and their social networks. Many studies have focused on leveraging user-generated information to analyze blogs and forums, but few studies have applied this approach to video-sharing web sites. In this chapter, we proposed a text-based framework for video content classification of online video-sharing web sites. Different types of user-generated data (e.g., titles, descriptions, and comments) were used as proxies for online videos, and three types of text features (lexical, syntactic, and content-specific features) were extracted. Three feature-based classification techniques (C4.5, Naïve Bayes, and SVM) were used to classify videos. To evaluate the proposed framework, user-generated data from candidate videos, which were identified by searching user-given keywords on YouTube, were first collected. Then, a subset of the collected data was randomly selected and manually tagged by users as our experiment data. The experimental results showed that the proposed approach was able to classify online videos based on users’ interests with accuracy rates up to 87.2%, and all three types of text features contributed to discriminating videos. SVM outperformed C4.5 and Naïve Bayes in our experiments. In addition, our case study further demonstrated that accurate video classification results are very useful for identifying implicit cyber communities on video-sharing web sites.


Support Vector Machine Text Feature Gaussian Mixture Model Semantic Concept Online Video 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.



This work was supported by the NSF Computer and Network Systems (CNS) Program, “(CRI: CRD) Developing a Dark Web Collection and Infrastructure for Computational and Social Sciences” (CNS-0709338), September 2007–August 2010.


  1. Abbasi, A., and Chen, H. (2005). Applying authorship analysis to extremist-group web forum messages. IEEE Intelligent Systems, 20(5), 67–75.CrossRefGoogle Scholar
  2. Abbasi, A., and Chen, H. (2008). Writeprints: A stylometric approach to identity-level identification and similarity detection in cyberspace. ACM Transactions on Information Systems, 26(2), 1–29.CrossRefGoogle Scholar
  3. Abbasi, A., Chen, H.-M., and Nunamaker, J. (2008a). Stylometric identification in electronic markets: Scalability and robustness. Journal of Management Information Systems, 25(1), 49–78.CrossRefGoogle Scholar
  4. Abbasi, A., Chen, H., and Salem, A. (2008b). Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums. ACM Transactions on Information Systems, 26(3), 1–34.CrossRefGoogle Scholar
  5. Amir, A., Basu, S., Iyengar, G., Lin, C.-Y., Naphade, M., Smith, J.R., et al. (2004). A multi-modal system for the retrieval of semantic video events. Computer Vision and Image Understanding, 96(2), 216–236.CrossRefGoogle Scholar
  6. Argamon, S., Šarić, M., and Stein, S. S. (2003). Style mining of electronic messages for multiple authorship discrimination: First results. Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, 475–480.Google Scholar
  7. Baayen, H., van Halteren, H., and Tweedie, F. (1996). Outside the cave of shadows: Using syntactic annotation to enhance authorship attribution. Literary and Linguistic Computing, 11(3), 121–132.CrossRefGoogle Scholar
  8. Borgne, H.L., Guérin-Dugué, A., and O’Connor, N.E. (2007). Learning midlevel image features for natural scene and texture classification. IEEE Transactions on Circuits and Systems forVideo Technology, 17(3), 286–297.CrossRefGoogle Scholar
  9. Burris, V., Smith, E., and Strahm, A. (2000). White supremacist networks on the Internet. Sociological Focus, 33(2), 215–235.CrossRefGoogle Scholar
  10. Caillol, H., Pieczynski, W., and Hillion, A. (1997). Estimation of fuzzy Gaussian mixture and unsupervised statistical image segmentation. IEEE Transactions on Image Processing, 6(3), 425–440.CrossRefGoogle Scholar
  11. Chau, M., and Xu, J. (2007). Mining communities and their relationships in blogs: A study of online hate groups. International Journal of Human–Computer Studies 65, 57–70.CrossRefGoogle Scholar
  12. Chellappa, R., Wilson, C.L., and Sirohey, S. (1995). Human and machine recognition of faces: A survey. Proceedings of the IEEE, 83(5), 705–741.CrossRefGoogle Scholar
  13. Chen, H., Shankaranarayanan, G., She, L., and Iyer, A. (1998). A machine learning approach to inductive query by examples: An experiment using relevance feedback, ID3, genetic algorithms, and simulated annealing. Journal of the American Society for Information Science and Technology, 49(8), 639–705.Google Scholar
  14. Chen, H., Thoms, S., and Fu, T. (2008). Cyber extremism in Web 2.0: An exploratory study of international Jihadist groups. IEEE International Conference on Intelligence and Security Informatics, 98–103.Google Scholar
  15. Das, S.R., and Chen, M.Y. (2007).Yahoo! for Amazon: Sentiment extraction from small talk on the web. Management Science, 53(9), 1375–1388.CrossRefGoogle Scholar
  16. De Vel, O. (2000). Mining e-mail authorship. Proceedings of Workshop on Text Mining, ACM International Conference on Knowledge Discovery and Data Mining (KDD’2000), Boston, MA.Google Scholar
  17. Diederich, J., Kindermann, J., Leopold, E., and Paass, G. (2000). Authorship attribution with support vector machines. Applied Intelligence, 19(1), 109–123.CrossRefzbMATHGoogle Scholar
  18. Dietterich, T.G., Hild, H., and Bakiri, G. (1990). A comparative study of ID3 and backpropagation for English text-to-speech mapping. Proceedings of the 7th International Conference on Machine Learning, 24–31.Google Scholar
  19. Dimitrova, N., Agnihotri, L., and Wei, G. (2000). Video classification based on HMM using text and faces. European Signal Processing Conference, Tampere, Finland.Google Scholar
  20. Ding, Y., Jacob, E.K., Zhang, Z., Foo, S., Yan, E., George, N.L., et al. (2009). Perspectives on social tagging. Journal of the American Society for Information Science and Technology, 60(12), 2388–2401.CrossRefGoogle Scholar
  21. Djeraba, C. (2002). Content-based multimedia indexing and retrieval. Multimedia, IEEE, 9(2), 18–22.CrossRefGoogle Scholar
  22. Duan, L.-Y., Xu, M., Chua, T.-S., Tian, Q., and Xu, C.-S. (2003). A mid-level representation framework for semantic sports video analysis. Proceedings of the 11th ACM international Conference on Multimedia, 33–44.Google Scholar
  23. Eickeler, S.,andMuller, S. (1999). Content-based video indexing of TV broadcast news using hidden Markov models. IEEE International Conference on Acoustics, Speech, and Signal Processing, 6, 2997–3000.Google Scholar
  24. Fischer, S., Lienhart, R., and Effelsberg, W. (1995). Automatic recognition of film genres. Proceedings of the 3rd ACM International Conference on Multimedia, 295–304.Google Scholar
  25. Forsyth, R.S., and Holmes, D.I. (1996). Feature finding for text classification. Literary and Linguistic Computing, 11(4), 163–174.CrossRefGoogle Scholar
  26. Fu, T., Abbasi, A., and Chen, H. (2008). A hybrid approach to web forum interactional coherence analysis. Journal of the American Society for Information Science and Technology, 59(8), 1195–1209.CrossRefGoogle Scholar
  27. Geisler, G., and Burns, S. (2007). Tagging video: Conventions and strategies of the YouTube community. Proceedings of the 7th ACM/IEEE-CS Joint Conference on Digital Libraries, 480–480.Google Scholar
  28. Gibert, X., Li, H., and Doermann, D. (2003). Sports video classification using HMMS. Proceedings of the 2003 International Conference on Multimedia and Expo, Baltimore, MD, 345–348.Google Scholar
  29. Girgensohn, A., and Foote, J. (1999). Video classification using transform coefficients. Proceedings of the Acoustics, Speech, and Signal Processing, 3045–3048.Google Scholar
  30. Guironnet, M., Pellerin, D., and Rombaut, M. (2005). Video classification based on low-level feature fusion model. Proceedings of the 13th European Signal Processing Conference, Antalya, Turkey.Google Scholar
  31. Henri, F. (1992). Computer conferencing and content analysis. In A. Kaye (Ed.), Collaborative Learning Through Computer Conferencing: The Najaden Papers, NewYork: Springer-Verlag, 117–136.CrossRefGoogle Scholar
  32. Hirst, G. and Feiguina, O. (2007). Bigrams of syntactic labels for authorship discrimination of short texts. Literary and Linguistic Computing, 22, 405–417.CrossRefGoogle Scholar
  33. Hsu, W., and Chang, S.-F. (2005). Visual cue cluster construction via information bottleneck principle and kernel density estimation. International Conference on Content-Based Image and Video Retrieval, 3568, 82–91.CrossRefGoogle Scholar
  34. Huang, J., Liu, Z., Wang, Y., Chen, Y., and Wong, E. (1999). Integration of multimodal features for video scene classification based on HMM. In IEEE Workshop Multimedia Signal Processing (MMSP-99) Copenhagen, Denmark, 53–58.Google Scholar
  35. Hung, M.-H., Hsieh, C.-H., and Kuo, C.-M. (2007). Rule-based event detection of broadcast baseball videos using mid-level cues. Proceedings of the 2nd International Conference on Innovative Computing, Information and Control, 240–240.Google Scholar
  36. Jiang, M., Jensen, E., Beitzel, S., and Argamon, S. (2004). Choosing the right bigrams for information retrieval. In Proceedings of the Meeting of the International Federation of Classification Societies.CrossRefGoogle Scholar
  37. Jing, F., Li, M., Zhang, H.-J., and Zhang, B. (2004). An efficient and effective region-based image retrieval framework. IEEE Transactions on Image Processing, 13(5), 699–709.CrossRefGoogle Scholar
  38. Koppel, M., and Schler, J. (2003). Exploiting stylistic idiosyncrasies for authorship attribution. Proceedings of the IJCAI Workshop on Computational Approaches to Style Analysis and Synthesis, 69–72.Google Scholar
  39. Koppel, M., Schler, J., and Argamon, S. (2009). Computational methods in authorship attribution. Journal of the American Society for Information Science and Technology, 60, 9–26.CrossRefGoogle Scholar
  40. Kumar, R., Raghavan, P., Rajagopalan, S., and Tomkins, A. (1999). Trawling the web for emerging cyber-communities. Computer Network, 31, (11–16), 1481–1493.CrossRefGoogle Scholar
  41. Lazebnik, S., Schmid, C., and Ponce, J. (2006). Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2, 2169–2178.Google Scholar
  42. Ledger, G.R., and Merriam, T.V.N. (1994). Shakespeare, Fletcher, and the two Noble Kinsmen. Literary and Linguistic Computing, 9, 235–248.CrossRefGoogle Scholar
  43. Lew, M.S., Sebe, N., Djeraba, C., and Jain, R. (2006). Content-based multimedia information retrieval: State of the art and challenges. ACM Transactions on Multimedia Computing, Communications and Applications, 2(1), 1–19.CrossRefGoogle Scholar
  44. Lewis, D. (1998). Naive (Bayes) at forty: The independence assumption in information retrieval. Machine Learning, 4–15.Google Scholar
  45. Lin,W.-H.,and Hauptmann, A. (2002). News video classification using SVM-based multimodal classifiers and combination strategies. Proceedings of the 10th ACM international Conference on Multimedia, 323–326.Google Scholar
  46. Lu, C., Drew, M.S., and Au, J. (2001). Classification of summarized videos using hidden Markov models on compressed chromaticity signatures. Proceedings of the 9th ACM International Conference on Multimedia, 479–482.Google Scholar
  47. Luo, J., and Boutell, M. (2005). Automatic image orientation detection via confidence-based integration of low-level and semantic cues. IEEE Transactions on Patent Analysis and Machine Intelligence, 27(5), 715–726.CrossRefGoogle Scholar
  48. Ma,Y.-F., and Zhang, H.-J. (2003). Motion pattern-based video classification and retrieval. EURASIP Journal on Applied Signal Processing, 2003(1), 199–208.Google Scholar
  49. McCallum, A., and Nigam, K. (1998). A comparison of event models for Naïve Bayes text classification. Proceedings of the AAAI Workshop on Learning for Text Categorization, 41–48.Google Scholar
  50. Mendenhall, T.C. (1887). The characteristic curves of composition. Science, 11(11), 237–249.CrossRefGoogle Scholar
  51. Messina, A., Montagnuolo, M., and Sapino, M.L. (2006). Characterizing multimedia objects through multimodal content analysis and fuzzy fingerprints. In IEEE International Conference on Signal-Image Technology and Internet-Based Systems, IEEE Computer Society Press, Los Alamitos.Google Scholar
  52. Mitra, M., Buckley, C., Singhal, A., and Cardie, C. (1997). An analysis of statistical and syntactic phrases. Proceedings of the 5th RIAO Conference, Computer-Assisted Information Searching on the Internet, 200–214.Google Scholar
  53. Montagnuolo, M., and Messina, A. (2007). Automatic genre classification of TV programmes using Gaussian mixture models and neural networks. Proceedings of the 18th International Conference on Database and Expert Systems Applications, 99–103.Google Scholar
  54. Oliveira de Melo, A.C., Marcos de Moraes, R., and dos Santos Machado, L. (2003). Gaussian mixture models for supervised classification of remote sensing multispectral images. Lecture Notes in Computer Science, 2905, 440–447.CrossRefGoogle Scholar
  55. O’Reilly, T. (2005). What is Web 2.0? Design patterns and business models for the next generation of software. Available at:
  56. Pan, J.-Y., and Faloutsos, C. (2002). VideoCube: A novel tool for video mining and classification. Proceedings of the 5th International Conference on Asian Digital Libraries: Digital Libraries: People, Knowledge, and Technology, 194–205.Google Scholar
  57. Peng, F., Schuurmans, D., Keselj, V., and Wang, S. (2003). Automated authorship attribution with character level language models. Proceedings of the 10th Conference of the European Chapter of the Association for Computational Linguistics, 267–274.Google Scholar
  58. Pieczynski,W., Bouvrais, J.,andMichel, C. (2000). Estimation of generalized mixture in the case of correlated sensors. IEEE Transactions on Image Processing, 9(2), 308–312.CrossRefGoogle Scholar
  59. Quinlan, J.R. (1986). Induction of decision trees. In J.W. Shavlik and T.G. Dietterich (Ed.), Readings in Machine Learning, Morgan Kaufmann, San Mateo, CA, 81–106.Google Scholar
  60. Rabiner, L.R., and Juang, B.H. (1986). A tutorial on hidden Markov models. IEEE ASSP Magazine, 4–15.CrossRefGoogle Scholar
  61. Rasheed, Z., Sheikh, Y., and Shah, M. (2003). Semantic film preview classification using low-level computable features. Proceedings of the 3rd International Workshop on Multimedia Data and Document Engineering.Google Scholar
  62. Rourke, L., Anderson, T., Garrison, D.R., and Archer, W. (2001). Methodological issues in the content analysis of computer conference transcripts. International Journal of Artificial Intelligence in Education, 12, 8–22.Google Scholar
  63. Sahami, M. (1996). Learning limited dependence Bayesian classifiers. Proceedings of the 2nd International Conference on Knowledge Discovery and Data Mining, 335–338.Google Scholar
  64. Salem, A., Reid, E., and Chen, H. (2008). Multimedia content coding and analysis: Unraveling the content of Jihadi extremist groups’ videos. Studies in Conflict and Terrorism, 31(7), 605–626.CrossRefGoogle Scholar
  65. Samal, A., and Iyengar, P.A. (1992). Automatic recognition and analysis of human faces and facial expressions: A survey. Pattern Recognition, 25(1), 65–77.CrossRefGoogle Scholar
  66. Schafer, J. (2002). Spinning the web of hate: Web-based hate propagation by extremist organizations. Journal of Criminal Justice and Popular Culture, 9(2), 69–88.Google Scholar
  67. Shannon, C.E. (1948). A mathematical theory of communication. Bell System Technical Journal, 27(4), 379–423.MathSciNetCrossRefGoogle Scholar
  68. Sharma, A.S., and Elidrisi, M. (2008). Classification of multi-media content (videos onYouTube) using tags and focal points. Unpublished manuscript.Google Scholar
  69. Smoliar, S.W., and HongJiang, Z. (1994). Content based video indexing and retrieval. Multimedia, IEEE, 1(2), 62–72.CrossRefGoogle Scholar
  70. Tweedie, F., and Baayen, R. (1998). How variable may a constant be? Measures of lexical richness in perspective. Computers and the Humanities, 32(5), 323–352.CrossRefGoogle Scholar
  71. Vapnik, V.N. (1995). The nature of statistical learning theory. New York: Springer-Verlag.CrossRefGoogle Scholar
  72. Vapnik, V.N. (1998). Statistical learning theory. New York, NY, Wiley-Interscience.zbMATHGoogle Scholar
  73. Vasconcelos, N., and Lippman, A. (2000). Statistical models of video structure for content analysis and characterization. IEEE Transactions on Image Processing, 9(1), 3–19.CrossRefGoogle Scholar
  74. Xu, D., and Chang, S.-F. (2008).Video event recognition using kernel methods with multilevel temporal alignment. IEEE Transactions on Patent Analysis and Machine Intelligence, 30(11), 1985–1997.CrossRefGoogle Scholar
  75. Xu, L.-Q., and Li, Y. (2003). Video classification using spatial-temporal features and PCA. International Conference on Multimedia and Expo, 3, 485–488.Google Scholar
  76. Yang,Y., and Pedersen, J. O. (1997). A comparative study on feature selection in text categorization. Proceedings of the 14th International Conference on Machine Learning, 412–420.Google Scholar
  77. Zhang, J., Marszalek, M., Lazebnik, S., and Schmid, C. (2007). Local features and kernels for classification of texture and object categories: A comprehensive study. International Journal of Computer Vision, 73(2), 213–238.CrossRefGoogle Scholar
  78. Zheng, R., Li, J., Chen, H., and Huang, Z. (2006). A framework for authorship identification of online messages: Writing-style features and classification techniques. Journal of the American Society for Information Science and Technology, 57(3), 378–393.CrossRefGoogle Scholar
  79. Zhou, W., Vellaikal, A., and Kuo, C.C.J. (2000). Rule-based video classification system for basketball video indexing. Proceedings of ACM Workshops on Multimedia, 213–216.Google Scholar
  80. Zhou,Y.-H., Cao,Y.-D., Zhang, L.-F., and Zhang, H.-X. (2005). An SVM-based soccer video shot classification. Proceedings of 2005 International Conference on Machine Learning and Cybernetics, 9, 5398–5403.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC 2012

Authors and Affiliations

  1. 1.Department of Management Information SystemsUniversity of ArizonaTusconUSA

Personalised recommendations