Skip to main content

Video Knowledge Discovery Based on Convolutional Neural Network

  • Conference paper
  • First Online:
Cloud Computing, Smart Grid and Innovative Frontiers in Telecommunications (CloudComp 2019, SmartGift 2019)

Abstract

Under the background of Internet+education, video course resources are becoming more and more abundant, at the same time, the Internet has a large number of not named or named non-standard courses video. It is increasingly important to identify courses name in these abundant video course teaching resources to improve learner efficiency. This study utilizes a deep neural network framework that incorporates a simple to implement transformation-invariant pooling operator (TI-pooling), after the audio and image information in course video is processed by the convolution layer and pooling layer of the model, the TI-pooling operator will further extract the features, so as to extract the most important information of course video, and we will identify the course name from the extracted course video information. The experimental results show that the accuracy of course name recognition obtained by taking image and audio as the input of CNN model is higher than that obtained by only image, only audio and only image and audio without ti-pooling operation.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Fayyad, U, Shapiro, G.P., Smyth, P.: From data mining to knowledge discovery in databases [EB/OL]. http://www.kdnuggets.com/gpspubs/imag-kdd-overview-1996-Fayyad.Pdf. Accessed 22 Jun 2003

  2. Wang, A., Wang, J., Lin, H., et al.: A multiple distributed representation method based on neural network for biomedical event extraction. BMC Med. Inform. Decis. Mak. 17(S3), 171 (2017)

    Article  Google Scholar 

  3. Lishuang, L., Yang, L., Meiyue, Q.: Extracting biomedical events with parallel multi-pooling convolutional neural networks. IEEE/ACM Trans. Comput. Biol. Bioinf. 1–1 (2018)

    Google Scholar 

  4. LeCunand, Y. Bengio, Y.: Convolutional networks for images, speech, and time series. In: The Handbook of Brain Theory and Neural Networks, vol. 3361, no. 10, p. 1 (1995)

    Google Scholar 

  5. Schmidhuber J. Multi-column deep neural networks for image classification. In: Computer Vision & Pattern Recognition (2012)

    Google Scholar 

  6. Karpathy, A., Toderici, G., Shetty, S., et al.: Large-scale video classification with convolutional neural networks. In: Computer Vision & Pattern Recognition (2014)

    Google Scholar 

  7. Peng, S.: Application of knowledge discovery in subject service. Northeast Normal University

    Google Scholar 

  8. Wu, D.: Prediction of employee turnover based on database knowledge discovery. Sci. Technol. Innov. 14 (2019)

    Google Scholar 

  9. Xu, R., Wang, Q.Q.: PhenoPredict: a disease phenome-wide drug repositioning approach towards schizophrenia drug discovery. J. Biomed. Inform. 56(C), 348–355 (2015)

    Article  Google Scholar 

  10. Li, X.: Decision analysis of banks based on data mining and knowledge discovery. Fintech Times 1, 56–59 (2014)

    Google Scholar 

  11. Kerzendorf, W.E. Knowledge discovery through text-based similarity searches for astronomy literature (2017)

    Google Scholar 

  12. Yudistira, N., Akbar, S.R., Arwan, A.: Using strongly typed genetic programming for knowledge discovery of course quality from e-Learning’s web log. In: 2013 5th International Conference on Knowledge and Smart Technology (KST) (2013)

    Google Scholar 

  13. Alfonseca, E., Rodríguez, P., Pérez, D.: An approach for automatic generation of adaptive hypermedia in education with multilingual knowledge discovery techniques. Comput. Educ. 49(2), 0–513 (2007)

    Article  Google Scholar 

  14. Lowe, D.G.: Object recognition from local scale-invariant features. In: The proceedings of the seventh IEEE international conference on Computer Vision 1999, vol. 2, pp. 1150–1157. IEEE (1999)

    Google Scholar 

  15. Lazebnik, S., Schmid, C., Ponce, J., et al.: Semi-local affine parts for object recognition. In: British Machine Vision Conference (BMVC 2004), vol. 2, pp. 779–788 (2004)

    Google Scholar 

  16. Maas, A.L., Hannun, A.Y., Ng, A.Y.: Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of ICML, vol. 30, no. 1 (2013)

    Google Scholar 

  17. He, K., Zhang, X., Ren, S., Sun, J.: Delving deep into rectifiers: surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 1026–1034 (2015)

    Google Scholar 

  18. Xu, B., Wang, N., Chen, T., Li, M.: Empirical evaluation of rectified activations in convolutional network. arXiv preprint arXiv:1505.00853 (2015)

  19. Qiu, S, Xu, X, Cai, B.: FReLU: flexible rectified linear units for improving convolutional neural networks (2017)

    Google Scholar 

  20. Laptev, D., Savinov, N., Buhmann, J.M., et al.: TI-POOLING: transformation-invariant pooling for feature learning in convolutional neural networks (2016)

    Google Scholar 

Download references

Acknowledgement

This work was financially supported by the Teaching Reform Research Project of Undergraduate Colleges and Universities of Shandong Province (Z2016Z036), the Teaching Reform Research Project of Shandong University of Finance and Economics (jy2018062891470, jy201830, jy201810), Shandong Provincial Social Science Planning Research Project (18CHLJ08), Scientific Research Projects of Universities in Shandong Province (J18RA136).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to LiZhen Cui .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Lin, J., Liu, C., Cui, L., Huang, W., Song, R., Zhao, Y. (2020). Video Knowledge Discovery Based on Convolutional Neural Network. In: Zhang, X., Liu, G., Qiu, M., Xiang, W., Huang, T. (eds) Cloud Computing, Smart Grid and Innovative Frontiers in Telecommunications. CloudComp SmartGift 2019 2019. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 322. Springer, Cham. https://doi.org/10.1007/978-3-030-48513-9_28

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-48513-9_28

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-48512-2

  • Online ISBN: 978-3-030-48513-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics