Skip to main content

A Semi-supervised Method to Classify Educational Videos

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11734))

Abstract

Currently, topic modelling has regained interest in the world of e-learning, where it is necessary to search through an extensive database of online learning objects, mainly in the form of educational videos. The main problem is the retrieval of those learning objects that are best suited to students’ keyword searches. Today this problem is still an open topic. According to this, this paper aims to provide a more sophisticated method to improve the search of educational videos and thus show those that best fit the learning objectives of students. To do this, a semi-supervised method to cluster and classify a large data-set of educational videos from the Universitat Politècnica de València has been developed. The proposed method employs open content resources from Wikipedia as labelled data to train the model.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    edX MOOC platform: https://www.edx.org/.

  2. 2.

    https://tfhub.dev/google/nnlm-es-dim128-with-normalization/1.

  3. 3.

    We decided to use this embeddings method since its suitability is well demonstrated by state of the art in this field. Furthermore, it was already trained on a considerable amount of data.

  4. 4.

    This design decision was supported by the fact that it behaved better than other algorithms and it required a smaller data-set than the commonly used neural networks.

References

  1. Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. J. Mach. Learn. Res. 3(Feb), 1137–1155 (2003)

    MATH  Google Scholar 

  2. Berkhin, P.: A survey of clustering data mining techniques. In: Kogan, J., Nicholas, C., Teboulle, M. (eds.) Grouping Multidimensional Data, pp. 25–71. Springer, Heidelberg (2006). https://doi.org/10.1007/3-540-28349-8_2

    Chapter  Google Scholar 

  3. Downes, S.: Learning objects: resources for distance education worldwide. Int. Rev. Res. Open Distrib. Learn. 2(1) (2001)

    Google Scholar 

  4. Jiang, Z., Zhang, Y., Li, X.: MOOCon: a framework for semi-supervised concept extraction from MOOC content. In: Bao, Z., Trajcevski, G., Chang, L., Hua, W. (eds.) DASFAA 2017. LNCS, vol. 10179, pp. 303–315. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-55705-2_24

    Chapter  Google Scholar 

  5. Labutov, I., Huang, Y., Brusilovsky, P., He, D.: Semi-supervised techniques for mining learning outcomes and prerequisites. In: Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 907–915. ACM (2017)

    Google Scholar 

  6. Overell, S., Sigurbjörnsson, B., Van Zwol, R.: Classifying tags using open content resources. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 64–73. ACM (2009)

    Google Scholar 

  7. Papadimitriou, C.H., Raghavan, P., Tamaki, H., Vempala, S.: Latent semantic indexing: a probabilistic analysis. J. Comput. Syst. Sci. 61(2), 217–235 (2000)

    Article  MathSciNet  Google Scholar 

  8. Steyvers, M., Griffiths, T.: Probabilistic topic models. Handb. Latent Semant. Anal. 427(7), 424–440 (2007)

    Google Scholar 

  9. Tucker, B.: The flipped classroom. Educ. Next 12(1), 82–83 (2012)

    Google Scholar 

  10. Wallach, H.M.: Topic modeling: beyond bag-of-words. In: Proceedings of the 23rd International Conference on Machine learning, pp. 977–984. ACM (2006)

    Google Scholar 

  11. Wang, L.: Support Vector Machines: Theory and Applications, vol. 177. Springer, Heidelberg (2005). https://doi.org/10.1007/b95439

    Book  MATH  Google Scholar 

  12. Zhu, X.J.: Semi-supervised learning literature survey. Technical report, University of Wisconsin-Madison, Department of Computer Sciences (2005)

    Google Scholar 

Download references

Acknowledgements

This work was partially supported by the project TIN2017-89156-R of the Spanish government, and by the grant program for the recruitment of doctors for the Spanish system of science and technology (PAID- 10-14) of the Universitat Politècnica de València.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stella Heras .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Stoica, A.S., Heras, S., Palanca, J., Julian, V., Mihaescu, M.C. (2019). A Semi-supervised Method to Classify Educational Videos. In: Pérez García, H., Sánchez González, L., Castejón Limas, M., Quintián Pardo, H., Corchado Rodríguez, E. (eds) Hybrid Artificial Intelligent Systems. HAIS 2019. Lecture Notes in Computer Science(), vol 11734. Springer, Cham. https://doi.org/10.1007/978-3-030-29859-3_19

Download citation

  • DOI: https://doi.org/10.1007/978-3-030-29859-3_19

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-030-29858-6

  • Online ISBN: 978-3-030-29859-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics