Skip to main content

Trend Analysis of Machine Learning Research Using Topic Network Analysis

  • Conference paper
  • First Online:

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 799))

Abstract

In this paper, a topic network analysis approach is proposed which integrates topic modeling and social network analysis. We collected 16,855 scientific papers from six top journals in the field of machine learning published from 1997 to 2016 and analyzed them with the topic network. The dataset is break down into 4 intervals to identify topic trends and performed the time-series analysis of topic network. Our experimental results show centralization of the topic network has the highest score from 2002 to 2006, and decreases for next 5 years and increases again. For last 5 years, centralization of the degree centrality and closeness centrality increases, while centralization of the betweenness centrality decreases again. Also, data analytic and computer vision are identified as the most interrelated topic among other topics. Topics with the highest degree centrality evolve component analysis, text mining, biometric and computer vision according to time. Our approach extracts the interrelationships of topics, which cannot be detected with conventional topic modeling approaches, and provides topical trends of machine learning research.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Kim, C., Hong, Y.-S.: Classification techniques for XML document using text mining. J. Korea Soc. Comput. Inf. 11(2), 15–23 (2006)

    MathSciNet  Google Scholar 

  2. Moon, J.-P., Lee, W.-S., Chang, J.-H.: A proper folder recommendation technique using frequent item sets for efficient e-mail classification. J. Korea Soc. Comput. Inf. 16(2), 33–46 (2011)

    Article  Google Scholar 

  3. Blei, D.M., Andrew, Y.N., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)

    MATH  Google Scholar 

  4. Park, J.H., Song, M.: A study on the research trends in library & information science in Korea using topic modeling. J. Korean Soc. Inf. Manag. 30(1), 7–32 (2013)

    Google Scholar 

  5. Blei, D.M.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012)

    Article  Google Scholar 

  6. Wasserman, S., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge (1994)

    Book  Google Scholar 

  7. Duvvuru, A., Kamarthi, S., Sultornsanee, S.: Undercovering research trends: network analysis of keywords in scholarly articles. In: Proceedings of the 9th International Joint Conference on Computer Science and Software Engineering, pp. 265–270 (2012)

    Google Scholar 

  8. Griffiths, T.L., Steyvers, M.: Finding scientific topics. In: Proceedings of the National Academy of Sciences of the USA, vol. 101, no. 1, pp. 5228–5235, April 2004

    Article  Google Scholar 

  9. Bae, J., Han, N., Song, M.: Twitter issue tracking system by topic modeling techniques. J. Intell. Inf. Syst. 20(2), 109–122 (2014)

    Google Scholar 

  10. Blei, D.M., Lafferty, J.D.: Correlated topic models. In: Proceedings of Neural Information Processing Systems, pp. 147–154 (2005)

    Google Scholar 

  11. Mei, Q., et al.: Topic modeling with network regularization. In: Proceedings of International Conference on World Wide Web, pp. 101–110 (2008)

    Google Scholar 

  12. Mao, X.-L., et al.: SSHLDA: a semi-supervised hierarchical topic model. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 800–809 (2012)

    Google Scholar 

  13. Wang, X., McCallum, A.: Topics over time: a non-Markov continuous-time model of topical trends. In: Proceedings of the 12th International Conference on Knowledge Discovery and Data Mining, pp. 424–433 (2006)

    Google Scholar 

  14. Kim, C., Hong, Y.-S.: Trend analysis of data mining research using topic network analysis. J. Korea Soc. Comput. Inf. 11(5), 141–148 (2016)

    Google Scholar 

  15. R: The R Project for Statistical Computing. https://www.r-project.org/

  16. Gruen, B., Hornik, K.: topicmodels: an R package for fitting topic models. J. Stat. Softw. 40(13), 1–29 (2011)

    Google Scholar 

  17. Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)

    Article  Google Scholar 

  18. Delp, P., Thesen, A., Motiwalla, J., Seshardi, N. (eds.) System Tools for Project Planning. International Development Institute, Bloomington (1977)

    Google Scholar 

  19. Freeman, L.C.: Centrality in social networks: conceptual clarification. Soc. Netw. 1, 215–239 (1979)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Deepak Sharma .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Sharma, D., Kumar, B., Chand, S. (2018). Trend Analysis of Machine Learning Research Using Topic Network Analysis. In: Panda, B., Sharma, S., Roy, N. (eds) Data Science and Analytics. REDSET 2017. Communications in Computer and Information Science, vol 799. Springer, Singapore. https://doi.org/10.1007/978-981-10-8527-7_4

Download citation

  • DOI: https://doi.org/10.1007/978-981-10-8527-7_4

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-10-8526-0

  • Online ISBN: 978-981-10-8527-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics