Abstract
In this paper, a topic network analysis approach is proposed which integrates topic modeling and social network analysis. We collected 16,855 scientific papers from six top journals in the field of machine learning published from 1997 to 2016 and analyzed them with the topic network. The dataset is break down into 4 intervals to identify topic trends and performed the time-series analysis of topic network. Our experimental results show centralization of the topic network has the highest score from 2002 to 2006, and decreases for next 5 years and increases again. For last 5 years, centralization of the degree centrality and closeness centrality increases, while centralization of the betweenness centrality decreases again. Also, data analytic and computer vision are identified as the most interrelated topic among other topics. Topics with the highest degree centrality evolve component analysis, text mining, biometric and computer vision according to time. Our approach extracts the interrelationships of topics, which cannot be detected with conventional topic modeling approaches, and provides topical trends of machine learning research.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Kim, C., Hong, Y.-S.: Classification techniques for XML document using text mining. J. Korea Soc. Comput. Inf. 11(2), 15–23 (2006)
Moon, J.-P., Lee, W.-S., Chang, J.-H.: A proper folder recommendation technique using frequent item sets for efficient e-mail classification. J. Korea Soc. Comput. Inf. 16(2), 33–46 (2011)
Blei, D.M., Andrew, Y.N., Jordan, M.I.: Latent dirichlet allocation. J. Mach. Learn. Res. 3, 993–1022 (2003)
Park, J.H., Song, M.: A study on the research trends in library & information science in Korea using topic modeling. J. Korean Soc. Inf. Manag. 30(1), 7–32 (2013)
Blei, D.M.: Probabilistic topic models. Commun. ACM 55(4), 77–84 (2012)
Wasserman, S., Faust, K.: Social Network Analysis: Methods and Applications. Cambridge University Press, Cambridge (1994)
Duvvuru, A., Kamarthi, S., Sultornsanee, S.: Undercovering research trends: network analysis of keywords in scholarly articles. In: Proceedings of the 9th International Joint Conference on Computer Science and Software Engineering, pp. 265–270 (2012)
Griffiths, T.L., Steyvers, M.: Finding scientific topics. In: Proceedings of the National Academy of Sciences of the USA, vol. 101, no. 1, pp. 5228–5235, April 2004
Bae, J., Han, N., Song, M.: Twitter issue tracking system by topic modeling techniques. J. Intell. Inf. Syst. 20(2), 109–122 (2014)
Blei, D.M., Lafferty, J.D.: Correlated topic models. In: Proceedings of Neural Information Processing Systems, pp. 147–154 (2005)
Mei, Q., et al.: Topic modeling with network regularization. In: Proceedings of International Conference on World Wide Web, pp. 101–110 (2008)
Mao, X.-L., et al.: SSHLDA: a semi-supervised hierarchical topic model. In: Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pp. 800–809 (2012)
Wang, X., McCallum, A.: Topics over time: a non-Markov continuous-time model of topical trends. In: Proceedings of the 12th International Conference on Knowledge Discovery and Data Mining, pp. 424–433 (2006)
Kim, C., Hong, Y.-S.: Trend analysis of data mining research using topic network analysis. J. Korea Soc. Comput. Inf. 11(5), 141–148 (2016)
R: The R Project for Statistical Computing. https://www.r-project.org/
Gruen, B., Hornik, K.: topicmodels: an R package for fitting topic models. J. Stat. Softw. 40(13), 1–29 (2011)
Porter, M.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
Delp, P., Thesen, A., Motiwalla, J., Seshardi, N. (eds.) System Tools for Project Planning. International Development Institute, Bloomington (1977)
Freeman, L.C.: Centrality in social networks: conceptual clarification. Soc. Netw. 1, 215–239 (1979)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sharma, D., Kumar, B., Chand, S. (2018). Trend Analysis of Machine Learning Research Using Topic Network Analysis. In: Panda, B., Sharma, S., Roy, N. (eds) Data Science and Analytics. REDSET 2017. Communications in Computer and Information Science, vol 799. Springer, Singapore. https://doi.org/10.1007/978-981-10-8527-7_4
Download citation
DOI: https://doi.org/10.1007/978-981-10-8527-7_4
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8526-0
Online ISBN: 978-981-10-8527-7
eBook Packages: Computer ScienceComputer Science (R0)