Generating an Evolving Skills Network from Job Adverts for High-Demand Skillset Discovery
Understanding the needs of highly-dynamic job market sectors is of crucial importance to job seekers, employers, and educational bodies alike. This paper describes efforts to identify skill demand composition and dynamics by constructing and interpreting a time series of skills networks that are routinely identified through an established agglomerative hierarchical clustering with breadth-first search order based on co-word occurrences. We focus on Data Science as an example of a highly dynamic sector. Data collected from job adverts between 2016–2017 is pre-processed to identify distinct evolving skills networks observed over at least 12 months. These result in 40 time-series that are used to track the evolving skills clusters and to define the skillsets in high-demand. To return a quantitative scientific result, we implement three traditional statistical models (Naive, Simple Exponential Smoothing (SES), and Holt’s linear trend) to forecast future skills cluster composition. The analysis is done based on the centrality and density indices generated for each evolving cluster within the skills networks. Forecasts based on the previous quarter(s) are then checked against actual observations in terms of positioning within a density- and centrality-based strategic quadrant. The F-measures observed (75% and 73% for two top methods) demonstrate the suitability of our approach to identify core skillsets in the near future based on recent data with a high level of accuracy.
KeywordsGraph mining Network clustering Time-evolving network
This work has been co-funded by the European Union’s Horizon 2020 research and innovation programme under the QualiChain project, Grant Agreement No 822404.
- 3.Sibarani, E.M., Scerri, S., Morales, C., Auer, S., Collarana, D.: Ontology-guided job market demand analysis: a cross-sectional study for the data science field. In: SEMANTiCS 2017, pp. 25–32. ACM, New York (2017). https://doi.org/10.1145/3132218.3132228
- 6.Xu, X., Yuruk, N., Feng, Z., Schweiger, T.A.J.: SCAN: a structural clustering algorithm for networks. In: KDD 2007, pp. 824–833. ACM, New York (2007). https://doi.org/10.1145/1281192.1281280
- 8.He, Q.: Knowledge discovery through co-word analysis. Libr. Trends 48(1), 133–159 (1999)Google Scholar
- 14.Polanco, X.: Co-word analysis revisited: modelling co-word clusters in terms of graph theory. In: Proceedings of the 10th International Conference on Scientometrics and Informetrics, vol. 2, pp. 662–663 (2005)Google Scholar
- 15.Hyndman, R.J., Athanasopoulos, G.: Forecasting: Principles and Practice. https://otexts.com/fpp2/. Accessed 30 May 2019