Advertisement

Modeling Compensation of Data Science Professionals in BRIC Nations

  • M. J. Smibi
  • Vivek MenonEmail author
Conference paper
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 755)

Abstract

This paper proposes a model for predicting the compensation of data science professionals in BRIC nations based on the worldwide Data Science Survey conducted by Kaggle in 2017. In this paper, we have used the Rosling’s approach to adjust the compensation amount in BRIC currencies with respect to Purchasing Power Parity (PPP) units. Exploratory data analysis is used to identify the factors that influence the compensation amount, and an XGBoost algorithm is employed to predict the compensation. We evaluate the performance of the model by generating the Root Mean Squared Log Error (RMSLE) score. The results indicate a robust prediction using the XGBoost algorithm.

References

  1. 1.
  2. 2.
  3. 3.
    Columbus, L.: IBM predicts demand for data scientists will soar 28% by 2020 (2017). https://www.forbes.com/sites/louiscolumbus/2017/05/13/ibm-predicts-demand-for-data-scientists-will-soar-28-by-2020/
  4. 4.
    Kaggle, M.L.: Data science survey (2017). https://www.kaggle.com/kaggle/kaggle-survey-2017
  5. 5.
    Carnoy, M., et al.: University expansion in a changing global economy: triumph of the BRICS (2013)Google Scholar
  6. 6.
    Vizgunov, A., Glotov, A., Pardalos, P.M.: Comparative analysis of the BRIC countries stock markets using network approach. In: Proceedings in Mathematics and Statistics, pp. 191–201 (2013)Google Scholar
  7. 7.
    Mazzioni, S., et al.: The relationship between intangibility and economic performance: study with companies traded in Brazil, Russia, India, China and South africa (BRICS). In: Advances in Scientific and Applied Accounting, pp. 122–148 (2014)Google Scholar
  8. 8.
    Scott, E.: Higher Education Salary Evaluation Kit. American Association of University Professors (1977)Google Scholar
  9. 9.
    Moore, N.: Faculty salary equity: issues in regression model selection. Res. High. Educ. 34, 107–126 (1993)CrossRefGoogle Scholar
  10. 10.
    Billard, L.: Study of salary differentials by gender and discipline. Stat. Public Policy 4, 1–14 (2017)CrossRefGoogle Scholar
  11. 11.
    Shmueli, G.: To explain or to predict. Stat. Sci. 25(3), 289–310 (2010)MathSciNetCrossRefGoogle Scholar
  12. 12.
    Jabri, M.: Salary and purchasing power parity (2017). https://www.kaggle.com/mhajabri/salary-and-purchasing-power-parity
  13. 13.
  14. 14.
  15. 15.
    Shmueli, G., Bruce, P.C., Patel, N.R.: Data mining for business analytics: concepts, techniques, and applications with XLMiner (2016)Google Scholar
  16. 16.
    Python API Reference—xgboost 0.6 Documentation. http://xgboost.readthedocs.io/en/latest/python/python_api.html
  17. 17.
    Jain A.: Complete guide to parameter tuning in XGBoost (with codes in Python) (2016). https://www.analyticsvidhya.com/blog/2016/03/complete-guide-parameter-tuning-xgboost-with-codes-python/
  18. 18.
  19. 19.
    Pentreath, N., Ghotra, M.S., Dua R.: Machine Learning with Spark, 2nd edn (2017)Google Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Department of ManagementAmrita Vishwa VidyapeethamKochiIndia
  2. 2.Department of Computer Science and EngineeringAmrita Vishwa VidyapeethamKollamIndia

Personalised recommendations