Abstract
This paper proposes a model for predicting the compensation of data science professionals in BRIC nations based on the worldwide Data Science Survey conducted by Kaggle in 2017. In this paper, we have used the Rosling’s approach to adjust the compensation amount in BRIC currencies with respect to Purchasing Power Parity (PPP) units. Exploratory data analysis is used to identify the factors that influence the compensation amount, and an XGBoost algorithm is employed to predict the compensation. We evaluate the performance of the model by generating the Root Mean Squared Log Error (RMSLE) score. The results indicate a robust prediction using the XGBoost algorithm.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Economist: The world’s most valuable resource is no longer oil, but data (2017). https://www.economist.com/news/leaders/21721656-data-economy-demands-new-approach-antitrust-rules-worlds-most-valuable-resource
Best Jobs in America (2017). https://www.glassdoor.com/List/Best-Jobs-in-America-LST_KQ0,20.htm
Columbus, L.: IBM predicts demand for data scientists will soar 28% by 2020 (2017). https://www.forbes.com/sites/louiscolumbus/2017/05/13/ibm-predicts-demand-for-data-scientists-will-soar-28-by-2020/
Kaggle, M.L.: Data science survey (2017). https://www.kaggle.com/kaggle/kaggle-survey-2017
Carnoy, M., et al.: University expansion in a changing global economy: triumph of the BRICS (2013)
Vizgunov, A., Glotov, A., Pardalos, P.M.: Comparative analysis of the BRIC countries stock markets using network approach. In: Proceedings in Mathematics and Statistics, pp. 191–201 (2013)
Mazzioni, S., et al.: The relationship between intangibility and economic performance: study with companies traded in Brazil, Russia, India, China and South africa (BRICS). In: Advances in Scientific and Applied Accounting, pp. 122–148 (2014)
Scott, E.: Higher Education Salary Evaluation Kit. American Association of University Professors (1977)
Moore, N.: Faculty salary equity: issues in regression model selection. Res. High. Educ. 34, 107–126 (1993)
Billard, L.: Study of salary differentials by gender and discipline. Stat. Public Policy 4, 1–14 (2017)
Shmueli, G.: To explain or to predict. Stat. Sci. 25(3), 289–310 (2010)
Jabri, M.: Salary and purchasing power parity (2017). https://www.kaggle.com/mhajabri/salary-and-purchasing-power-parity
Hirst, T., Rosling, H.: How to compare income across countries (2015). http://www.open.edu/openlearn/science-maths-technology/mathematics-and-statistics/how-compare-income-across-countries
Implied PPP conversion rate (2017). http://www.imf.org/external/datamapper/PPPEX@WEO/OEMDC/ADVEC/WEOWORLD/IND?year=2017
Shmueli, G., Bruce, P.C., Patel, N.R.: Data mining for business analytics: concepts, techniques, and applications with XLMiner (2016)
Python API Reference—xgboost 0.6 Documentation. http://xgboost.readthedocs.io/en/latest/python/python_api.html
Jain A.: Complete guide to parameter tuning in XGBoost (with codes in Python) (2016). https://www.analyticsvidhya.com/blog/2016/03/complete-guide-parameter-tuning-xgboost-with-codes-python/
Kaggle Forums (2014). https://www.kaggle.com/general/9933
Pentreath, N., Ghotra, M.S., Dua R.: Machine Learning with Spark, 2nd edn (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Smibi, M.J., Menon, V. (2019). Modeling Compensation of Data Science Professionals in BRIC Nations. In: Abraham, A., Dutta, P., Mandal, J., Bhattacharya, A., Dutta, S. (eds) Emerging Technologies in Data Mining and Information Security. Advances in Intelligent Systems and Computing, vol 755. Springer, Singapore. https://doi.org/10.1007/978-981-13-1951-8_57
Download citation
DOI: https://doi.org/10.1007/978-981-13-1951-8_57
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1950-1
Online ISBN: 978-981-13-1951-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)