Recruitment Data Analysis Using Machine Learning in R Studio

  • R. DevakunchariEmail author
  • Niketha Anand
  • Anusha Vedhanayaki
  • Y. J. Visishta
Conference paper
Part of the Lecture Notes in Networks and Systems book series (LNNS, volume 98)


Job classification is a system used to divide all jobs within a company and put them on a different regulated scale based on the overall work, salary level, skills, and commitment associated with a clear cut job. This likewise encourages organizations to look at similar jobs in various organizations inside their industry. Since the knowledge base is huge, it is difficult to physically identify the errors. Therefore, two machine learning algorithms namely Support Vector Machine and Random Forest are used to calculate the error percentage. The algorithm with the least error percentage is implemented to recruit employees. Our analysis overall shows that the random forest is more efficient than SVM.


Support Vector Machine Multi class classification Random forest R studio Machine learning 


  1. 1.
    Langley, P.: Editorial: On Machine Learning, University of California. Springer (2001)Google Scholar
  2. 2.
    Harve, R.: Machine Learning in Recruitment & How To Do It Right.
  3. 3.
    Jatain, A., Ranjan, A.: A review study on big data analysis using R studio. Int. J. Comput. Sci. Mob. Comput. 6, 8–13 (2017)Google Scholar
  4. 4.
  5. 5.
    Alghamlas, M., Alabduljabbar, R.: Predicting the suitability of IT students’ skills for the recruitment in Saudi labor market. In: 2nd International Conference on Computer Applications & Information Security (ICCAIS), pp. 1–5, Saudi Arabia (2019)Google Scholar
  6. 6.
    Sisodia, D.S., Vishwakarma, S., Pujahari, A.: Evaluation of machine learning models for employee churn prediction. In: International Conference on Inventive Computing and Informatics (ICICI), pp. 1016–1020, Coimbatore (2017)Google Scholar
  7. 7.
    Chern, A., Liu, Q., Chao, J., et al.: Automatically detecting errors in employer industry classification using job postings. Data Sci. Eng. 3, 221 (2018)CrossRefGoogle Scholar
  8. 8.
    Duan, K., Keerthi, S.: Which is the best multiclass svm method? an empirical study. LNCS, vol. 3541, pp 278–285. Springer (2005)Google Scholar
  9. 9.
    Amudha, P., Sivakumari, S.: Big data analytics using support vector machine. In: International Conference on Soft-computing and Network Security (ICSNS) 2018, pp. 1–6. IEEE, Coimbatore (2018)Google Scholar
  10. 10.
    Dai, H.: Research on SVM improved algorithm for large data classification. In: 3rd International Conference on Big Data Analysis. IEEE (2018)Google Scholar
  11. 11.
    Hsu, C.W., Lin, C.J.: A comparison of methods for multi-class support vector machines. IEEE Trans. Neural Netw. 13, 415–425 (2002)CrossRefGoogle Scholar
  12. 12.
    Li, T., Zhang, C., Ogihara, M.: A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression. Bioinformatic 20, 2429–2437 (2004)CrossRefGoogle Scholar
  13. 13.
    Yang, Y., Liu, R., Chen, Y., Li, T., Tang, Y.: Normal cloud model-based algorithm for multi-attribute trusted cloud service selection. IEEE Access 6, 37644–37652 (2018)CrossRefGoogle Scholar
  14. 14.
    Zhu, M., Xia, J., Jin, X., Yan, M., Cai, G., Yan, J., Ning, G.: Class weights random forest algorithm for processing class imbalanced medical data. IEEE Access 6, 4641–4652 (2018)CrossRefGoogle Scholar
  15. 15.
    Díaz-Uriarte, R., Alvarez de Andrés, S.: Gene selection and classification of microarray data using random forest. BMC Bioinf. 7(1), 3 (2006)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  • R. Devakunchari
    • 1
    Email author
  • Niketha Anand
    • 1
  • Anusha Vedhanayaki
    • 1
  • Y. J. Visishta
    • 1
  1. 1.Department of Computer Science EngineeringSRMISTChennaiIndia

Personalised recommendations