Introducing an expert system for prediction of soccer player ranking using ensemble learning
Soccer is one of the most played sports in the world, with many individuals involved. Recognizing talented players and selecting a team are challenging tasks for coaches, who need to employ different methods to rank soccer players and select them by their corresponding rank. In this paper, we propose a new web-based approach for ranking soccer players using information available from online sources. The first step is collecting information about players. This information is fetched from the Internet and is preprocessed or augmented by professional users in a web-based expert system. The information is highly dynamic, in the sense that the data change constantly. To build a ranking system for players, machine learning approaches are employed. We apply different classification algorithms to the prepared data and choose the best model among them to rank new players in each state of the dataset. To improve classification results, we propose a weighted ensemble method that uses a genetic algorithm to optimize the weights, and we use this model to predict players' ranks. The ranking is done separately for rank schemes with two, three, or four classes. Experiments were conducted on the Persian premier league and have shown promising results for predicting player ranks, with improvement in accuracy for four-, three-, and two-class predictions. The results show that (1) achieving high performance becomes harder with each level of granularity added to the system's ranking classes; (2) a web-based system can be useful for developing a ranking system in sports; and (3) the new ensemble method is able to improve on the best individual classification model. We believe that our system can help address the challenges of team selection and talent recognition. This belief is supported by the final results of the system and by feedback from professionals.
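The weighted ensemble idea mentioned above can be illustrated with a minimal sketch. The abstract does not specify the genetic operators, the fitness function, or the base classifiers, so everything below is an assumption made for illustration: the function names (`weighted_vote`, `accuracy`, `evolve_weights`), the uniform crossover, the Gaussian mutation, the elitist selection, and the synthetic data are all hypothetical and are not taken from the paper. Seeding the population with one-hot weight vectors and keeping the elite means the evolved ensemble is, on the data used for fitness, never worse than the best single model.

```python
import numpy as np

def weighted_vote(probas, weights):
    """Predict classes from a weighted sum of per-model class probabilities.
    probas has shape (n_models, n_samples, n_classes)."""
    combined = np.tensordot(np.asarray(weights, dtype=float), probas, axes=1)
    return combined.argmax(axis=1)

def accuracy(probas, weights, y):
    """Fraction of samples the weighted ensemble classifies correctly."""
    return float(np.mean(weighted_vote(probas, weights) == y))

def evolve_weights(probas, y, pop_size=30, generations=40,
                   mutation_scale=0.1, seed=0):
    """Hypothetical GA: elitist selection, uniform crossover, Gaussian mutation."""
    rng = np.random.default_rng(seed)
    n_models = probas.shape[0]
    pop = rng.random((pop_size, n_models))
    # Seed with one-hot vectors so every single model is a candidate solution.
    k = min(n_models, pop_size)
    pop[:k] = np.eye(n_models)[:k]
    for _ in range(generations):
        fitness = np.array([accuracy(probas, w, y) for w in pop])
        elite = pop[np.argsort(fitness)[::-1][: pop_size // 2]]
        n_children = pop_size - len(elite)
        parents_a = elite[rng.integers(len(elite), size=n_children)]
        parents_b = elite[rng.integers(len(elite), size=n_children)]
        # Uniform crossover: each weight comes from one parent or the other.
        mask = rng.random(parents_a.shape) < 0.5
        children = np.where(mask, parents_a, parents_b)
        # Gaussian mutation, clipped so weights stay non-negative.
        children = np.clip(
            children + rng.normal(0.0, mutation_scale, children.shape), 0.0, None)
        pop = np.vstack([elite, children])
    fitness = np.array([accuracy(probas, w, y) for w in pop])
    return pop[fitness.argmax()], float(fitness.max())

# Synthetic stand-in for three base classifiers on a three-class problem
# (assumed data, not the paper's dataset).
data_rng = np.random.default_rng(1)
y = data_rng.integers(0, 3, size=200)
onehot = np.eye(3)[y]
scores = np.stack([onehot + data_rng.normal(0.0, s, onehot.shape)
                   for s in (0.6, 0.8, 1.0)])
probas = np.exp(scores) / np.exp(scores).sum(axis=2, keepdims=True)

weights, ensemble_acc = evolve_weights(probas, y)
best_single_acc = max(accuracy(probas, np.eye(3)[i], y) for i in range(3))
print("evolved weights:", weights)
print("ensemble accuracy:", ensemble_acc, "best single model:", best_single_acc)
```

In this sketch the fitness is measured on the same samples used to evolve the weights; a practical setup would more likely evaluate fitness on a held-out validation split to avoid overfitting the weights.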
Keywords: Ensemble learning, Sports information system, Expert system, Genetic algorithm, Soccer player ranking
The authors would like to thank Mr. Majid Rezaei and the other coaches and national soccer players for their expert opinions and comments and for their involvement in building our dataset.
Compliance with ethical standards
Conflict of interest
All authors declare that they have no conflict of interest. The authors alone are responsible for the content and writing of the paper.