Abstract
Sports analytics is gaining much attention in the research community nowadays. This paper deals with a prominent problem in sports analytics, namely, winning probability calculation. In particular, we focus on the two-team sports. A novel model called TLGProb is proposed by stacking a non-linear regression model – Gaussian process regression (GPR) to address complex association between match outcomes and players’ performances. For evaluation, we selected a popular sports event around the world – National Basketball Association (NBA) as the domain for experiments. Finally, using TLGProb, we correctly predicted 85.28% of outcomes among 1,230 matches in NBA 2014/2015 season.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Breiman, L.: Random Forests. Mach. Learn. 45(1), 5–32 (2001)
Polese, G., Troiano, M., Tortora, G.: A data mining based system supporting tactical decisions. In: 14th International Conference on Software Engineering and Knowledge Engineering, pp. 681–684. ACM (2002)
Gardner, E.S.: Exponential smoothing: the state of the art. J. Forecast. 4(1), 1–28 (1985)
Haghighat, M., Rastegari, H., Nourafza, N.: A review of data mining techniques for result prediction in sports. Adv. Comput. Sci. Int. J. 2(5), 7–12 (2013)
Hausch, D.B., Ziemba, W.T.: Handbook of Sports and Lottery Markets. Elsevier, Amsterdam (2011)
Bhandari, I., Colet, E., Parker, J., Pines, Z., Pratap, R., Ramanujam, K.: Advanced scout: data mining and knowledge discovery in NBA data. Data Min. Knowl. Discov. 1(1), 121–125 (1997)
Kingma, D., Ba, J.: Adam: A method for stochastic optimization (2014)
Lázaro-Gredilla, M., Quiñonero-Candela, J., Rasmussen, C.E., Figueiras-Vidal, A.R.: Sparse spectrum Gaussian process regression. J. Mach. Learn. Res. 11, 1865–1881 (2010)
Beckler, M., Wang, H., Papamichael, M.: NBA Oracle (2013). https://www.mbeckler.org/coursework/2008-2009/10701_report.pdf
Miljković, D., Gajić, L., Kovačević, A., Konjović, Z.: The use of data mining for basketball matches outcomes prediction. In: 2010 IEEE 8th International Symposium on Intelligent Systems and Informatics (SISY), pp. 309–312. IEEE (2010)
Oliver, D.: Basketball on Paper: Rules and Tools for Performance Analysis. Potomac Books Inc., Washington (2004)
Ottaviani, M., Sørensen, P.N.: Surprised by the parimutuel odds? Am. Econ. Rev. 99(5), 2129–2134 (2009)
Rasmussen, C.E., Williams, C.K.: Gaussian Processes for Machine Learning. MIT Press, Cambridge (2006)
Saunders, C., Gammerman, A., Vovk, V.: Ridge regression learning algorithm in dual variables. In: 15th International Conference on Machine Learning, pp. 515–521 (1998)
Simonoff, J.S.: Smoothing Methods in Statistics. Springer Science & Business Media, Heidelberg (2012)
Winston, W.L.: Mathletics: How Gamblers, Managers, and Sports Enthusiasts Use Mathematics in Baseball, Basketball, and Football. Princeton University Press, Princeton (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Lam, M.W.Y. (2017). TLGProb: Two-Layer Gaussian Process Regression Model for Winning Probability Calculation in Two-Team Sports. In: Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L., Zurada, J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2017. Lecture Notes in Computer Science(), vol 10246. Springer, Cham. https://doi.org/10.1007/978-3-319-59060-8_26
Download citation
DOI: https://doi.org/10.1007/978-3-319-59060-8_26
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-59059-2
Online ISBN: 978-3-319-59060-8
eBook Packages: Computer ScienceComputer Science (R0)