Machine Learning on Soccer Player Positions

Machine Learning on Soccer Player Positions

Umberto Di Giacomo, Francesco Mercaldo, Antonella Santone, Giovanni Capobianco
Copyright: © 2022 |Pages: 19
DOI: 10.4018/IJDSST.286678
Article PDF Download
Open access articles are freely available for download

Abstract

During the last few years, sports analytics has been growing rapidly. The main usage of this discipline is the prediction of soccer match results, even if it can be applied with interesting results in different areas, such as analysis based on the player position information. In this paper, the authors propose an approach aimed to recognize the player position in a soccer match, predicting the specific zone in which the player is located in a specific moment. Similar objectives have not yet been considered. The authors consider supervised machine learning techniques by considering a dataset obtained through video capturing and tracking system. The data analyzed refer to several professional soccer games captured at the Alfheim Stadium in Tromso, Norway. The approach can be used in real time in order to verify if a player is playing according to the guidelines of the coach. In the experimental analysis, three different types of classification have been performed (i.e., three different divisions of the field), reaching the best results with random tree algorithm.
Article Preview
Top

Sport analytics refers to the use of data and advanced statistics, for example machine learning techniques, to measure performance with the aim to take informed decisions and gain a competitive sports advantage. In other words, sport analytics is the practice of applying mathematical and statistical principles to different sports, such as baseball (Dietrich et al., 2014), basketball (Jain and Kaur,2017) and hockey (Liu and Schulte,2018). In soccer, the most usage is about the prediction of results and the definition of strategies that can be used to win a game or to obtain an improvement of the team performances. Usually, the models constructed in these analysis are based on several aspects about the game, such as tactical, technical or physical information.

However, although each sport has its own characteristics, sport analytics uses the same basic methods and approaches as any other kind of data analysis and, when properly applied, can yield tremendous competitive advantages to a team or an individual player.

The analysis that can be performed with sport analytics is typically divided into two different parts: bio-mechanical and notational analysis (Hughes and Franks,2004). Both techniques involve the analysis and improvement of the sport performance giving good feedbacks to coaches and athletes. Sport biomechanics is concerned with fine detail about individual sport techniques while, on the other hand, notational analysis is more concerned with gross movements or movement patterns in games or teams, and is primarily interested in strategy and tactics. These types of analysis are useful because, if we consider the well- chosen performance indicators to evaluate a specific game, it can be possible to highlight advantageous strategy or important aspect of team performance. In other words, they help coaches to identify good and bad performances of team member or the whole team. (Bartlett,2001).

As we have said before, one of the most widespread use of sport analytics, in soccer environment, is related to the prediction of soccer match results. In literature there are several works focused about the most important factors that influence the results of a game. In (Capobianco et al.,2019), the authors propose a new feature set aimed to model a soccer match. The set is related to characteristics obtainable not only at the end of the match, and it is used to predict the results of the match and the number of goal

scored by the team that won the game. In (Joseph et al.,2006), an approach based on Bayesian Networks to predict match results has been presented. The analysis showed that the Bayesian networks is generally superior to other techniques such as the MC4, a decision tree learner, naive Bayesian learner (NB), and k-nearest neighbor learner (KNN) for this domain in terms of predictive accuracy. Specifically, authors obtain an accuracy equal to 59% which outperformed other machine learning models i.e., 41.7% (obtained by the MC4 classification algorithm), 47.86% (with the NB algorithm) and 50.58% (with the KNN algorithm). A similar analysis has been proposed in (Liti et al.,2017), where the authors predict the outcome of soccer matches finished with a draw at the end of the first half using the information stored during the first part of the match; while, in (Razali et al.,2017) a Bayesian Network approach to predict the outcome of English Premier League matches has been constructed. In (Berrar and Dubitsky,2019) the authors suggest that a key factor in soccer match outcome prediction lies in the successful incorporation of domain knowledge into the machine learning modeling process.

In soccer, there are other types of work concerning specific aspects of the game or player performance analysis. For example, in (Fernandez and Cervone,2019) it has been presented a model that quantifies the expected outcome of a soccer possession at any time during the possession, driven by a fine-grained evaluation of the full spatio-temporal characteristics of the 22 players and the ball. In (Kharrat et al.,2017) the authors try to examine who are the best players in European football, and demonstrate how the players’ ratings evolve over time, using plus-minus rating. Finally, in (Schultze and Wellbrock,2017) it has been proposed a weighted plus/minus metric to be used as an instrument to evaluate player performance.

Complete Article List

Search this Journal:
Reset
Volume 16: 1 Issue (2024)
Volume 15: 2 Issues (2023)
Volume 14: 4 Issues (2022): 1 Released, 3 Forthcoming
Volume 13: 4 Issues (2021)
Volume 12: 4 Issues (2020)
Volume 11: 4 Issues (2019)
Volume 10: 4 Issues (2018)
Volume 9: 4 Issues (2017)
Volume 8: 4 Issues (2016)
Volume 7: 4 Issues (2015)
Volume 6: 4 Issues (2014)
Volume 5: 4 Issues (2013)
Volume 4: 4 Issues (2012)
Volume 3: 4 Issues (2011)
Volume 2: 4 Issues (2010)
Volume 1: 4 Issues (2009)
View Complete Journal Contents Listing