Open Access Open Access  Restricted Access Subscription Access

Using Machine Learning to Analyze Football Teams and Predict the Outcome of a Football Match

Adish Golechha, Akshat Muke


Football, as one of the most popular sports on the planet, has always attracted a large number of fans. Over 150 million men and women of all ages play it in over 200 countries. Modern football has seen a paradigm shift from being just one of the most physical sports to now being one of the most complex sports due to the involvement of multiple new factors such as home games, away games, form of an individual, total shots taken, total shots blocked, weather conditions etc. In recent years, the importance of these statistics in football and its usage has been seen by all. In this study, football statistics have been used to predict the outcome of football matches via various machine learning algorithms as well as by analyzing them. A thorough examination of these statistics enables a manager to devise strategies for team management as well as upcoming matches. To calculate accuracy, three Machine Learning algorithms were used: Logistic Regression, Support Vector Machine, and Multinomial Nave Bayes. All three of them were compared and it was observed that the most efficient accuracy was that of SVM which was (61.29%). It was also noted that the accuracy achieved by Logistic Regression (60.9%) was very close to that of SVM.


Football match, machine learning algorithms, support vector machine, logistic regression, multinomial naive bayes

Full Text:



Klaus Rosmanitz. European Football-Soccer. [Online]. English Online AT, Accessed on: April 19, 2022. Available:

Frank Keogh. (2013 Oct 3). Football betting-the global gambling industry worth billions. [Online]. BBC. Accessed on: April 19, 2022. Available:

Hugo Mathien. (2016 Oct). European Soccer Database. [Online]. Kaggle. Accessed on: April 19, 2022. Available:

Football Data Co. UK. Historical Data in England Football. [Online]. Accessed on: April 19, 2022. Available:

Sci-kit Learn. Support Vector Machines. [Online]. Accessed on: April 19, 2022. Available:

Fidell S, Tabachnick B, Mestre V, Fidell L. Aircraft noise-induced awakenings are more reasonably predicted from relative than from absolute sound exposure levels. The Journal of the Acoustical Society of America. 2013 Nov;134(5):3645-53.

Sci-kit Learn. Naive Bayes. [Online]. Accessed on: April 19, 2022. Available: http://scikit-

Stylianos Kampakis, Andreas Adamides. Using Twitter to predict foot-ball outcomes. [Online]. Accessed on: April 19, 2022. Available:

Matheus Kempa. Machine Learning Algorithms for Football Predictions. [Online]. Medium. Accessed on: April 19, 2022. Available: algorithms-for-football-prediction-using-statistics-from-brazilian-championship-51b7d4ea0bc8

Md. Ashiqur Rahman. A deep learning framework for football match prediction. [Online]. Springer Link. Accessed on: April 19, 2022. Available:–1821–5

Mark Lawrenson. (2013 Sep 12). How do handicaps predict outcomes? [Online]. Pinnacle. Accessed on: April 19, 2022. Available:

Siraj Raval. Predicting Winning Teams. [Online]. GitHub. Accessed on: April 19, 2022. Available: Teams/blob/master/Prediction.ipynb

Yoni Lev. The Most Predictable League. Kaggle. Accessed on: April 19, 2022. [Online]. Avail- able:

Razali N, Mustapha A, Clemente FM, Ahmad MF, Salamat MA. Pattern Analysis of Goals Scored in Malaysia Super League 2015. Indones J Electr Eng Comput Sci. 2018; 11(2): 718–724.

Min B, Kim J, Choe C, Eom H, McKay RB. A compound framework for sports results prediction: A football case study, Knowl-Based Syst. 2008; 21(7): 551–562.

Anthony C, Fenton N, Neil M. Pi-football: A Bayesian network model for forecasting Association Football match outcomes. Knowl-Based Syst. 2012; 36: 322–339.



  • There are currently no refbacks.

Copyright (c) 2022 Journal of Communication Engineering & Systems