Skip to content

lefecko/Soccer_Match_Predictor

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

28 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

⚽ Soccer Match Outcome Predictor

Python Machine Learning Status

A machine learning project that predicts the outcome of professional soccer matches
(Home Win / Draw / Away Win) using historical match data and engineered performance metrics.


📌 Project Overview

This project builds a supervised classification model to predict match results based on:

  • Team form (rolling averages)
  • Goals scored & conceded
  • Home vs Away performance
  • Head-to-head statistics
  • Engineered rating features (e.g., goal differential, Elo rating)

The objective is to explore feature engineering, model selection, and evaluation in a real-world sports analytics setting.


🧠 Problem Statement

Given two teams and their historical performance data, predict the match outcome:

  • 🟢 Home Win
  • 🟡 Draw
  • 🔵 Away Win

This is a multi-class classification problem.


📊 Dataset

The dataset contains:

  • Match date
  • Home team / Away team
  • Full-time goals
  • Shots, shots on target
  • Possession (if available)
  • Rolling performance statistics
  • Engineered features

Possible data sources:

  • Kaggle soccer datasets
  • API-Football

⚙️ Tech Stack

  • Python
  • Pandas
  • NumPy
  • Scikit-learn
  • Matplotlib / Seaborn

🔍 Feature Engineering

Examples of features used:

  • Rolling average goals scored (last 5 matches)
  • Rolling average goals conceded
  • Goal differential
  • Home win percentage
  • Away win percentage
  • Head-to-head win ratio
  • Elo rating difference (optional)

🏗️ Model Pipeline

  1. Data Cleaning
  2. Feature Engineering
  3. Train/Test Split
  4. Model Training
  5. Cross-Validation
  6. Evaluation & Comparison

📈 Evaluation Metrics

Since this is a multi-class classification problem:

  • Accuracy
  • Precision
  • Recall
  • F1 Score
  • Confusion Matrix

About

Primer League predictor model created in foundations of AI/ML Class group project. Contributed to a number of internal algorithms, and was primary author on resulting research paper.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Jupyter Notebook 98.1%
  • Python 1.9%