back to work
arsenal-forecast

sports analytics and probabilistic forecasting

arsenal-forecast

A probabilistic machine learning system that forecasts Arsenals 2025/26 season outcomes across Premier League and UEFA Champions League using Monte Carlo simulation.

pythonscikit-learnmonte carlogradient boosting

metric

65% title probability

view repository

overview

A football forecasting model for Arsenal EPL and UCL title race title race.

challenge

Transform colloquial notions of "winning vs bottling" into rigorous, data-driven probabilities while accounting for fixture difficulty, current form, and multi-competition dynamics.

approach

Built a gradient boosting classifier trained on 2022-2024 EPL data with features including team strength, rolling form, and home advantage. Ran 10,000 Monte Carlo simulations per competition to estimate title probabilities, then combined EPL and UCL forecasts for multi-trophy scenario analysis.

outcome

Delivered 65% EPL title probability for Arsenal vs 24% for Man City, 32% UCL win probability, and 21% chance of winning the double. The system reveals that Arsenal can afford one slip-up while City needs near-perfection.

stack

python, monte-carlo, EPL and UCL datasets

highlights

10,000 Monte Carlo simulations per competition
Match-level probability predictions with opponent strength weighting
Multi-trophy scenario analysis (double, single trophy, neither)
54% model accuracy on historical data validation