Premier League Rule Impact Analysis

RPythonExcelR MarkdownJupyter Notebooktidyversedplyrggplot2Data WranglingStatistical Testing (t-test, z-test, CI)

Analyzed EPL 2021-22 to 2023-24 match and event datasets to quantify IFAB timekeeping-rule impact. Built a reproducible R Markdown and notebook workflow for cleaning, event standardization, extra-time and late-goal trend analysis, and 4 inferential tests (t-test, z-test, confidence intervals, and correlation checks).

About the Project

Outcome-focused analysis showing a clear late-game shift after rule changes, including higher scoring and more goals after 90 minutes. The project turns raw match-event logs into evidence-backed insights and publishes them in a live, recruiter-friendly report.

Key Features

  • Led a 6-member analytics effort and combined three-season match/event datasets (2021-22 to 2023-24)
  • Expanded nested event strings into structured columns (Match_Id, Home_Away, Event_Minute, Event_Type)
  • Standardized event taxonomy and cleaned inconsistent numeric/date fields
  • Built visual analysis for cumulative goals, total goals, added-time distributions, and goals after 90 minutes
  • Validated impact with 4 statistical tests and found a 15% rise in overall scoring in 2023-24
  • Confirmed a 70% rise in late goals with 30% more fouls and 20% more substitutions, then published live
Premier League Rule Impact Analysis | Mayur Bijarniya