Step 2 of 7

Meet the dataset

Open the dataset

Your data is in one Google Sheets file with four tabs. Click the link below to open it. Go to File → Make a copy to create your own working version. Close the original. You will only work on your copy.

Open the dataset →

Work only on your copy. Do not edit the original shared sheet.

Understand your columns

Before you write a single formula, spend a few minutes understanding the data. The spreadsheet has two data tabs.

Tab: Match Data

Home team: the country playing in their own country, or designated as the home side for a neutral-venue match.

Away team: the visiting country.

Draw: a match that ended level — both teams scored the same number of goals.

ColumnWhat it containsExample
dateDate the match was played2022-11-22
home_teamCountry playing at homeFrance
away_teamCountry playing awayAustralia
home_scoreGoals scored by the home team4
away_scoreGoals scored by the away team1
tournamentCompetition nameFIFA World Cup
sourceopen-source or estimatedopen-source
resultMatch result (you will fill this in)— empty —
The most important thing to understand: each match is stored as one row. A country can appear in either the home_team column or the away_team column. When you calculate any metric, you need to look at both columns. Every formula you write will need to account for this.
Quiz — answer before moving on

In this scenario: home_team = France, away_team = Australia, home_score = 4, away_score = 1. What happened?

Tab: xG Data

xG stands for Expected Goals — a metric used in professional football analytics to measure the quality of chances a team creates, not just goals scored. A team with consistently high xG is creating genuinely dangerous opportunities. You will use this tab in Metric 4.

ColumnWhat it containsExample
teamCountry nameSpain
confederationFootball regionUEFA
matches_playedMatches tracked for xG10
xg_per_matchExpected goals scored per game1.94
xga_per_matchExpected goals conceded per game0.87

Your analysis tab

Your spreadsheet already includes a tab called My Analysis with a pre-built table. Here is what each column represents:

ColumnWhat it represents
A: TeamCountry name (already filled in)
B: Total MatchesAll matches played between 2020 and 2024
C: WinsNumber of matches won
D: DrawsNumber of matches drawn
E: LossesNumber of matches lost
F: Win Rate (%) — 5 yearsPercentage of matches won across all 5 years
G: Goals ScoredTotal goals scored across all matches
H: Goals ConcededTotal goals conceded across all matches
I: Goal Diff / MatchAverage goal difference per game
J: Last 5 WinsNumber of wins in the last 5 matches
K: Win Rate (%) — Last 5 matchesWin rate across the last 5 matches only
L: MomentumCurrent form compared to long-term average
M: xG / MatchExpected goals per match (VLOOKUP from xG Data)

Note: Do not modify the Match Data or xG Data tabs. All your work goes in My Analysis.