College Football Rankings

quick-and-dirty
Published

February 20, 2025

This is a quick-and-dirty implementation and extension of Fiddler on the Proof’s Reasonable Rankings for College Football.

Data

You can skip this part if you’d like.

Code
from  types import SimpleNamespace

import pandas as pd
import duckdb
import numpy as np

import requests
from bs4 import BeautifulSoup

import matplotlib.pyplot as plt
import seaborn as sns

url = "https://www.sports-reference.com/cfb/years/2024-schedule.html#schedule"

response = requests.get(url)

soup = BeautifulSoup(response.content)

# Find the table (inspect the webpage to find the correct table tag and attributes)
table = soup.find("table", id="schedule")

# use these table headers
headers = [
    "week",
    "date",
    "time",
    "day",
    "winner",
    "winner_points",
    "home",
    "loser",
    "loser_points",
    "notes"
]

# Extract table rows
rows = []
for tr in table.find("tbody").find_all("tr"):
    row = []
    for td in tr.find_all("td"):
        row.append(td.text)
    rows.append(row)

# Create pandas DataFrame
scores_raw = (
    pd.DataFrame(rows, columns=headers)
    .dropna(subset=["week"])
)
scores_raw.iloc[0]
week                              1
date                   Aug 24, 2024
time                       12:00 PM
day                             Sat
winner                 Georgia Tech
winner_points                    24
home                              N
loser            (10) Florida State
loser_points                     21
notes                              
Name: 0, dtype: object

Cleanup.

Code
scores = duckdb.query("""
with scores as (
  select
    row_number() over() as game_id
    , trim(regexp_replace(winner, '(\(\d+\))', '')) as winner
    , trim(regexp_replace(loser, '(\(\d+\))', '')) as loser
    , cast(coalesce(nullif(winner_points, ''), '0') as int) as winner_points
    , cast(coalesce(nullif(loser_points, ''), '0') as int) as loser_points
  from scores_raw
  where 1=1 and week is not null
)
select
  *
  , loser_points
  , winner_points - loser_points as diff_points
from scores
where 1=1 
    and winner_points + loser_points > 0
""")

duckdb.query("select * from scores limit 4")
<>:1: SyntaxWarning: invalid escape sequence '\('
<>:1: SyntaxWarning: invalid escape sequence '\('
/var/folders/bm/vmrvtfpd56v9sw0pb7c_rp0w0000gp/T/ipykernel_23445/3745766914.py:1: SyntaxWarning: invalid escape sequence '\('
  scores = duckdb.query("""
┌─────────┬────────────────────┬────────────────┬───────────────┬──────────────┬────────────────┬─────────────┐
│ game_id │       winner       │     loser      │ winner_points │ loser_points │ loser_points_1 │ diff_points │
│  int64  │      varchar       │    varchar     │     int32     │    int32     │     int32      │    int32    │
├─────────┼────────────────────┼────────────────┼───────────────┼──────────────┼────────────────┼─────────────┤
│       1 │ Georgia Tech       │ Florida State  │            24 │           21 │             21 │           3 │
│       2 │ Montana State      │ New Mexico     │            35 │           31 │             31 │           4 │
│       3 │ Southern Methodist │ Nevada         │            29 │           24 │             24 │           5 │
│       4 │ Hawaii             │ Delaware State │            35 │           14 │             14 │          21 │
└─────────┴────────────────────┴────────────────┴───────────────┴──────────────┴────────────────┴─────────────┘

Make the game matrix. We’ll discuss this later.

Code
teams = duckdb.query("""
with teams as (
  select distinct winner as team
  from scores
  union all
  select distinct loser as team
  from scores
)
select team
from teams
order by team
""")

team_mapping = {team: i for i, team in enumerate(teams.to_df()['team']) }

game_matrix = np.zeros((scores.shape[0], teams.shape[0]))
for i, row in scores.to_df().iterrows():
  game_matrix[i, team_mapping[row['winner']]] = 1
  game_matrix[i, team_mapping[row['loser']]] = -1

game_matrix.shape
(919, 367)

What’s a game worth?

To simplify the modeling, we want to map a score-differential to a weight between 0 and 1. This is the function they recommend. It accepts the number of points the winners won by (i.e. a positive number) and returns a weight between 0 and 1 (1 is better than 0).

def weigh(points_won_by, alpha=1):
  return (2 / (1+np.exp(-points_won_by / alpha))) - 1

There has a tunable parameter, alpha, which means we can decide how to weigh different wins, e.g. - when alpha=0, “a win is a win,” so a team that wins by 1 point is just as good as a team that wins by 14 points - when alpha=5, a team that wins by 1 point doesn’t get a strong weight (closer to 0), while a team that wins by 14 points gets a weight of ~0.8.

As alpha increases, teams need larger point differentials to do well.

Code
pd.concat([
  pd.DataFrame({"Won By": np.arange(0, 2*7)})\
    .assign(alpha=a)\
    .assign(Weight=lambda df: weigh(df["Won By"], alpha=a))
  for a in [0, 1, 2, 5]
  ], ignore_index=True)\
  .pipe(lambda df: sns.relplot(data=df, x="Won By", y="Weight", hue="alpha", kind="line"))

So what alpha should we choose? I would work backwards from a score, e.g. “a 14-point (2 touchdowns) win is a decisive win.” If we look at the distribution of points the winner won by, we see 14 points is the median, so that’s a decent sanity check.

Code
sns.displot(data=scores.to_df(), x="diff_points", kind="ecdf")

Using our eyeballs, the plot above shows alpha=2 is a reasonable parameter. Here’s where all the weights would land:

The plot on the left is the diff_score -> weight mapping, and the plot on the right is a flipped ecdf of the resulting weights, so the ecdf is actually on the x-axis.

Code
# I use SimpleNamespace to keep things tidy.
wex = SimpleNamespace()

wex.fig, wex.ax = plt.subplots(1, 2)

wex.data = scores.to_df().assign(weight=lambda df: weigh(df.diff_points, alpha=2))

sns.lineplot(x=wex.data.diff_points, y=wex.data.weight, ax=wex.ax[0])
sns.ecdfplot(y=wex.data.weight, ax=wex.ax[1])

Ranking the Teams

With this we can set up a linear equation:

games_matrix * team_weights = game_weights
^known^^^^^^   ^unknown^^^^   ^known^^^^^^        

Where

games_matrix:
  shape: (num_games, num_teams)
  values: 1 for the winner, -1 for the loser, 0 otherwise

team_weights:
  shape: (num_teams, 1)

game_weights:
  shape: (num_games, 1)
  values: between 0 and 1
Code
num_games = scores.shape[0]
num_teams = teams.shape[0]

assert game_matrix.shape == (num_games, num_teams)
# each winner gets a 1
assert (game_matrix==1).sum() == num_games
# each loser gets a -1
assert (game_matrix==-1).sum() == num_games

example_game_weights = weigh(scores.to_df().diff_points, 2)
assert example_game_weights.shape == (num_games,)
# every value between 0 and 1
assert ((example_game_weights>=0) & (example_game_weights<=1)).sum() == num_games

Solving for this:

Code
def solve_for_team_weights(games, weights, teams=teams):
  pseudoinverse = np.linalg.pinv(games)
  rankings = pseudoinverse @ np.array(weights)

  return pd.Series({
      team: rankings[i]
      for i, team in enumerate(teams.to_df()['team'])
  })

rankings = solve_for_team_weights(
  game_matrix, 
  weigh(scores.to_df().diff_points, 2),
  teams
)

# get top 12
rankings.sort_values(ascending=False).head(12)
Ohio State        1.665191
Oregon            1.548168
Notre Dame        1.468274
Penn State        1.333366
Texas             1.324272
Indiana           1.217931
Georgia           1.100962
Illinois          1.087538
South Carolina    1.084789
Mississippi       1.081674
Brigham Young     1.078441
Michigan          1.046878
dtype: float64

Sensitivity

I’m curious how changing alpha changes the top-12 composition. Recall that increasing alpha means we weigh bigger score differentials more heavily.

As we increase alpha, we see that:

  • Ohio State is always #1
  • Alabama makes it for alpha>2
  • The higher (i.e. worse) seeds are slightly more competitive
  • Miami gets in at alpha~=12
Code
sens = SimpleNamespace()
sens.data = []
sens.fig, sens.ax = plt.subplots()

for a in np.linspace(0, 20, 11):
  sens.tw = (
    solve_for_team_weights(
      game_matrix,
      weigh(scores.to_df().diff_points, a),
      teams
    )
    .sort_values(ascending=False)
    .head(12)
    # I don't like pandas
    .reset_index().rename(columns={"index": "team", 0: "team_score"})
    .assign(alpha=np.round(a), seed=lambda df: df.index+1)
  )
  sens.data.append(sens.tw)

(
  pd.concat(sens.data, ignore_index=True)
  .pivot(index="team", columns="alpha", values="seed")
  # sort by avg rank
  .pipe(lambda df: df.loc[df.max(axis=1).sort_values().index])
  .pipe(lambda df: sns.heatmap(df, annot=True, ax=sens.ax))
)