Regression to the Mean: Identifying Sustainable vs. Lucky Performance

Regression to the mean is one of the most powerful — and most ignored — concepts in fantasy sports decision-making. It describes the statistical tendency for extreme performances to drift back toward historical averages over time, and understanding it separates managers who win on skill from those who win on coincidence. This page covers the definition, mechanism, practical scenarios, and decision-making thresholds that make regression a usable tool rather than a vague warning.

Definition and scope

Francis Galton first documented regression to the mean in 1886 studying human height, but the principle has since become foundational to any field where performance data is collected over time — including, very much, fantasy sports.

The core idea: when a player posts an extreme result (unusually high or low), part of that result is driven by genuine skill or situation, and part is driven by random variance. Because random variance doesn't persist, subsequent results will tend to be less extreme. A receiver who catches 9 of 12 targets one week isn't necessarily better than a receiver who catches 6 of 12 — the difference may be entirely noise.

The scope matters here. Regression applies most forcefully to statistics that have low "stickiness" — meaning they correlate weakly from one period to the next. Batting average on balls in play (BABIP) in baseball is the textbook example: research aggregated by FanGraphs shows that pitchers have a league-average BABIP of roughly .300, and individual pitcher BABIP fluctuates heavily due to defense and luck before regressing toward that mean. A pitcher posting a .220 BABIP over six starts is almost certainly outperforming sustainable levels.

In football, red zone target share and touchdown rate on receptions are two statistics with weak week-to-week correlation. In basketball, three-point percentage over a short sample (fewer than 40 attempts) is notoriously unreliable.

How it works

Regression isn't a correction mechanism — it's a probabilistic description. The universe doesn't "know" a player has been lucky. What's happening is simpler: extreme outcomes require extreme random contributions, and those contributions are unlikely to repeat at the same magnitude.

A useful mental model breaks any single-game fantasy performance into three components:

True talent level — the player's genuine expected output based on role, matchup, and skill
Situational variance — opponent quality, weather, game script, target volume that game
Random variance — fumble recoveries, uncatchable ball interference, a tipped pass that happens to land in a receiver's hands

Only the first component persists. The other two fluctuate. When a player's box score is built heavily on components 2 and 3, the following week's number will likely look different — not because the player changed, but because the lucky contributions won't recur at the same rate.

This is why advanced stats for fantasy matter more than raw point totals. Expected points added (EPA), yards after contact, and target-to-reception efficiency metrics strip out some situational noise and reveal the underlying true talent signal more reliably.

Common scenarios

Touchdowns without volume: A running back scores 3 touchdowns on 8 carries. The touchdowns inflate his fantasy score to elite levels, but 8 carries is a volume number that belongs to a backup — not a workhorse. Touchdowns correlate weakly with future touchdowns at low carry counts (Benjamin Morris, FiveThirtyEight, "Running Backs and Touchdowns"). The carries tell the real story.

BABIP-inflated batting averages: A hitter posts a .380 BABIP over the first three weeks of the season. League average BABIP for hitters is approximately .300 (FanGraphs reference baseline). Unless the player has exceptional contact quality metrics — hard-hit rate above 45%, low strikeout rate — the batting average will fall.

Hot shooting in basketball: A player shoots 48% from three over the first two weeks on 30 attempts. If his career three-point percentage is 34%, he's almost certainly in the lucky tail of his own distribution. Once the sample expands past 100 attempts, the number will migrate toward his established rate.

Low completion percentage for quarterbacks: The flip side also applies. A quarterback posting a 55% completion rate in Weeks 1–2 may not suddenly be broken — batted balls, drops, and a brutal matchup against a top-5 cornerback can temporarily suppress numbers that are genuinely fine.

The comparison that matters most: touchdowns versus yards versus target share. Target share (or air yards share in football) is a usage metric with stronger week-to-week correlation than touchdowns or yards, which carry higher variance. Managers who prioritize sustainable usage metrics when making start-sit decisions will outperform those chasing last week's box score.

Decision boundaries

Identifying regression risk requires a two-part test:

Sample size: The smaller the sample, the higher the regression expectation. Fewer than 4 games in football or 15 games in baseball is generally insufficient to draw stable conclusions from counting stats alone.
Process vs. outcome: Did the high performance come with high volume and strong underlying efficiency metrics? Or was it touchdowns on low carries, homers on a weak BABIP, and a face-up shooting percentage that doesn't match shot type?

The waiver wire strategy decision becomes cleaner with this framework. A player who produced big numbers in Week 1 on 6 targets is a sell-high candidate, not a must-add. A player who produced 11 targets and 110 yards without a touchdown may be a buy-low opportunity — the volume is real, the touchdown will come.

Managers who understand regression also recalibrate their trade strategy guide accordingly: buy players whose counting stats understate their usage metrics, and sell players riding unsustainable efficiency rates before the market corrects. The fantasy analytics tools category on this site exists precisely to surface this underlying signal beneath the noise of weekly scores.

The central truth is this: sustainable production has a process behind it. Lucky production has a story behind it. Over a full season, the process wins.

Regression to the Mean: Identifying Sustainable vs. Lucky Performance

Definition and scope

How it works

Common scenarios

Decision boundaries

References

Read Next