This is a continuation of a post from yesterday. I figured we needed some good numbers on the relationship between recruiting and performance, and I was uniquely positioned to do the analysis. If you want details on methodology or more results, check out the other post.
Using recruiting results and performance between 2006 and 2013, I estimated the impact of recruits on performance. Now I want to address an obvious follow up question: which teams have outperformed their recruiting classes and which teams have done worse. The table below has the team's performance in terms of points per game above or below what was predicted for them based on the strength of their recruiting classes. I break the results into two groups - 2006 to 2009 and 2010 to 2013.
We all could have guessed the teams that standout as over-performers. Boise State was first for '06-'09 and second for '10-'13, only because they were passed by Oregon. Boise State, Utah, BYU make the list for both periods, and Utah States makes the latter list; recruiting services might be missing something in the Utah/Idaho area. Oregon and Oregon State make both lists, as does Nevada. Navy and Air Force are both in the top six in '06-'09 and Navy makes the list again in '10-'13. Go America! And you know Saban has a good thing going when Alabama tops the recruiting ranks and is still among the top programs for performance relative to recruited talent.
We should take a moment and put Boise State in perspective. I found that a 4 or 5 star recruit is worth a little more than 1/2 a point per game for the next four years (I'll use .6 for this example). Boise State has been about 20 points per game better than their recruiting would suggest. This means the team was playing as though they had an additional (20/.6=) 33.3 4 or 5 star recruits on the team, or 8-9 more per class.
The under-performing list is dominated by dumpster fires, teams that no one thought would be any good, but turned out to be even worse. A few programs stand out: Michigan, Notre Dame and Florida State for the '06-'09 period and Texas for the '10-'13 period. Texas A&M and Utah State have the proud distinction of moving from under- to over-performing. Eastern Michigan, FIU, Idaho, Memphis, Miami (Ohio), New Mexico State, Tulane and UAB have the less-proud distinction of making both lists. Central Michigan fell from over- to under-performing.
BPR | A system for ranking teams based only one wins and losses and strength of schedule. See BPR for an explanation. |
EPA (Expected Points Added) | Expected points are the points a team can "expect" to score based on the distance to the end zone and down and distance needed for a first down, with an adjustment for the amount of time remaining in some situations. Expected points for every situation is estimated using seven years of historical data. The expected points considers both the average points the offense scores in each scenario and the average number of points the other team scores on their ensuing possession. The Expected Points Added is the change in expected points before and after a play. |
EP3 (Effective Points Per Possession) | Effective Points Per Possession is based on the same logic as the EPA, except it focuses on the expected points added at the beginning and end of an offensive drive. In other words, the EP3 for a single drive is equal to the sum of the expected points added for every offensive play in a drive (EP3 does not include punts and field goal attempts). We can also think of the EP3 as points scored+expected points from a field goal+the value of field position change on the opponent's next possession. |
Adjusted for Competition | We attempt to adjust some statistics to compensate for differences in strength of schedule. While the exact approach varies some from stat to stat the basic concept is the same. We use an algorithm to estimate scores for all teams on both sides of the ball (e.g., offense and defense) that best predict real results. For example, we give every team an offensive and defensive yards per carry score. Subtracting the offensive score from the defensive score for two opposing teams will estimate the yards per carry if the two teams were to play. Generally, the defensive scores average to zero while offensive scores average to the national average, e.g., yards per carry, so we call the offensive score "adjusted for competition" and roughly reflects what the team would do against average competition |
Impact | see Adjusted for Competition. Impact scores are generally used to evaluate defenses. The value roughly reflects how much better or worse a team can expect to do against this opponent than against the average opponent. |
[-] About this table
Includes the
top 180 QBs by total plays
Total <=0 | Percent of plays that are negative or no gain |
Total >=10 | Percent of plays that gain 10 or more yards |
Total >=25 | Percent of plays that gain 25 or more yards |
10 to 0 | Ratio of Total >=10 to Total <=0 |
Includes the
top 240 RBs by total plays
Total <=0 | Percent of plays that are negative or no gain |
Total >=10 | Percent of plays that gain 10 or more yards |
Total >=25 | Percent of plays that gain 25 or more yards |
10 to 0 | Ratio of Total >=10 to Total <=0 |
Includes the
top 300 Receivers by total plays
Total <=0 | Percent of plays that are negative or no gain |
Total >=10 | Percent of plays that gain 10 or more yards |
Total >=25 | Percent of plays that gain 25 or more yards |
10 to 0 | Ratio of Total >=10 to Total <=0 |
Includes
the
top 180 players by pass attempts)
3rdLComp% |
Completion % on 3rd and long (7+
yards) |
SitComp% |
Standardized completion % for
down and distance. Completion % by down and distance are weighted by
the national average of pass plays by down and distance. |
Pass <=0 | Percent of pass plays that are negative or no gain |
Pass >=10 | Percent of pass plays that gain 10 or more yards |
Pass >=25 | Percent of pass plays that gain 25 or more yards |
10 to 0 | Ratio of Pass >=10 to Pass<=0 |
%Sacks |
Ratio of sacks to pass plays |
Bad INTs |
Interceptions on 1st or 2nd down
early before the last minute of the half |
Includes the top 240 players by carries
YPC1stD |
Yards per carry on 1st down |
CPCs |
Conversions (1st down/TD) per
carry in short yardage situations - the team 3 or fewer yards for a 1st
down or touchdown |
%Team Run |
Player's carries as a percent of team's carries |
%Team RunS |
Player's carries as a percent of team's carries in short
yardage situations |
Run <=0 |
Percent of running plays that
are negative or no gain |
Run >=10 |
Percent of running plays that
gain 10 or more yards |
Run >=25 | Percent of running plays that gain 25 or more yards |
10 to 0 | Ratio of Run >=10 to Run <=0 |
Includes the top 300 players by targets
Conv/T 3rd | Conversions per target on 3rd Downs |
Conv/T PZ | Touchdowns per target inside the 10 yardline |
%Team PZ | Percent of team's targets inside the 10 yardline |
Rec <=0 | Percent of targets that go for negative yards or no net gain |
Rec >=10 | Percent of targets that go for 10+ yards |
Rec >=25 | Percent of targets that go for 25+ yards |
10 to 0 | Ratio of Rec>=0 to Rec<=0 |
Includes the top 300 players by targets
xxxx | xxxx |
...
Includes players with a significant number of attempts
NEPA | "Net Expected Points Added": (expected points after play - expected points before play)-(opponent's expected points after play - opponent's expected points before play). Uses the expected points for the current possession and the opponent's next possession based on down, distance and spot |
NEPA/PP | Average NEPA per play |
Max/Min | Single game high and low |
Includes players with a significant number of attempts
NEPA | "Net Expected Points Added": (expected points after play - expected points before play)-(opponent's expected points after play - opponent's expected points before play). Uses the expected points for the current possession and the opponent's next possession based on down, distance and spot |
NEPA/PP | Average NEPA per play |
Max/Min | Single game high and low |
Adjusted | Reports the per game EPA adjusted for the strength of schedule. |
Defensive Possession Stats
Points/Poss | Offensive points per possession |
EP3 | Effective Points per Possession |
EP3+ | Effective Points per Possession impact |
Plays/Poss | Plays per possession |
Yards/Poss | Yards per possession |
Start Spot | Average starting field position |
Time of Poss | Average time of possession (in seconds) |
TD/Poss | Touchdowns per possession |
TO/Poss | Turnovers per possession |
FGA/Poss | Attempted field goals per possession |
%RZ | Red zone trips per possession |
Points/RZ | Average points per red zone trip. Field Goals are included using expected points, not actual points. |
TD/RZ | Touchdowns per red zone trip |
FGA/RZ | Field goal attempt per red zone trip |
Downs/RZ | Turnover on downs per red zone trip |
Defensive Play-by-Play Stats
EPA/Pass | Expected Points Added per pass attempt |
EPA/Rush | Expected Points Added per rush attempt |
EPA/Pass+ | Expected Points Added per pass attempt impact |
EPA/Rush+ | Expected Points Added per rush attempt impact |
Yards/Pass | Yards per pass |
Yards/Rush | Yards per rush |
Yards/Pass+ | Yards per pass impact |
Yards/Rush+ | Yards per rush impact |
Exp/Pass | Explosive plays (25+ yards) per pass |
Exp/Rush | Explosive plays (25+ yards) per rush |
Exp/Pass+ | Explosive plays (25+ yards) per pass impact |
Exp/Rush+ | Explosive plays (25+ yards) per rush impact |
Comp% | Completion percentage |
Comp%+ | Completion percentage impact |
Yards/Comp | Yards per completion |
Sack/Pass | Sacks per pass |
Sack/Pass+ | Sacks per pass impact |
Sack/Pass* | Sacks per pass on passing downs |
INT/Pass | Interceptions per pass |
Neg/Rush | Negative plays (<=0) per rush |
Neg/Run+ | Negative plays (<=0) per rush impact |
Run Short | % Runs in short yardage situations |
Convert% | 3rd/4th down conversions |
Conv%* | 3rd/4th down conversions versus average by distance |
Conv%+ | 3rd/4th down conversions versus average by distance impact |
Offensive Play-by-Play Stats
Plays | Number of offensive plays |
%Pass | Percent pass plays |
EPA/Pass | Expected Points Added per pass attempt |
EPA/Rush | Expected Points Added per rush attempt |
EPA/Pass+ | Expected Points Added per pass attempt adjusted for competition |
EPA/Rush+ | Expected Points Added per rush attempt adjusted for competition |
Yards/Pass | Yards per pass |
Yards/Rush | Yards per rush |
Yards/Pass+ | Yards per pass adjusted for competition |
Yards/Rush+ | Yards per rush adjusted for competition |
Exp Pass | Explosive plays (25+ yards) per pass |
Exp Run | Explosive plays (25+ yards) per rush |
Exp Pass+ | Explosive plays (25+ yards) per pass adjusted for competition |
Exp Run+ | Explosive plays (25+ yards) per rush adjusted for competition |
Comp% | Completion percentage |
Comp%+ | Completion percentage adjusted for competition |
Sack/Pass | Sacks per pass |
Sack/Pass+ | Sacks per pass adjusted for competition |
Sack/Pass* | Sacks per pass on passing downs |
Int/Pass | Interceptions per pass |
Neg/Run | Negative plays (<=0) per rush |
Neg/Run+ | Negative plays (<=0) per rush adjusted for competition |
Run Short | % Runs in short yardage situations |
Convert% | 3rd/4th down conversions |
Conv%* | 3rd/4th down conversions versus average by distance |
Conv%+ | 3rd/4th down conversions versus average by distance adjusted for competition |
Offensive Possession Stats
Points/Poss | Offensive points per possession |
EP3 | Effective Points per Possession |
EP3+ | Effective Points per Possession adjusted for competition |
Plays/Poss | Plays per possession |
Yards/Poss | Yards per possession |
Start Spot | Average starting field position |
Time of Poss | Average time of possession (in seconds) |
TD/Poss | Touchdowns per possession |
TO/Poss | Turnovers per possession |
FGA/Poss | Attempted field goals per possession |
Poss/Game | Possessions per game |
%RZ | Red zone trips per possession |
Points/RZ | Average points per red zone trip. Field Goals are included using expected points, not actual points. |
TD/RZ | Touchdowns per red zone trip |
FGA/RZ | Field goal attempt per red zone trip |
Downs/RZ | Turnover on downs per red zone trip |
PPP | Points per Possession |
aPPP | Points per Possession allowed |
PPE | Points per Exchange (PPP-aPPP) |
EP3+ | Expected Points per Possession |
aEP3+ | Expected Points per Possession allowed |
EP2E+ | Expected Points per Exchange |
EPA/Pass+ | Expected Points Added per Pass |
EPA/Rush+ | Expected Points Added per Rush |
aEPA/Pass+ | Expected Points Allowed per Pass |
aEPA/Rush+ | Expected Points Allowed per Rush |
Exp/Pass | Explosive Plays per Pass |
Exp/Rush | Explosive Plays per Rush |
aExp/Pass | Explosive Plays per Pass allowed |
aExp/Rush | Explosive Plays per Rush allowed |
BPR | A method for ranking conferences based only on their wins and losses and the strength of schedule. See BPR for an explanation. |
Power | A composite measure that is the best predictor of future game outcomes, averaged across all teams in the conference |
P-Top | The power ranking of the top teams in the conference |
P-Mid | The power ranking of the middling teams in the conference |
P-Bot | The power ranking of the worst teams in the conference |
SOS-Und | Strength of Schedule - Undefeated. Focuses on the difficulty of going undefeated, averaged across teams in the conference |
SOS-BE | Strength of Schedule - Bowl Eligible. Focuses on the difficulty of becoming bowl eligible, averaged across teams in the conference |
Hybrid | A composite measure that quantifies human polls, applied to converences |
Player Game Log
Use the yellow, red and green cells to filter values. Yellow cells filter for exact matches, green cells for greater values and red cells for lesser values. By default, the table is filtered to only the top 200 defense-independent performances (oEPA). The table includes the 5,000 most important performances (positive and negative) by EPA.
Use the yellow, red and green cells to filter values. Yellow cells filter for exact matches, green cells for greater values and red cells for lesser values. By default, the table is filtered to only the top 200 defense-independent performances (oEPA). The table includes the 5,000 most important performances (positive and negative) by EPA.
EPA | Expected points added (see glossary) |
oEPA | Defense-independent performance |
Team Game Log
Use the yellow, red and green cells to filter values. Yellow cells filter for exact matches, green cells for greater values and red cells for lesser values.
Use the yellow, red and green cells to filter values. Yellow cells filter for exact matches, green cells for greater values and red cells for lesser values.
EP3 | Effective points per possession (see glossary) |
oEP3 | Defense-independent offensive performance |
dEP3 | Offense-independent defensive performance |
EPA | Expected points added (see glossary) |
oEPA | Defense-independent offensive performance |
dEPA | Offense-independent defensive performance |
EPAp | Expected points added per play |
Thursday, September 18, 2014
Wednesday, September 17, 2014
Recruiting and Performance
Executive Summary:
- 4 and 5 star recruits are worth about 1/2 a point per game on average for the next four years.
- That's about 25 points total
- By recruiting, the top 5 in 2014 should be
- #1 Alabama
- #2 Ohio State
- #3 Florida State
- #4 LSU
- #5 Florida
- Consistent over and underperformers
****If you're only interested in results, you can skip this part****
A note on methods - I measure the independent variable (recruiting) using 247sports composite rankings. I use both a count for 4 and 5 star signings and "points" - 247sports has a clever method for ranking classes that awards points for each recruit, with diminishing returns as class size increases. I consider the past four recruiting classes (e.g., for the 2013 season, I use the 2013, 2012, 2011 and 2010 recruiting classes). Unfortunately, there are not enough 5 star recruits to use this as a unique variable, so in most cases I combine 4 and 5 star recruits.
I measure the dependent variable (performance) using an algorithm that converts game scores into a measure of team performance independent of schedule strength and game site (i.e., home/road). I exclude bowl games and games against FCS competition.
I have performance data from 2006 and recruiting data from 2003. From this, I can match 939 teams over that period on recruiting and performance (117 teams per season). I use regression to estimate performance from recruiting metrics.
Results (for the nerds amongst us):
Stars is the count of four and five stars signed in that year, Stars (t-?) is the number of years between that class and the season in question. For example, Stars (t-1) refers to the class signed earlier that year (the 2013 class when talking about performance in 2013) and Stars (t-4) is the class three years earlier (the 2010 class when talking about performance in 2010). The same is true of the second model, where I use points instead of star counts.
****Everyone should start reading here****
First, when we use 247sports points to predict performance, we get an R^2 of .38. This means that this recruiting measure explains about 38% of the total variation in a team's performance. Player evaluation, coaching, and luck (e.g., injuries) should explain most of the remaining 62%.
On average, a team is .68 points better per game for each freshman 4 or 5 star recruit, about .57 points better for each sophomore and junior, and .32 points better for each senior. The big drop from junior to senior is most likely a product of the best juniors leaving early for the draft (or otherwise leaving the program).
Using the coefficients from the second model, I plugged recruiting results from 2011, 2012, 2013, and 2014 to predict performance in 2014. I list the top 50 below.
The top 10 is a who's who of college football. It shouldn't surprise anyone that Alabama comes out on top with 18 5 stars in the last four classes. Ohio State's position at #2 is mostly a product of 48 4 stars in the last three classes; that is more than the bottom 2/3 of programs combined.
Some teams are over-performing their recruiting this season, others failing to meet expectations. Boston College at #69 beat #9 USC. BYU (#71) manhandled Texas (#10). And we were supposed to believe it was an upset when #27 Virginia beat #38 Louisville. Oklahoma is only one spot ahead of Tennessee by recruiting; the gap on the field is much larger. I don't think we will see Florida at #5 or Michigan at #11 at the end of the season. Oregon (#18) and Baylor (#31), on the other hand, are doing more with less.
I measure the dependent variable (performance) using an algorithm that converts game scores into a measure of team performance independent of schedule strength and game site (i.e., home/road). I exclude bowl games and games against FCS competition.
I have performance data from 2006 and recruiting data from 2003. From this, I can match 939 teams over that period on recruiting and performance (117 teams per season). I use regression to estimate performance from recruiting metrics.
Results (for the nerds amongst us):
Stars is the count of four and five stars signed in that year, Stars (t-?) is the number of years between that class and the season in question. For example, Stars (t-1) refers to the class signed earlier that year (the 2013 class when talking about performance in 2013) and Stars (t-4) is the class three years earlier (the 2010 class when talking about performance in 2010). The same is true of the second model, where I use points instead of star counts.
****Everyone should start reading here****
First, when we use 247sports points to predict performance, we get an R^2 of .38. This means that this recruiting measure explains about 38% of the total variation in a team's performance. Player evaluation, coaching, and luck (e.g., injuries) should explain most of the remaining 62%.
On average, a team is .68 points better per game for each freshman 4 or 5 star recruit, about .57 points better for each sophomore and junior, and .32 points better for each senior. The big drop from junior to senior is most likely a product of the best juniors leaving early for the draft (or otherwise leaving the program).
Using the coefficients from the second model, I plugged recruiting results from 2011, 2012, 2013, and 2014 to predict performance in 2014. I list the top 50 below.
The top 10 is a who's who of college football. It shouldn't surprise anyone that Alabama comes out on top with 18 5 stars in the last four classes. Ohio State's position at #2 is mostly a product of 48 4 stars in the last three classes; that is more than the bottom 2/3 of programs combined.
Some teams are over-performing their recruiting this season, others failing to meet expectations. Boston College at #69 beat #9 USC. BYU (#71) manhandled Texas (#10). And we were supposed to believe it was an upset when #27 Virginia beat #38 Louisville. Oklahoma is only one spot ahead of Tennessee by recruiting; the gap on the field is much larger. I don't think we will see Florida at #5 or Michigan at #11 at the end of the season. Oregon (#18) and Baylor (#31), on the other hand, are doing more with less.
Subscribe to:
Posts (Atom)