Six weeks ago I wrote about a college basketball projection "system" that I had started working on, which uses a team's previous year's KenPom rating along with how many minutes they had returning to project how strong they'd be in the following year.
As with any simple equation that tries to predict something extremely complicated, there were many flaws with this, probably the most obvious being that it completely ignored incoming recruits. In an effort to change that, I went through the Scout archives and added the number of four- and five-star recruits each team had each year to the data set. The results looked like this:
Definitely promising; 5* recruits are worth a little less than twice as much as four-stars, and both of those variables are statistically significant. Something bothered me about this though; if a team returned most of their minutes, and also brought in some really good recruits, the regression would consistently overrated them. The reason for this seemed simple enough: there are only so many minutes to go around, so if a team has a lot of starters returning, the freshmen are going to have less of an impact.
To fix that I created a rating from a formula that considered RetMin%: (five-star * 2 + four-star * 1) * (1- RetMin%). I then had a burst of creativity, called that rating "RECRUIT", and included it in a new regression:
I couldn't have asked for much better results than that, with "RECRUIT" fitting into the regression perfectly and even bumping the adjusted R-squared up by about .02.
I would like to continue improving on this. Two realistic additions have come to mind so far. The first is adding some combination of KenPom rating in year N-2 and how many minutes the team is returning from that year. I tried this with just the KP rating from two years prior, and it didn't improve things much, but I think it might be worthwhile if we knew how many of those players were coming back, and thus how relevant that rating is. This would also prevent Indiana from having an unreasonbly good '09-'10 projection; Eric Gordon and D.J. White are not walking through that door.
The other thing I'd like to add eventually is RetPts%, which I think would make a small but meaningful difference. Just for fun, here are the 10 best '09-'10 projections using the "RECRUIT" regression described above, only looking at conferences I've collected data for (ACC, B10, B12, SEC):
As I noted on Twitter, Tennessee really does return all but nine minutes. Hopefully our SEC preview will be up a week from Monday.
-
TopicRepliesLast Activity
-
ColoDetox
ryangambito started this All Sports thread040 minutes ago -
Flower Delivery Toronto
denise8u0 started this All Sports thread01 hour ago -
Mini Dirt Bikes
denise8u0 started this All Sports thread01 hour ago -
Manual Directory Submission Service
joseph54o started this All Sports thread02 hours ago -
SEO Service
joseph54o started this All Sports thread02 hours ago
Today's Best Stuff
For Bloggers
Company Info
Help
What is Yardbarker?
Join the Yardbarker Network (YBN) for more promotion, traffic, and money.


















Add Comment