Found September 22, 2009 on Vegas Watch: Yardbarker Blogger Network

Six weeks ago I wrote about a college basketball projection "system" that I had started working on, which uses a team's previous year's KenPom rating along with how many minutes they had returning to project how strong they'd be in the following year.

As with any simple equation that tries to predict something extremely complicated, there were many flaws with this, probably the most obvious being that it completely ignored incoming recruits. In an effort to change that, I went through the Scout archives and added the number of four- and five-star recruits each team had each year to the data set. The results looked like this:

Definitely promising; 5* recruits are worth a little less than twice as much as four-stars, and both of those variables are statistically significant. Something bothered me about this though; if a team returned most of their minutes, and also brought in some really good recruits, the regression would consistently overrated them. The reason for this seemed simple enough: there are only so many minutes to go around, so if a team has a lot of starters returning, the freshmen are going to have less of an impact.

To fix that I created a rating from a formula that considered RetMin%: (five-star * 2 + four-star * 1) * (1- RetMin%). I then had a burst of creativity, called that rating "RECRUIT", and included it in a new regression:

I couldn't have asked for much better results than that, with "RECRUIT" fitting into the regression perfectly and even bumping the adjusted R-squared up by about .02.

I would like to continue improving on this. Two realistic additions have come to mind so far. The first is adding some combination of KenPom rating in year N-2 and how many minutes the team is returning from that year. I tried this with just the KP rating from two years prior, and it didn't improve things much, but I think it might be worthwhile if we knew how many of those players were coming back, and thus how relevant that rating is. This would also prevent Indiana from having an unreasonbly good '09-'10 projection; Eric Gordon and D.J. White are not walking through that door.

The other thing I'd like to add eventually is RetPts%, which I think would make a small but meaningful difference. Just for fun, here are the 10 best '09-'10 projections using the "RECRUIT" regression described above, only looking at conferences I've collected data for (ACC, B10, B12, SEC):

As I noted on Twitter, Tennessee really does return all but nine minutes. Hopefully our SEC preview will be up a week from Monday.

Add Comment

Comments (0)
Want more? Juice it up!
Today's Best Stuff
For Bloggers
Company Info
Help
What is Yardbarker?

Join the Yardbarker Network (YBN) for more promotion, traffic, and money.

Yardbarker is a sports site where everyone’s an insider! Sports fans, bloggers, and professional athletes all hang out in the Yard to read and discuss sports articles, view the latest sports photos and videos, and debate and interact with each other.