As the title suggests, this is regarding basketball. Specifically, men's Division I college basketball. This effort was part of my master's report, which ultimately was an exercise in data mining rather than statistical analysis (though some straight-forward statistical modeling was employed). I essentially wrote a simple web-crawler to download all the play-by-play data from the website of the Big 12 Sports Conference. The data was then parsed and analyzed using a combination of Excel and SAS. The culmination of the paper (though you should read it in it's entirety, for a nominal fee: http://ijr.cgpublisher.com/product/pub.191/prod.123.) rests in the following model:
Admittedly, the model is useless for prediction (sorry gamblers), as it depends on the post-hoc conference rankings at the end of the season. Sure, it may bit naive to assume that these rankings follow a linear pattern, but the error terms did appear normally distributed for the model. Still, I believe the coefficients of the model are useful for inferring the approximate point value of a rebound, assist, blocked shot, and steal. Possession gaining events like steals and defensive rebounds are worth approximately 1 additional point, while possession ending events like a turnover, cost about a point. This seems reasonable since the typical shooting percentage is around 50%, so gaining/losing a possession can be expected to net one more/less point.
In terms of application, I think this model would be useful for a division I coach to quantify the contribution of non-scoring events from the box score alone to rate the contribution of his players to the total margin of victory. Adjusting for total minutes played (i.e. tabulating the point differential per minutes on the court) might allow them to compare players of the same position with respect to these non-scoring statistics.
No comments:
Post a Comment