Hi Folks,
I am very new to the world of hockey analytics, but I have some experience with data analysis. I am home from uni over the holidays and I am already bored
So I am interested in the entry draft. It seems like the place where teams need to make the most decisions, and a very pivotal time for franchises in general. Is it possible to pick out "diamonds in the rough" in later rounds?
My general first pass strategy is as follows: define a binary response variable "has played in more than X NHL games Y years after they were drafted" and train a basic classifier, and see my accuracy. The end goal, however, would be to create a ranking, and compare how well my rankings perform compared to the actual draft rankings.
I have looked over some of the stickies and done some forum searching, but I still have a few questions.
My questions are:
1. Data Sources: Do I need to build an HTML scraper, or has someone already compiled this information?
2. Previous studies: What related previous studies have I missed? I didn't really find much.
Thoughts?