Hockey Outsider
Registered User
- Jan 16, 2005
- 9,486
- 15,782
There's been a lot of talk over the past few years about secondary assists. I wanted to examine the data and understand the predictive values of goals, primary assists, and secondary assists. We intuitively think that goal-scoring and playmaking are innate talents, which are relatively stable from year to year - but I wanted to understand how the data looks when it's closely examined.
The key to this analysis is getting pairs of data for YEAR and YEAR+1. In other words, if we have information about how many goals, primary assists, and secondary assists a player scored in, say, 2014, can we estimate how many goals, primary assists and secondary assists a player recorded in 2015?
I looked at data from the 2008 to 2019 regular seasons - but we have to set up our parameters. There are going to be significant fluctuations in players' stats from year to year. Some of these I've controlled for, and some of these I haven't. All the data that I've analyzed is on a per-minute basis, to avoid distortion due to changes in games played and/or ice time. Three additional filters that I've applied:
Before going further, we have to validate the data - if our starting point is flawed, then all of the following analysis is worthless. I obtained all of my data from naturalstattrick.com. I did a limited amount of spot-checking. For example, NHL.com shows that Kane has 324 ES assists going back to 2008. That's exactly the same number as what my database has. Unfortunately NHL.com doesn't show the level of detail I need (primary vs second assists at even strength). I looked at Kane's 2016 (Hart & Art Ross) season, and manually went through his scoring logs. I counted 23 ES goals, 28 ES primary assists, and 12 ES secondary assists. This is identical to the breakdown in my database. Based on the spot-checking that I've done (and this has to be done by spot-checking, since there are more than 10,000 lines of data), I'm satisfied that the data I'm using is a reasonable starting point.
The key to this analysis is getting pairs of data for YEAR and YEAR+1. In other words, if we have information about how many goals, primary assists, and secondary assists a player scored in, say, 2014, can we estimate how many goals, primary assists and secondary assists a player recorded in 2015?
I looked at data from the 2008 to 2019 regular seasons - but we have to set up our parameters. There are going to be significant fluctuations in players' stats from year to year. Some of these I've controlled for, and some of these I haven't. All the data that I've analyzed is on a per-minute basis, to avoid distortion due to changes in games played and/or ice time. Three additional filters that I've applied:
- I'm only looking at data for forwards. From a first glance at the data, it was obvious that forwards and defensemen recorded primary assists at nearly the same rate; forwards recorded far more goals, and defensemen recorded far more secondary assists. To avoid distortion, I'm only looking at forwards.
- I'm only looking at even-strength scoring. No, I'm not one of those people who think that a powerplay points counts for less than an even-strength point. But I think that PP deployment materially impacts a player's results, and this can skew the numbers if it isn't properly accounted for. (I also want to be clear that the filter is even-strength - not necessarily five-on-five).
- I'm only looking at players who have at least 900 minutes in both seasons (ie at least 900 ES minutes in YEAR, and at least 900 ES minutes in YEAR+1). Data for a player-season is discarded if it fails to meet both criteria. The 900 minute threshold is admittedly arbitrary - I wanted a high enough threshold to filter out fluky data from small sample sizes, but low enough that I'd still end up with over a thousand data points.
Before going further, we have to validate the data - if our starting point is flawed, then all of the following analysis is worthless. I obtained all of my data from naturalstattrick.com. I did a limited amount of spot-checking. For example, NHL.com shows that Kane has 324 ES assists going back to 2008. That's exactly the same number as what my database has. Unfortunately NHL.com doesn't show the level of detail I need (primary vs second assists at even strength). I looked at Kane's 2016 (Hart & Art Ross) season, and manually went through his scoring logs. I counted 23 ES goals, 28 ES primary assists, and 12 ES secondary assists. This is identical to the breakdown in my database. Based on the spot-checking that I've done (and this has to be done by spot-checking, since there are more than 10,000 lines of data), I'm satisfied that the data I'm using is a reasonable starting point.
Last edited: