Power play success expressed through the time it takes to score a PP goal
Power play success is typically shown as a percentage: number of power play goals as a percentage of the number of power play opportunities. Nothing wrong with that. But I was curious to see how long a team typically takes to score during the 120 seconds of power play. Yeah, big caveat; see below in the Data section. And pretty soon I realized it would not be as easy as I initially thought. You’d think I would have figured that out by now…
THE DATA
I wanted to use data from the NHL site, since I was (and still am) not aware of any data source available with the kind of information I needed. So I did a couple of things (code available on my github page):
- created R-scripts to scrape NHL’s play-by-play JSON files for every play and it’s time of occurrence in the game;
- created R-scripts to scrape NHL’s game summary HTM(L) files to get team strength for goals, who was on the ice, and penalty details;
- Loaded all this data into a SQLite database (if you are looking for a lightweight SQL environment, hosted freely and accessible from any machine, I highly recommend checking out https://sqlitebrowser.org/) and wrote some SQL scripts to bring things together so to speak (with some help from Google Sheets, believe it or not);
- Loaded the data into Tableau Public, made a number of calculated fields to select the PP goals meeting the criteria mentioned further down, and exported the resulting chart into a stand-alone data source;
- Got the NHL Power play summary data for the selected time frame and combined that with the above mentioned data set;
- I filtered out only power plays for minor penalties (120 seconds) and with a one-man-advantage to, you know, compare pucks to pucks equally. The time frame is the 2017-2018 season and the first 307 games of the 2018-2019 season (no promises to update this at the end of the season);
- Time to look at the data in Tableau! Quite frankly, looking at the data visually made me think for a minute; I assumed that scoring quickly into the power play is a good thing. But looking at the Penguins here, they took more time and perhaps wore out their opponents more, still scoring on the power play.
RESULTS
So the more I was working with the data, the more I wondered what values would be considered good vs. bad. Not that I think that teams could/would delay scoring on a PP, but is it more beneficial to score at 1:59 of the PP, using up as much time of the 2 minutes as possible and wearing out the opponent, or is scoring 10 seconds into the PP a mental blow to the opposing team, having a bigger impact than the physical wearing out?
I posted a poll on twitter to find out general opinion, and although I appreciate seven folks taking the time to respond, the sample size is a bit small to come to a conclusive answer. And even still, it would have been pretty much even:
The issue goes beyond one page for the first time, as I am showing traditional PP success with a feature to look at team’s PP shot locations, this alternative view of PP success, and how they relate or compare.was not going to fit everything I wanted to show on one page.
Page 1 shows the traditional power play percentage by power play opportunities and power play goals;
Page 2 shows all goals scored in the specified time frame on one-man advantages after minor penalties, called power play efficiency. The main conclusion that these type of goals seem to be randomly and widely spread over the 2 minutes of the power play. The team averages are then displayed, and looked in more detail (all team averages are between 50 and 68 seconds in the power play) to better see the differences;
Page 3 shows the combination of power play percentage and efficiency.
The interactive version can be found on my Tableau Public page, as usual.
More to follow!