Kukla's Korner

The Puck Stops Here

Next entry: Penguins Early Team To Beat

Previous entry: First Look At The AHL

Dave Staples Annual Anti-Stats Rant

The Edmonton Oilers blogs include some of the better sabermetrics blogs out there. There is mudcrutch hockey irreverent Oiler fans, copper n blue and battle of Alberta.  Each have a strong interest in sabermetrics and have written some interesting posts on the subjects.  With the high quality of the Oiler blogosphere it is hard for the mainstream media to keep up.  However, Dave Staples at Cult of Hockey does a good job.  He writes many interesting pieces about the Edmonton Oilers and hockey in general.  These are not just the leftover articles that do not get published in the newspaper (as some weaker media blogs tend to do).  He regularly publishes some very good information on the internet. 

However, one of his recurring themes (that seems to be in protest to the independent Oiler bloggers with strong sabermetric backgrounds) is that hockey statistics are not very useful.  This is a case he frequently overstates in his posts.  Last year he wrote Why Plus/Minus is a rotten, useless, misleading and irrelevant stat for NHL players

This title is overly strong, but he basically argues that +/- is problematic because it depends upon context.  For example, a player on a good team will get a better +/- than one on a bad team regardless of the quality of the players in question.  He also argues that sometimes players get pluses when they didn’t do anything significant to contribute to a goal or minuses when another teammate screwed up allowing a goal and they didn’t do anything wrong.  While all this is true, it hardly makes +/- useless.  It merely makes it important to keep context and the random error in +/- numbers in account when trying to argue using +/- to back up your argument.  I wrote a responce to his rant here

Nearly a year later, Dave Staples is back with another rant entitled what good are hockey statistics if they don’t even tell us who scores and causes goals?   This title clearly overstates his argument again.  Hockey statistics clearly do tell us who scores goals.  The number of goals a player scores is the oldest statistic kept.  The problem he is trying to elude to is that one cannot clearly and unambigously show how many goals a given player creates for his team over a season.  There are crude estimates of goals created but they do not adequately address the issue.  The problem is that hockey statistics do not give all the information necessary to understand a hockey game - and it may be impossible to gather enough information to ever fully understand everything statistically.  That does not mean that considerable understanding cannot be found statistically.  That does not mean that significant advances in understanding are not on the horizon.  It only means that sabermetrics is only part of the picture toward understanding hockey.  It is a useful tool that one cannot throw away because it is not a 100% fully developed, fully proven tool.

Dave Staples goes on to an interview with somebody named Corey Pronman who is attempting to do something about this statistical problem, but is probably headed in the wrong direction.  Mr. Pronman has introduced a subjective element into his statistics.  He watches replays of gals and subjectively choses the players who he thinks deserve credit for the goal or blame for the goal and gives them a subjective plus/minus rating he calls individual plus/minus or real plus/minus.  The nature of this statistic is that it will inevitably show which players are best at doing whatever Mr. Pronman thinks is important in scoring goals and which players regularly do what Mr. Pronman thinks cause their team to allow goals.  That subjective element cannot be removed.  Everything is seen through Mr. Pronman’s filter.  That in and of itself is not a bad thing, but we cannot learn anything about which players are important that isn’t already built into Mr. Pronman’s subjective evaluation of goals.  The subjective element of statistics needs to be removed and reduced as much as possible to truly learn about the underlying elements of the game.

The main problem with hockey statistics is that not enough information is gathered.  In a typical game, six or seven goals are scored.  We record who scores and assists on those goals and who was on the ice for +/-.  That is it.  The numbers you see for a player in a typical box score are based on tallying an average of six or seven events in a game.  Roughly one event for every ten minutes of playing time is tallied.  That leaves significant information out of the analysis.  The effort to fix this is to record more information.  That is the effort behind the Corsi number .  In Corsi, each shot directed at the goal is included (that includes shots on goal, missed shots and blocked shots).  This gathers far more entries into the dataset for evaluation of players.  Perhaps all of those elements are not equal.  A goal scored is clearly more important than a missed shot.  Somebody in the future may find a better way to tally all of these events (and find other meaningful events that can be tallied with a minimum of subjectivity) to give a better dataset in which to analyze players contributions.  From this dataset, one can work out better and better models of the contributions of players and be better able to quantify the value of a given player.  That is the goal of hockey sabermetrics.

I think the idea of subjectively going over statistics to award those players one thinks did the right thing and penalize those who didn’t when goals are scored is not the right approach.  The approach is to build as large a dataset of significant events and find the appropriate objective analysis of those events.  It is not to add in individual bias in scoring by making it more subjective.

To answer Dave Staples question:  What good are hockey stats if they don’t even tell us who scores and causes goals?, hockey stats do a good (but imperfect) job of telling us who scores and causes goals and that clearly shows that they have a strong value.  With future understanding, hockey stats can be made even more useful than they are today.

Filed in: | The Puck Stops Here | Permalink
 Tags: sabermetrics,

Comments

Avatar

Puck Stops Here:

I think you’re mistaken in regards to my main point on plus/minus. Essentially, it’s a team stat and a good team stat.

If a team is +60 over a season at even strength scoring, you know for certain it’s a good team.

When you apply this team stat to individual players, however, it tells you much less, because an individual player is only 1/6th responsible for what goes on the ice when a goal is scored for or against his team.

This application of team stats to individual players is a fixture of hockey analysis, but when you have 30 or 40 per cent false positives and false negatives—as team plus/minus does when applied to individual players—it renders the stat unreliable. It’s neither fair nor accurate.

My goal, if nothing else, is to cut down that rate of false positives and false negatives in a plus/minus stat for players, to be more fair and more accurate.

The goal of hockey sabrmetrics may be to find objective standards, but in the work we’re doing the problems of subjectivity can be controlled if a team of people go by an objective set of standards in making a call. That’s what we do. If you want to review the standards, contact me and I’ll send them to you.

This method can provide useful information. It already does in other sports. It provides a stat like “assists” in basketball and “errors” in baseball. I know baseball stats guys hate “errors,” but it’s proven to be a durable and somewhat useful stat.

The new fielding stats in baseball also have an element of subjectivity and scoring chance stats in hockey, which are widely used, have a big subjective element.

Here’s another example . . .

There’s no end of controversy about referee calls in a hockey game, which are subjective calls governed by an objective standard.

But over a season, if we see Player A has four penalties and Player B has 40, we don’t go saying, “Hey, that means little, because this was too subjective.”

We say: “Player A plays a cleaner game than Player B.”

We say this because we know that most often the refs get it right, even if they make the odd mistake.

Another example is the strike in baseball, a subjective call made on the spot by someone trained on an objective set of standards. Now any one call can be hugely controversial, but baseball fans accept the results over the long term. They accept a pitcher who strikes out 250 guys over a season is a better strikeout artist than a guy who strikes out just 50, even if his strikeout total is governed by a subjective call (and one that doesn’t have the benefit of replay to get it right, a benefit we have).

If over a season, Corey finds finds Florida Player A has contributed to 50 goals and made mistakes on 40, making him +10, while Player B has contributed to 30 goals and made mistakes on 40, making him -10, I’m going to put a lot more weight on his finding—which is governed by his judgment but guided by guidelines—than I do on any kind of team stat crudely applied to either player.

So I don’t believe the issue you highlight is dire. I’ve been assigning errors for two years now. Some people don’t like it. Others do.

Perhaps it’s more on the line of an enhanced scouting system on a given team, a scouting system that has some built in objective standard to guide it along. If that’s all it is, that’s fine with me.

If others want to go a different route and keep on applying team stats to individual players, well, that’s their business, of course.

P.S. You write: “Perhaps all of those elements are not equal.  A goal scored is clearly more important than a missed shot.”

Now, it’s true that on occasion I’m given to over-statement. But, perhaps, this was a bit of an understatement.

Posted by David Staples on 10/20/09 at 01:32 PM ET

PuckStopsHere's avatar

Dave

There are several reasonably well proven methods to handle the comparison of players +/- ratings on different teams.  Here is one method that treats +/- as a rate stat and here is another that treats them as a counting stat.

Your problem of team effects on individual players is largely solved under both methods - of course there are always other effects that one might then worry about.

I think a better solution than worrying about false positives and false negatives (and inputting the bias that comes in determining them) is to gather sufficient data that the false counts are not significant to the result and can be adjusted for, as is done in the +/- examples.

The problem in picking which players are important in goals scored is that there is too much subjectivity (at least as defined in your article).  For example, is the guy who carried the puck up the ice important in scoring a goal?  Does it matter if he brought the puck up the ice on the previous shift before getting a faceoff deep in the offensive zone and and going off on a line change?  In order to subjectively give out goals, +/- etc by the method you spell out you must assume answers to these questions.  Either you give them credit or you do not - the subjectivity of this judgment is built into your system.  I argue we need to gather the data and study this.  We can amass a list of who brought the puck into the offensive zone for each goal and how long before the goal etc and quantify these effects.  Instead, the system you describe subjectively chooses whether or not to include them and hence cannot ever gather results that go beyond the subjective inputs in the system.  In baseball with errors (for example) things are far better defined, because we know better what questions to ask and thus how to define things.  That said, conventional wisdom is that the number of balls a player successfully fields (his assists + putouts) is a better measure of his defence than his error total (for a given position - and with a couple other pieces of fine print).  A player who makes more outs and bobbles a couple balls getting errors is better than a player who makes less outs (has less range) but does not get as many errors.

Posted by PuckStopsHere on 10/20/09 at 02:22 PM ET

moore00's avatar

David, you make good points. 

If you read through his stuff, PSH is the slow kid in the class.

Posted by moore00 from Columbus, OH/Grand Rapids, MI on 10/20/09 at 02:41 PM ET

Avatar

TSPH,

As the fellow who directed you to this post, I urge you to re-read the article.

Staples’ isn’t against hockey statistics; he seeks to develop more meaningful hockey statistics that explain rather than mislead. Moreover, the error assignment is Staples’ project (he’s been working it for the last couple of years on the Oilers). The interview is with a fellow who is going to take Staples’ methodology and apply it to the Panthers.

There is merit to a refined plus/minus that credits/debits players DIRECTLY involved in the play. Traditional plus/minus only reveals whether or not a player was on the ice at the time the goal was scored. It does not reveal if the player in question had any direct impact on the play or was in proper position of coverage.

For example, I can tell you that Shawn Horcoff and Ales Hemsky were minus 1 on the last play and that gives you the information that they were beat for a goal. But if I tell you that Hemsky was covering the right point and prevented the pass/shot from that zone, while Horcoff was in front of the net where he lost his assignment, screened the goaltender and the left point shot deflected off him into the net, doesn’t that provide you with more information on the cause? And now knowing this, why should Hemsky and Horcoff get assigned the same result (a minus)?

As this concept develops, I think you will find the hockey error is not subjective as much as it is interpretive.

Posted by Matthew McCallum from Redding, California on 10/20/09 at 02:50 PM ET

Avatar

I notice between the time I started and finished writing—bookending a meeting that I had to run off and attend—David Staples posted.

Posted by Matthew McCallum from Redding, California on 10/20/09 at 02:57 PM ET

Add a Comment

Please limit embedded image or media size to 575 pixels wide.

Add your own avatar by joining Kukla's Korner, or logging in and uploading one in your member control panel.

Captchas bug you? Join KK or log in and you won't have to bother.

Commenting is not available in this weblog entry.
Feed

Most Recent Blog Posts

About The Puck Stops Here

imageThe Puck Stops Here was founded during the 2004/05 lockout as a place to rant about hockey. The original site contains over 1000 posts, some of which were also published on FoxSports.com.

Who am I?
A diehard hockey fan.

Why am I blogging?
I want to.

Why are you reading it?
???

Email: .(JavaScript must be enabled to view this email address)

When learning from experts it’s best to learn personally from them, or from their blog. We can provide that with poker lessons blog, your home to learn poker personally.

Do you get shocked from the luck in the game of poker? Stop getting shocked and start being a Poker Shoker

high yield savings account






Donate to Kukla’s Korner