Clarifying BABIP
Just the luck, a major area of baseball statistical interest for me gets a major post by Mellinger and I’m out of town with the Fanpost already falling down the main page. Well, I want to respond to a couple misconceptions about BABIP and I felt the comments section of our community response wouldn’t do it justice. This is partly a summation of other’s ideas, partly my take on it, so feel free to correct me if you believe I have misstated some part.
First and foremost, BABIP of batters IS a skill. No credible researcher that has studied it will argue against that. Mellinger’s take that good hitters have high BABIP’s is absolutely correct… it is also irrelevant to how hitter’s luck is measured with BABIP. Now, there is still ongoing research to understand it better, but generally most people will agree that BABIP is mainly a measure of a batter’s ability to hit the ball hard interacting with the opposition’s defense. (A quick note, the determination of this is backwards, I have started with a conclusion here then move to evidence, but past research done correctly has answered this for us and simplifies this explanation)
When we talk about pitchers, the fact that they face, over the course of a season, all different types of batters with differing abilities to hit the ball hard tends to average their BABIP out to a mean value, based on their own park and defense. That’s why people tend to focus on the over/under .300 BABIP as a measure of luck. There are always outliers that have to be considered, but this is still a key assumption of DIPS theory. I won’t get into pitchers anymore because it has been written about much more extensively elsewhere.
But back to hitters. We can further break down the results of batted balls from hitting the ball hard to making solid contact and power of the swing and again, interacting with defense (+ other aspects + ‘error’). However, over the course of a season, players tend to play against the "average" defense, so it cancels itself out when comparing player to player. So if we want to understand what happens to a batted ball, we need to focus on finding a statistical proxy to "solid contact" and "bat power" and whatever "other effects." Most baseball fans will instantly be able to answer part of this, a line drive seems to fit part of that perfectly. Well, the statistics agree, line drives are highly correlated to BABIP. Now work done by myself, Studman of THT, Bendix and Dutton of BAT, and MattS of The Good Phight blog have all added other potential instruments to use in determining what causes batted balls to result in hits for those other effects. These variables can then be regressed to fit a mathematical model to explain what causes BABIP.
So where does luck come in? Well, regressions rarely yield 1 +1 = 2 type answers. They inherently have error cause by the unknown, i.e. 1 +1 + E = 3 (known + unknown = BABIP). Mathematically, E is very easy to quantify, but intuitively, it is harder to explain what it means. This E is what many of us consider luck. It’s some unexplained force that is causing batted balls to behave outside our understanding. Now, without getting into some heavy duty philosophy on fate, choice, and control, people will take that and define it as they will. But in the end, the fact is batters have not been shown to be able to control that error. Like I said though, this is far from the final word, more research is required.
The case against Mike Aviles is the E was enormous in 2008. Judging by the normal proxies, he wasn’t really doing anything amazing with respect to "solid contact" and "bat power." Above average? Yes. League leader? Not even close. Now, as a Royals fan, I hope my statistical methods are completely wrong and Aviles is league MVP next year, but it requires a lot of people to be very wrong about what causes hits to be hits.
Sorry if this is long, but I felt Mellinger's post was more due to mis-understanding than anything and this isn't a simple issue.
6 recs |
27 comments
Comments
Where does speed play into it?
Just a plain eye view of Aviles seems to be that he hits alot of balls up the middle. He legged out quite a few infield singles (seemed like more than others on the team). Maybe I’m wrong but it seemed like Aviles was doing that. Also his bat control seemed to be taking advantage of defensive alignment slicing a lot of singles between 1st and 2bagger when DDJ was on base.
I don't know how to put this but I'm kind of a big deal.
by kcscoliny on Feb 12, 2009 2:53 PM EST reply actions 0 recs
New xBABIP models take speed into account
Like the THT one I linked to a while back. In any case, as Aviles compiles more MLB PAs, we’ll get a better idea as to what “his” personal BABIP range is. His CHONE speed score is 5.1, which is pretty decent. But even the Bendix study/Dutton study which took into account handedness, speed, etc., still had Aviles as one of the luckiest players in the majors, as did Zep, PrOPS, and even the simple LD% +.12 method.
Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.
by devil_fingers on Feb 12, 2009 2:58 PM EST up reply actions 0 recs
Ya I read that
and while he is bound to regress maybe those things will lead to less of a regression than people think. What was his xBABIP?
I don't know how to put this but I'm kind of a big deal.
by kcscoliny on Feb 12, 2009 3:01 PM EST up reply actions 0 recs
which version?
its not a hard and fast equation, everybody is trying to estimate it with increasing accuracy. I think I had him in the 320 range vs. a 360 or so actual. that easily can be a 30 point drop in BA
by ZeppelinDZ on Feb 12, 2009 3:06 PM EST up reply actions 0 recs
I was trying to think of THT's which is .318
I don't know how to put this but I'm kind of a big deal.
by kcscoliny on Feb 12, 2009 3:11 PM EST up reply actions 0 recs
zep, have you emailed mellinger?
I wanna know what love is, I want you to show me
by LeoBloom on Feb 12, 2009 3:36 PM EST reply actions 0 recs
no
but this isn’t really a “hey Mellinger, your article was crappy” type of thing, I really like his work. I have his blog on rss. The stats going on in BABIP regressions are kinda hard to understand yet its on such a central concept to baseball. Seriously…Why do hits become hits? There is a 150 years of conventional wisdom that tries to answer that. A local journalist writing to the general population isn’t expected to approach baseball in an academic fashion. I think he just mis-represented our knowledge of hitting with that article.
We know he reads this blog, and if he (or anyone else) wants to respond directly to me, my email address is zeppelindz@gmail. com. I really enjoy discussing this stuff at any level (drunken bar talk to peer reviewed journal articles)
by ZeppelinDZ on Feb 12, 2009 3:45 PM EST up reply actions 0 recs
yes, that is really one of the central questions
and I can remember being a little leaguer and noticing the obvious: some of my best “hits” weren’t hits, they were line drive outs of even pop flies, a flare to short center is great hitting, but a harder hit ball right to the CF is bad
by royalsreview on Feb 12, 2009 4:33 PM EST up reply actions 0 recs
Shame on you Sam, shame on you
After reading Mellinger’s blog regarding BABIP and the subsequent postings in RR, I have felt compelled to provide a layman’s perspective on the situation.
I believe that Sam is having trouble with the use of “luck” as a part of BABIP and using “luck” to describe a player’s past performance and to some extend I can see his point. Take Aviles as an example, yes, his BABIP for 2008 was high and it is more than likely to regress back to the mean. But does that mean that Aviles was lucky in 2008? How can we say that a person is lucky over a period of 400+ at bats? What I would say is that his BABIP is an indicator that he had a higher percentage of batted balls that became hits when compared to the league average. In all likelihood, his BABIP will not be as high this year and his overall offensive stats will show that.
Mellinger’s research is essentially useless, shouldn’t better players (over the long term) have better BABIP when compared to the league? BABIP is very useful for highlighting possible outliers or anamolies that have occurred (Bannister) and explaining why a player’s performance may have dropped from a previous year. What Sam needs to research (as many of you have done) are the “one-hit wonders” who may have had an outstanding season and then subsequently fell from grace.
I am not a huge stats guy, however, I see the benefit of BABIP and I am not encouraged when a writer downplays its usefulness to such a wide audience. The impact on the “average” fan, who does not have as much in-depth knowledge of advanced metrics, could be detrimental to those who are trying to gain footholds with a broader base. To this I say shame on you Sam Mellinger, shame on you.
-You play to win the game - Herm Edwards
by truebluetradition on Feb 12, 2009 4:09 PM EST reply actions 0 recs
Does BABIP take lineup position into account?
I mean Aviles did significantly better with a runner on first base then he did with the bases empty, or with a runner on any other base.
Aviles
Bases empty: 235 ABs .298/.321/.460/.781
Runner on 1st: 184 ABs .359/.394/.505/.899
Runners in scoring positon:107 ABs .316/.374/.453/.827
I also seem to remember that a lot of his hits went through the gaps left by the shifted defenders when that runner was on.
So maybe Aviles is simply the perfect #2 hitter…amazing with a runner on first, above average otherwise.
I refuse to set up a signature....DAMMIT
by RoyalPug on Feb 12, 2009 6:42 PM EST reply actions 0 recs
BABIP doesn't, some xBABIP estimation might
certainly an interesting hypothesis to investigate, but that kind of goes with the clutch hitting thing that lots of people have debunked.
by ZeppelinDZ on Feb 12, 2009 7:07 PM EST up reply actions 0 recs
I don't think that what RoyalPug
said is really towards “clutch” so much as to whether Aviles is good at actually aiming his hits. With runners on he is performing excellent situational hitting by placing his hits where he has an extra advantage (a gap) at actually getting a hit due to the runner being on. That would be one possible explanation of those splits shown above, as RP implied.
Feb 13!!
by kabrink on Feb 12, 2009 8:03 PM EST up reply actions 0 recs
beat me to it
Yeah as i was going to explain, I don’t think its clutch hitting(or else he would hit better wRISP) so much as a natural tendency to hit the ball to the area vacated by the 1st baseman when a runner is on. I would assume he probably hits the same number of balls there when noone is one(leading to outs) which would explain the difference.
I refuse to set up a signature....DAMMIT
by RoyalPug on Feb 12, 2009 8:13 PM EST up reply actions 0 recs
without doing any in-depth analysis
his hitting chart for the K
it doesn’t ‘look’ like that is the case, only 1 single to the right side all season, other parks don’t look much different.
but again, you get into one of the fundamental assumptions of batted balls, that players really have very little control over what the ball does after it hits the bat. Maybe Aviles CAN do things no other players do.
by ZeppelinDZ on Feb 13, 2009 9:31 AM EST up reply actions 0 recs
clarify
right side of the infield, don’t know how many of his singles in RF were grounders
by ZeppelinDZ on Feb 13, 2009 9:33 AM EST up reply actions 0 recs
Copy and Paste from other fan post
I’m going to copy and paste what I wrote in mellingers fan post
And that’s the thing that bothers me about BABIP is that defense in my opinion plays a large role in whether a ball is considered a "hit" or an "out". So I will refuse to say that having a high BABIP is a "skill".
The same hit against the Yankees might not make it against the Blue Jays. That’s just the way it is and that’s not skill.
Of course like you said, it should average out over the course of the year but still, it plays a role.
by Royal from Queens on Feb 12, 2009 7:05 PM EST reply actions 0 recs
defense plays a huge role in creating hits
but its a stretch to explain the difference between good and bad hitters as consistently playing against better or worse defense. With unbalanced schedules, maybe it plays a role in certain division that value defense more, and/or maybe there is a league effect of some sorts.
by ZeppelinDZ on Feb 12, 2009 7:12 PM EST up reply actions 0 recs
What about hard-hit grounders?
I’m not comfortable with line drives as the only indicator of balls that are more likely to become base hits. As someone said earlier, you have speed. And you also have hard-hit grounders (of course that might cut both ways, a harder-hit grounder right to the third baseman might actually increase your chances of making an out). But on the whole, I’d like to see a study of how often harder-hit grounders become base hits (we would have to both define and measure hh grounders or "scorchers"). You hit a dribbler in the hole, you’re probably out. If there’s a guy on first, you’ve probably made two outs. You hit a scorcher in the hole, you’re on first base. It seems to me we’re not quite advanced enough in our measurement of various types of hits to be able to use BABIP effectively. We just lump everything into one of three Cro-Magnon categories: grounder, fly ball, line-drive. Some day, this is going to look pre-historic to future baseball analysts.
Along the same lines, consider the plane of a line-drive. Line-drives have all kinds of different arcs, of course, not to mention speeds. You have the line-drive that flies parallel to the ground until it just clears the infield, dropping in front of the outfielder. Then you have the Bo Jackson line-drive that clears the outfield wall before it starts to descend. Then there’s all of the gray area between a line-drive and a fly ball. Where is that line exactly? What does the angle/speed of line drive have to be, is it a universal definition, and do analysts accurately measure the arcs of batted balls in such a uniform way that we can be comfortable with line-drive statistics. Shouldn’t there be classes of both line-drives and grounders (and bunts…good bunts vs. bad ones), shouldn’t we take situational hitting into account—sac flies, hit-and-runs, advancing the runner—and remove all of these plays from BABIP stats?
I think, in a round-about way, Smellinger (that’ s what my bro and I affectionately call him) was sort of articulating all of these questions. Because if you were to measure hits and classify them in these specific ways, you’d have to concede that luck plays a smaller role that we might otherwise think it does.
Tony Goodwin used to claim that he had control over where in the ballpark he hit a ball. He was the ultimate situational hitter; if the scenario called for a hit to right field, he could inside-out his swing, adjusting for the speed/movement of the pitch. Or he could pull it to left, or find the gaps almost as if the bat were a pool cue. Now, this was always hard to believe, but I believed it. I think this ability is extremely rare, but I do believe it exists. And if it does, at least for those players, you would have to factor luck even further out of the equation.
I'm about to change my username to DannyDuffyfan
by jackie ballgame on Feb 13, 2009 10:57 AM EST reply actions 0 recs
if the scenario called for a hit to right field, he could inside-out his swing, adjusting for the speed/movement of the pitch. Or he could pull it to left, or find the gaps almost as if the bat were a pool cue
You turned him around, he was leftanded so he would inside-out to leftfield and pull to rf. Otherwise I agree with alot of wht you were saying.
I don't know how to put this but I'm kind of a big deal.
by kcscoliny on Feb 13, 2009 12:31 PM EST up reply actions 0 recs
My God, I just said 'Tony Goodwin'
Shoot me. Tony Gwynn.
I'm about to change my username to DannyDuffyfan
by jackie ballgame on Feb 13, 2009 10:59 AM EST reply actions 0 recs
Can I blab for a second about the pitching side of BABIP?
I should stop drinking coffee right now.
Here are my questions about the pitching side of BABIP; direct me to an article that addresses this. Are there pitches designed to induce harmless pop-ups? I would think knuckle balls fall into this category. Watch a good hitter hit a can of corn; he gets extremely pissed off, throws the bat in disgust. He definitely doesn’t want to hit a pop-up. And the pitcher loves it. It seems to me that many times, that’s the exact result the pitcher was trying for. I mean, why even use the phrase “fly-ball pitcher” unless we are suggesting that there are pitchers who deliberately attempt to induce pop-ups and are good at it? Of course, Banny covered this very well year before last. He said he intentionally pitched to contact, and that he was able to manipulate said contact. Of course, ground-ball pitchers make the same claim. So if the pitcher can manipulate what kind of contact he gets, doesn’t that also factor luck (to some degree) further out of the equation that most people who use BABIP seem to do?
I freely admit I haven’t studied BABIP arguments nearly enough, and so treat these as questions rather than statements. I’m a student in the classroom, I’m just raising my hand here.
I'm about to change my username to DannyDuffyfan
by jackie ballgame on Feb 13, 2009 11:10 AM EST reply actions 0 recs
Banny definitelly succeeded
with the whole “pitching to contact” this season. Rich, sweet, hard contact.
Bringing you more-or-less replacement level analysis and commentary since sometime in 2008.
by devil_fingers on Feb 13, 2009 12:29 PM EST up reply actions 2 recs
and they hit it
out of the park, like you just did.
I just got back from your mom's basement.
by Warden11 on Feb 13, 2009 5:37 PM EST up reply actions 0 recs
a response to all of jackie's above points
most of what you are saying seem like valid ways players could influence BABIP. the counter argument to that is that empirically, players don’t ‘seem’ to be able to consistently do that. Maybe Tony Gwynn did. Sticking with Aviles tho, your argument becomes he must be exceptional in his ability to do these things, like Gwynn.
I not saying your wrong here, quite the opposite, you have pointed out a number of additional factors that should be considered.
Ask yourself this, what does Mike Aviles (or others like him) do to a baseball that nobody else in baseball does? That’s the question we want to answer for these outliers. Is he different or lucky?
by ZeppelinDZ on Feb 13, 2009 2:36 PM EST reply actions 0 recs
probably both
Excellent point—for hitters, BABIP is very closely tied to how hard the contact you generate is. It’s also tied to all sorts of factors, and without having any numbers to back this up, I do remember Aviles appearing to very deliberately aim the ball at the hole between 1st and 2nd when a runner was on first base. It’s not the difference between .320 and .360, but it looked like a real skill that could generate a couple extra hits each year that xBABIP won’t take into account.
I’m curious what you think about pitching BABIP-against? Mellinger’s post showed pitchers who over fairly large samples were able to bring their BABIP-against down to .290 or in some cases lower. Do you think there are/will be pitchers who will be able to beat .290 on average?
by kcdc1 on Feb 13, 2009 8:49 PM EST up reply actions 0 recs

by 














