| Sign Up | Google+

Fanposts

PRESENTED BY
PRESENTED BY

Batted Ball Data – The Difference Between FanGraphs and Baseball Reference

There are many different outcomes of a plate appearance – hit, strikeout, double, home run, walk, etc. Since 2002, statisticians found a new way to record the result of a batted ball as either a line drive, ground ball or fly ball – no matter if it was a hit or an out. Sometimes there is some subjectivity in recording these outcomes; two people can see the exact same play and one will say it’s a liner and the other will say it’s a fly ball. That’s what’s going on with FanGraphs and Baseball Reference. Both get their data from different sources (FanGraphs from Baseball Info Solutions and Baseball Reference from RetroSheet) and so there’s going to be some discrepancies in their results.

Using the Tigers’ projected starting 8 for 2013, here are the differences between FanGraphs and Baseball Reference using last year’s numbers (Victor Martinez is omitted because he didn’t play at all in 2012). The data from FanGraphs can be found by clicking here and the data from Baseball Reference can be found by clicking on the players’ names.

Ground Balls Fly Balls Line Drives
FG B Ref Dif FG B Ref Dif FG B Ref Dif Total Batted Balls
Miguel Cabrera 224 226 -2 191 197 -6 115 107 8 530
Prince Fielder 208 215 -7 168 176 -8 128 113 15 504
Austin Jackson 174 173 1 140 150 -10 98 89 9 412
Torii Hunter 205 205 0 100 116 -16 89 73 16 394
Andy Dirks 99 101 -2 97 100 -3 63 57 6 259, 258*
Alex Avila 123 124 -1 79 86 -7 63 55 8 265
Johnny Peralta 177 182 -5 157 171 -14 94 75 19 428
Omar Infante 199 202 -3 191 195 -4 98 91 7 488

*Total batted balls should equal regardless of using FanGraphs or Baseball Reference. The difference here is that Baseball Reference categorized one of Dirks’ batted balls as a bunt. Further information can be found here.

Oftentimes people will give these results as a percentage or rate. To get this, simply divide the batted ball type by the total batted balls.

Ground Balls Fly Balls Line Drives
FG B Ref Dif FG B Ref Dif FG B Ref Dif
Miguel Cabrera 42.3% 42.6% -0.4% 36.0% 37.2% -1.1% 21.7% 20.2% 1.5%
Prince Fielder 41.3% 42.7% -1.4% 33.3% 34.9% -1.6% 25.4% 22.4% 3.0%
Austin Jackson 42.2% 42.0% 0.2% 34.0% 36.4% -2.4% 23.8% 21.6% 2.2%
Torii Hunter 52.0% 52.0% 0.0% 25.4% 29.4% -4.1% 22.6% 18.5% 4.1%
Andy Dirks 38.2% 39.1% -0.9% 37.5% 38.8% -1.3% 24.3% 22.1% 2.2%
Alex Avila 46.4% 46.8% -0.4% 29.8% 32.5% -2.6% 23.8% 20.8% 3.0%
Johnny Peralta 41.4% 42.5% -1.2% 36.7% 40.0% -3.3% 22.0% 17.5% 4.4%
Omar Infante 40.8% 41.4% -0.6% 39.1% 40.0% -0.8% 20.1% 18.6% 1.4%

All of the differences are less than 5%, so it really doesn’t matter which site is used as the same conclusions can be drawn from using either site.

Practical use of this information can show if a player had good or bad fortune. Batting Average on Balls In Play (BABIP) is an often used sabermetric stat to show "luck." It has more meaning by also including batted ball data, as a high line drive rate can inflate BABIP. So there could be a legitimate reason why someone has a high BABIP without attributing it to "luck."

BABIP = (H-HR)/(AB-K-HR+SF)

An often underutilized sabermetric stat is xBABIP. xBABIP uses batted ball data to show what a player’s BABIP should have looked like. A higher xBABIP shows the player was "unlucky" while a lower xBABIP shows a player was "lucky." A quick calculation of xBABIP is to add .120 to the line drive rate. There are other xBABIP calcuations that use more batted ball data. The one I’m going to use can be found here.

xBABIP = (( GB – IFH ) * .159 + (FB-HR-IFFB) * .121 + LD * .750 + IFH + BUH ) / (GB + FB + LD + BU – HR – SH)

Where:

GB = Ground Balls

FB = Fly Balls

LD = Line Drives

HR = Home Runs

IFH = Infield hits

IFFB = Infield Fly Balls (AKA popups)

BU = Bunts

BUH = Bunt Hits

SH = Sacrifice Hits

Obviously two different sets of data can give two different results, which can be found below.

xBABIP
BABIP FG B Ref Dif
Miguel Cabrera 0.331 0.297 0.287 0.010
Prince Fielder 0.321 0.320 0.301 0.019
Austin Jackson 0.371 0.322 0.308 0.014
Torii Hunter 0.389 0.312 0.286 0.026
Andy Dirks 0.365 0.320 0.305 0.015
Alex Avila 0.313 0.296 0.277 0.020
Johnny Peralta 0.275 0.293 0.264 0.028
Omar Infante 0.291 0.299 0.290 0.009

Again, there isn’t much difference with the biggest one being Jhonny Peralta, who’s BABIP falls between his FanGraphs’ xBABIP and his Baseball Reference’s xBABIP. So one set of data says that Peralta was "unlucky" and the other says he was "lucky."

It is also a good idea to compare a player’s BABIP to his career BABIP as some players tend to over-perform or under-perform their xBABIP. More complicated xBABIP calculations could also include ballpark factors.

                                                                                                                                                                                                               

This is a FanPost and does not necessarily reflect the views of the Bless You Boys writing staff.

Recent FanPosts

View All Fan Posts

There are 4 Comments. Load Now. Loading

Shortcuts to mastering the comment thread. Use wisely.

C - Next Comment
X - Mark as Read

R - Reply
Z - Mark Read & Next

Shift + C - Previous
Shift + A - Mark All Read

Comment Settings

Live comment alert: Hide it!

Comments for this post are closed.

tracking_pixel_5351_tracker