This post has been de-listed
It is no longer included in search results and normal feeds (front page, hot posts, subreddit posts, etc). It remains visible only via the author's post history.
Whoops, messed up the title.
Hello everyone, I'm currently taking a course designed around writing research papers and for the most recent project I got to choose something related to my major (Math). Being a huge baseball fan, I decided to make a model of expected performance of MLB players for various batted ball velocities. If you want to read the whole paper, I uploaded it to imgur HERE, but it's a bit messed up since imgur made it one big picture instead of a picture per page. The large white spaces are page breaks.
I used the Baseball Savant database to find the batted ball data for 10 "league average" players (10 players with about 100 wRC and combined BSR of about 0). I then tallied whether it was an out, single, double, triple, or home run for each of the 3300 data points, sorted by batted ball velocity, and then calculated the average batting average and OPS for ranges of batted ball velocity. Then I made two scatter plots, one for batting average and one for OPS, using that calculated data.
I then found a polynomial trendline that matched the data by seeing which line of degree up to 6 fit the data best while also staying positive for the entire range (since no player is bad enough to have a negative AVG!)
Lastly I took the equation of that trendline and plugged it into Wolfram Alpha along with any hitter's average batted ball velocity to find the "expected" performance. Strangely, I originally thought that the results would be similar to BABIP/OBP (since no walks is mitigated by no strikeouts) and OPS, but it actually ended up being closer to AVG and AVG SLG. So to calculate expected OBP and OPS I ended up taking the calculated values and adding in career walk rate.
Anyway, enough with the wall of text, let's get to numbers! It's what you all actually care about. For the project I only analyzed 10 players originally, so if you want me to look at someone else, just comment and I'll get back to you as soon as I can (probably).
Note: I make no promises on the accuracy. Take it with a grain of salt.
Player | BBall Velo | AVG | OBP | OPS | exAVG | exOBP | exOPS |
---|---|---|---|---|---|---|---|
Josh Donaldson | 92.434 | 0.297 | 0.371 | 0.939 | 0.330 | 0.408 | 0.943 |
Bryce Harper | 90.901 | 0.330 | 0.460 | 1.109 | 0.293 | 0.388 | 0.836 |
Chris Davis | 92.227 | 0.262 | 0.361 | 0.923 | 0.324 | 0.399 | 0.922 |
Jason Heyward | 90.512 | 0.293 | 0.359 | 0.797 | 0.286 | 0.371 | 0.795 |
Justin Upton | 90.765 | 0.251 | 0.336 | 0.790 | 0.290 | 0.371 | 0.812 |
Alex Gordon | 89.000 | 0.271 | 0.377 | 0.809 | 0.261 | 0.340 | 0.704 |
Yoenis Cespedes | 93.177 | 0.291 | 0.328 | 0.870 | 0.351 | 0.399 | 0.982 |
Ben Zobrist | 89.050 | 0.276 | 0.359 | 0.809 | 0.261 | 0.351 | 0.717 |
Matt Wieters | 87.306 | 0.267 | 0.319 | 0.742 | 0.243 | 0.305 | 0.617 |
Ian Desmond | 90.726 | 0.236 | 0.294 | 0.674 | 0.290 | 0.338 | 0.777 |
Steve Pearce | 87.263 | 0.218 | 0.289 | 0.711 | 0.242 | 0.320 | 0.632 |
Joc Pederson (Pre-ASG) | 93.534 | 0.230 | 0.364 | 0.851 | 0.362* | 0.502* | 1.110* |
Joc Pederson (Post-ASG) | 89.268 | 0.178 | 0.317 | 0.617 | 0.264* | 0.404* | 0.779* |
*I think his low contact tool and high power tool broke it. It would be more realistic as 0.222/0.362/0.970 (His career OBP is 0.140 higher than his AVG) and 0.124/0.264/0.639
So there you have it, from this data it seems like Cespedes and Desmond should improve greatly next year while Harper, Gordon, Zobrist, and Wieters should decline.
Now, I know what you're about to say, "Batted ball velocity isn't the whole picture, the angle and direction also affects performance". My response to that is that I personally think that it is the largest piece of the puzzle, and the OPS predictions for Donaldson, Davis, Heyward, and Upton being practically the same seems to agree. Maybe Harper and the others have the ability to consistently put the ball exactly where they want it.
Either way, I'm open to criticism and other comments. I also posted it here because I think you guys will provide better comments about this post.
From this data, I would love it if we signed Cespedes, but we don't have room for much more salary growth. In a game where hitting the ball hard is important, Cespedes hits the ball much harder than average.
Also, if we can dump Elvis' salary, I would love to sign Desmond for a 1-2 year bounceback contract. If he puts up something close to a 0.777 OPS next year he's easily worth 20 million a year but I don't see him getting that this year due to his stats. He could potentially be THE hidden bargain of this offseason.
Let me know if there's another player you want me to analyze.
Thank you for your time.
EDIT: Forgot to mention, I have no idea if this will work for pitchers (using the against velocity). It should since Baseball Savant has that option but I don't know about the accuracy.
Subreddit
Post Details
- Posted
- 8 years ago
- Reddit URL
- View post on reddit.com
- External URL
- reddit.com/r/TexasRanger...