Top 5 Replacement Players That Will Surprise You: Baseball’s WAR
Wins above replacement (WAR) is the wild wild west of unstandardized baseball stats. With the 2014 MLB playoffs about a month away, I’ve started looking at some surprises in the season’s WAR rankings....
A New Baseball Stat: Considering Weighted Isolated Power: wISO
When I hear about a new baseball metric, my ears perk up. I recently stumbled on an article introducing a new baseball metric, weighted isolated power or wISO. As a metric, ISO attempts to measure a...
R-Studio: Putting the Lahman Database on Steroids
It’s easy to calculate advanced baseball metrics with the Lahman database. I’ve always compared the database to indoor plumbing, something we all use every day but sometimes take for granted. The...
Cluster Analysis of an Umpire’s Strikezone in R
Ever wonder how “high” or “low” an umpire’s strikezone is compared to the rest of the league? Thanks to some public data and the PitchRx package, it’s easy to use a cluster analysis to figure it out!...
Adding Pitching Stats to the Lahman R Package
Lately I’ve rediscovered the Lahman package for R. Since I’ve got a Lahman database on my localhost, I normally use a db connection in R to grab the data I need. In the process it’s easy to forget how...
Visualizing Batting Stats: Linear Regression
A couple of months back I wrote an article on why batting average is obsolete as an offensive metric in baseball. The argument was based on a linear regression analysis of several other advanced...
Sabermetrics: PostgreSQL vs. MySQL
I recently migrated my Lahman and PitchFx databases to PostgreSQL from MySQL with phenomenal results. The verdict is, Postgres blows MySQL away and here’s why… Nulls Treated as Zero As far as...
Lahman Database in Postgres: An Updated Guide
I recently wrote a post on the advantages of PostgreSQL for statistical analysis in the Lahman database, so I thought it would be wise to follow that up with a short “how to.” First, go grab my table...
How to Add WAR Metrics to your Lahman Database
I get a lot of questions on how to calculate WAR in the Lahman database. In the past I’ve discussed ways to calculate wOBA and FIP in Lahman but WAR has always been difficult due to the “closed-source”...
Hacking The New Lahman Package 4.0-1 with R-Studio
The developers of the Lahman package for R have recently updated the package to include 2014 MLB stats! For those not familiar, this R package recreates Sean Lahman’s Baseball Database into a quick and...