Twinkie Town: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
New Blog: The Boxing Bulletin for Boxing Fans!

Starting Pitching Game Score analysis

With the current winning streak (last night not withstanding), I've been thinking about the Twins' starting pitching and wondering if there was any correlation between how I intuitively saw them doing and their game scores.

Star-divide

Now, truth be told, I don't know if game scores are a useful statistic for measuring a starting pitcher's performance, but I find that I always check them every morning on ESPN's box score.

To find an explanation of what a game score is and how it is calculated, look here.
 Essentially a game score begins at 50 and adds points for positive things the pitcher does (outs, K's, finishing innings late in the game), and takes away points for mistakes made by the pitcher (hits, run, ER's, HR's, BB's). The resulting number is a measure of how dominant that particular start was for the pitcher. 50 is average.

Using data from Baseball-reference.com up through last night's game, what do game scores tell us about the Twins' starting pitching this year?

To begin with, I'm not going to include Liriano, as his three starts are (for this purpose) irrelevant. That leaves Livan, Boof, Baker, Blackburn, Slowey, and Perkins. My sense this season is that Livan has given us some ups and downs but has been relatively consistent. Boof, while at times pretty good, was ultimately pretty bad. Baker seems to me to be the closest thing we have to a staff ace. That leaves Blackburn, Slowey, and Perkins--the three young guns. They've all had growing pains, but it seems to me like they're showing more positives than negatives, with Slowey and Blackburn showing some strong potential.

So if these are my intuitive assessments, (feel free to disagree in the comments below), how do they compare to the game score data?

The top Twins' pitchers thus far this year, by average game score:

  • Slowey--55, 13 starts
  • Baker--54.8, 13 starts
  • Blackburn--49.5, 17 starts
  • Perkins--47.1, 12 starts
  • Boof--43.4, 12 starts
  • Livan--42.4, 19 starts
  • (Franchise--31.7, 3 starts)

Slowey and Baker are the top two pitchers, which seems right to me. Blackburn is up there as well. The most surprising discovery is that Boof has actually been marginally better/more dominant than Livan this year. So what happens if we look at their 20 game splits throughout the year?

Slowey

  • Games 1-20: 42, 1 start
  • 21-40: 50.5, 2 starts
  • 41-60: 57.8, 4 starts
  • 61-80: 51.5, 4 starts
  • 80-100: 67.5, 2 starts

Baker

  •    1-20: 56.8, 4 starts
  • 21-40: 46, 2 starts
  • 41-60: 51, 1 start
  • 61-80: 54.8, 4 starts
  • 81-100: 61.5, 2 starts

Blackburn

  •    1-20: 54.3, 4 starts
  • 21-40: 42.3, 4 starts
  • 41-60: 53.3, 4 start
  • 61-80: 42.3, 4 starts
  • 81-100: 74, 1 starts

Perkins

  •    1-20: 0, 0 starts
  • 21-40: 51.5, 2 starts
  • 41-60: 40.8, 4 start
  • 61-80: 47.3, 4 starts
  • 81-100: 55, 2 starts

Boof

  •    1-20: 46.8, 4 starts
  • 21-40: 45.8, 5 starts
  • 41-60: 35, 3 start
  • 61-80: 0, 0 starts
  • 81-100: 0, 0 starts

Livan

  •    1-20: 52.4, 5 starts
  • 21-40: 44.8, 4 starts
  • 41-60: 30.3, 4 start
  • 61-80: 36.5, 4 starts
  • 81-100: 48.5, 2 starts

What does this tell us? Slowey and Baker have been pretty good all year, and are trending upward. Blackburn appears to alternate between being pretty good and being a little inconsistent, which makes sense for a rookie. All together he's been pretty good as well. Perkins, at this point, has started a little slow but is also trending upwards. It remains to be seen whether his ceiling is as high as Slowey's or Baker's, but at the worst he's looking like a great end of the rotation guy.

The most interesting discovery is that, for the most part, Boof and Livan are mirror images of each other. Both started strong through the first 40 games of the season, and then both imploded. Boof, as we know, got demoted to the bullpen, but their game scores suggest that Livan just as easily could have been sent down. In fact, through the second 40 games of the season Livan's average game scores were down by 15. It is too soon to tell if he's bringing those numbers back up, or is just the lucky recipient of above average run support. In either case, Livan is far from our "ace" using this measure of dominance--in fact, he is our #5 starter.

In order to develop a more comprehensive ranking assessment for our starting pitchers, I ran one last piece of analysis on these numbers, calculating the standard deviation for each pitcher. The standard deviation reveals the spread of numbers within a given sample, and is essentially a measure of consistency. The lower the standard deviation, the more consistent the numbers in the sample are, (or that a pitcher is more consistently performing at or close to their average level). Standard deviations are in the same units as the sample, allowing easy and accurate comparison. In this case, they are unit less because game scores have no units.

Using their standard deviations, (where lower is better), here are the Twins' most consistent pitchers:

  • Baker, 9.8
  • Perkins, 12.7
  • Blackburn, 16
  • Boof, 17.9
  • Livan, 18.4
  • Slowey, 19.7

Baker is clearly more consistent than the rest of the staff. The only noteworthy observations here is that Perkins is also pretty consistent, just at a lower performance level than Baker. Livan, Boof, and Slowey are on the other extreme. Their substantial standard deviations suggest a large fluctuation in their start-to-start performances. Slowey's standard deviation--more than double Baker's--is the most surprising to me considering how dominant he seems to have been this year.

Given this I decided to run the numbers again, this time taking out the highest and lowest game score for each pitcher. These modified standard deviations rank the pitchers accordingly:

  • Baker, 6.2
  • Perkins, 9.3
  • Slowey, 12.1
  • Blackburn, 13.4
  • Boof, 14.4
  • Livan, 16.4

With their best and worst games taken into account, Slowey jumps to the the third most consistent pitcher. This makes sense when you consider that his best game score was an 89, but his worst was a 10. Baker and Perkins continue to be mirrors of each other, just at different levels of performance. Perkins' best and worst games scores are eerily similar to Baker's, just lower (63 and 20 versus 72 and 33 for Baker). Boof and Livan improve, but are still pretty bad.

Looking at the differences between the regular standard deviation and the modified standard deviation gives us a shorthand measure of the consistency of the consistency. While certainly open to interpretation, I think it is really interesting to look at this number in conjunction to the Twins' pitchers, because seems to tell in a nutshell what each pitcher is about. I could be completely wrong about all this, but see what you think:

  • Livan, diff. = 2
  • Blackburn, diff. = 2.6
  • Perkins, diff. = 3.4
  • Boof, diff. = 3.5
  • Baker, diff. = 3.6
  • Slowey, diff. = 7.6

Most of the staff receives the same adjustment, with the major exceptions of Livan and Slowey. Livan has been consistently inconsistent, with lots of good games and lots of bad ones, in relatively equal proportions. Slowey, on the other hand, has had few games extremely divergent from his average. The primary reason for his 7.6 difference is that he has pitched far more games above his average score than bad ones below it. The bad ones he did pitch happened to be pretty awful, which tilted the scores accordingly.

So, in summation, I think each pitcher's game scores correlate pretty well with the overall picture we have of them this year. Factoring in their standard deviations as a measure of their consistency, I'd rank the starting pitchers this way.

  1. Scott Baker, 54.8 ave. GS, 6.2 modified SD (consistency rating)--consistently good and trending upwards
  2. Kevin Slowey, 55 ave. GS, 12.1 SD--most dominant of our pitchers, just needs to continue gaining experience. Greatest potential to become a true ace, even without a supposed "out" pitch.
  3. Nick Blackburn, 49.5 ave. GS, 13.4 SD--Pretty solid thus far, with a slightly higher demonstrated ceiling than...
  4. Glen Perkins, 47.1 ave. GS, 9.3 SD--Perkins is very consistent, but simply hasn't shown to this point that he can be a truly dominant starting pitcher. His best game score of the year is the lowest of all the starters, but the difference between his highest and lowest games scores is second best behind Baker (43 to Baker's 39. Slowey has the greatest discrepancy at 79. Livan, Boof, and Blackburn are at 59, 57, and 56 respectively). Perkins is what Livan should be--a solid innings eater that you know exactly what you're going to get from. A poor man's Scott Baker.
  5. Tie--Boof and Livan. Both have been wildly inconsistent and bad this year. If I had to call it, I'd give the edge to Livan because I do think he's had a positive influence on the younger guys. Livan, in many ways, in the anti-Santana--terrible but great record due to terrific run support.

So what do y'all think?

PS--Just for kicks, here's Matt Garza's line for this year:

  • 53.4 average GS, 15 modified SD (which compares him most closely to Slowey).

4 recs  |  Comment 19 comments

Story-email Email Printer Print

Comments

Display:

its

it’s a little bit over my head, but thanks.

by DedicatedFollowerOfFashion on Jul 8, 2008 5:25 PM EDT reply actions   0 recs

Great work!

Great, excellent work. Very well thought-out and easy to follow. It’s also very telling.

I tend not to lean too much on game score averages when comparing or criticizing pitchers. I think, like you, that there are games that can throw the whole spectrum off. I like what you did, though, in taking the best and worst games off. That is a much better way to compare and see the raw ability and skills of the pitchers.

Great work.

The stat I enjoy using the most is WHIP. I feel it’s underrated as a statistic, and I don’t know why. I think it is one of the best things to use when evaluating a pitcher. Walks plus Hit per Inning Pitched is what it stands for and it basically states how many base runners you allow per inning, on average.

Here are the numbers:

Slowey: 1.042 WHIP
Baker: 1.184 WHIP
Blackburn: 1.309 WHIP
Perkins: 1.479 WHIP
Boof!: 1.500 WHIP
Hernandez: 1.590 WHIP

Again, we find Livan at the bottom of the barrel. Granted, this stat doesn’t account for how many of those runners the pitcher allows across the plate, but I find it handy nevertheless.

I also enjoy looking at ERA+. I don’t want to ramble, so here is the BR definition:

“The ratio of the league’s ERA (adjusted to the pitcher’s ballpark) to that of the pitcher. > 100 is above average and < 100 is below average.”

So basically it compares the pitcher to the league’s average, which is adjusted to the specific ballpark. Let’s plug it in:

Baker – 118 – above average
Slowey – 104 – slightly above average
Blackburn – 104 – slightly above average
Perkins – 92 – slightly below average
Hernandez – 76 – Below average
Boof! – 63 – Way below average

Very interesting, yet again. Baker is a stud, we already knew that. But once again the numbers say that Livan isn’t the ‘ace’ we brought him in to be.

Who would have thunk?

Twins Fix
http://www.twinsfix.com

Bleacher Report Senior Writer:
http://bleacherreport.com/users/4918-Andrew_Kneeland

Check them out!

by Andersklasen on Jul 8, 2008 6:35 PM EDT reply actions   0 recs

WHIP

John Gordon was on a morning show in Rochester today, and he mentioned that WHIP was one of the stats that Terry Ryan gave the most weight to.

He also kept referring to it as Walks & Hits per 9 Innings Pitched, but still claimed that just over 1 was a good value. If there was a guy who did that, it would be pretty darned impressive.

"There are only two things that are infinite, the universe and human stupidity, and I'm not sure about the former." - Albert Einstein

by BeefMaster on Jul 9, 2008 7:00 PM EDT up reply actions   0 recs

John Gordon is getting old.

Twins Fix
http://www.twinsfix.com

by Andersklasen on Jul 9, 2008 7:13 PM EDT up reply actions   0 recs

seriously

John Gordon is none too bright a bulb.

But I like him anyway.

by montanatwinsfan on Jul 9, 2008 8:24 PM EDT up reply actions   0 recs

Excellent work.

Thanks, keep it up!

by Jesse on Jul 8, 2008 6:42 PM EDT reply actions   0 recs

Thanks for the comments

I ran some other pitchers for some comparisons as benchmarks.

Looking at dominant guys this season:
Dan Haren-59.9 ave GS, 8.6 SD (very consistent)
J. Duscherer-61.8 ave GS, 9 SD (also consistently dominant)

I also looked at Johan this year and from 2006:
Johan (Mets, 2008)-57.4 ave GS, 8.8 SD (still reliably good->the losses aren’t his problem)
Johan (Twins, 2006)—63.3 ave GS, 14.1 SD (better, but the downs early in the season hurt his overall consistency score).

Lastly, two other benchmarks:
Greg Maddux, 1995, 69.1 ave GS, 11.7 SD
Carlos Silva, 2006, 35.4 ave GS, 14.7 SD

by biggity2bit on Jul 8, 2008 7:17 PM EDT reply actions   1 recs

Interesting.

Twins Fix
http://www.twinsfix.com

Bleacher Report Senior Writer:
http://bleacherreport.com/users/4918-Andrew_Kneeland

Check them out!

by Andersklasen on Jul 8, 2008 7:51 PM EDT up reply actions   0 recs

Useful information

But, here’s a boulder sized grain of salt:

There’s a big difference between pitching against the Washington Nationals and the Boston Red Sox. In other words, these numbers are relative to who the hurler is facing. In that regard, you would have to factor in some sort of indication as to the quality of team they were facing.

INtuitively, I think Baker is the best pitcher. He’s been out of action some, so his sample size is smaller, but over time, his lack of innings may help him in that his arm has more bullets left in it.

After that, Blackburn has been the most consistent seemingly, while Slowey has pitched some gems while, at the same time, some duds. Not sure how effectively one can smooth out the highs and lows and come up with something numerical that would tell us more than what we can see just from the line score, but it is fun to give it a shot.

by Old Twins Cap on Jul 11, 2008 12:41 PM EDT up reply actions   0 recs

Good idea
There’s a big difference between pitching against the Washington Nationals and the Boston Red Sox. In other words, these numbers are relative to who the hurler is facing. In that regard, you would have to factor in some sort of indication as to the quality of team they were facing.

I think it would be really interesting to track the average game score a team allows – by doing that, it might be possible to normalize game scores compared to the opponents’ average. Of course, you have a bit of a chicken-egg problem with that (a team in a division with lots of great hurlers will have a higher game-score average against), I’d guess it would largely even out based on the quality of the offenses.

"There are only two things that are infinite, the universe and human stupidity, and I'm not sure about the former." - Albert Einstein

by BeefMaster on Jul 11, 2008 4:54 PM EDT up reply actions   0 recs

True

I agree with both of you. I’ve looked at each pitcher’s Leverage Index (a B-R stat that places a numerical valuation-according to the game situation-of each pitch, the overall result being the leverage index). According to that, Joe Nathan pitches in (by far) the highest leverage situations of all the pitches (that is, the outcome of the game is more at stake, pitch to pitch, when he pitches versus when other pitchers pitch). He’s the closer, so it makes sense. Among the starters, Blackburn actually has pitched in more higher leverage situations than any of the other SPs, with Slowey pitching in the least leveraged situations. The other four are all pretty closely grouped together. I haven’t yet had a chance to correlate leverage index scores (or WPA for that matter) with this GS stuff. Kinda makes you wonder a little about Slowey, tho, in that perhaps his high ceiling is more due to playing with high leads or against bad teams, with his clunkers being against good teams. Or perhaps not.

by biggity2bit on Jul 12, 2008 8:52 AM EDT up reply actions   0 recs

Leverage

The leverage stat doesn’t seem like it would be very useful for a starter – if he’s never pitching in high-leverage situations, it could just be because he always happens to get a lot of run support early in the game, or he gets way behind early. For relievers, it provides a picture of how they’re used, but I don’t see much utility in that stat for starting pitchers, since they generally are in there for two-thirds of the game, and the leverage value would be highly dependent on the score, half of which is out of their control.

"There are only two things that are infinite, the universe and human stupidity, and I'm not sure about the former." - Albert Einstein

by BeefMaster on Jul 12, 2008 11:48 AM EDT up reply actions   0 recs

True.

I was kinda thinking the same thing, even while I was looking at it. Either the pitcher could be doing poorly or getting no run support, or vice versa, and either could skew the number. I’d still like a way of knowing, tho, whether a starting pitcher is doing well in a tight game, or inflating their scores against either poor competition or an out of reach game.

by biggity2bit on Jul 12, 2008 8:39 PM EDT up reply actions   0 recs

WPA, maybe?

WPA (Win Percentage Added) – This tracks how much the pitcher’s performance (or, rather, the results of his pitches) affects the likelihood that the team will win. Gleeman has been tracking this on his blog periodically; I’m not sure where his numbers come from.

It actually seems like this might be what you want even more than leverage – WPA would track their performance, in those situations, not just how often they’re in them.

"There are only two things that are infinite, the universe and human stupidity, and I'm not sure about the former." - Albert Einstein

by BeefMaster on Jul 12, 2008 9:44 PM EDT up reply actions   0 recs

Will Young

used to do that every game…alas, he has sadly taken a break from blogging and has not updated in a while. It was a lot of fun while it lasted though…

by clutchhit20 on Jul 13, 2008 4:09 PM EDT up reply actions   0 recs

WPA

fangraphs.com updates WPA live during each game. You can find lots of other stats there as well.

by Hoya on Jul 14, 2008 4:49 PM EDT up reply actions   0 recs

Good

This is good stuff. Thanks for putting this together. I always respect a data driven analysis.

I really like what you’ve done with the modified SDs. Chopping the top and bottom scores helps uncover the “true” pitcher without allowing the outliers to get in the way of your analysis. This has given me a lot to chew on.

However, I would like to pick a nit because I dont’ think that comparing the modified to the raw SD is going to give you much in the way of meaningful interpretation. Really, all it tells us is the extent to which the pitcher was good and bad during the year. The same information could be calculated by simply subracting the pitcher’s top score from their bottom scores. I guess we could interpret this is the “risk/reward” associated with the pitcher (Slowey has a greater chance of throwing a CG Shutout and a greater chance of being trounced for 6 runs in 2 innings).

If we were to interpret the range of scores in this manner, we might as well just use the SD to garner the same information because it’s a more robust statistic.

Finally, I would be very interested to see the mean scores broken out by quartile. I think this would help us break down the pitcher’s potential and raw “stuff”. What I mean is to take each pitchers best games (the top 25%, or roughly 4 starts) and compare them. This gives us an idea of how good each pitcher is when they are “on”.

We could also break down each pitcher’s bottom 25% which would give us an idea of how well they do when they’re pitching poorly. My guess is that Livan would rank poorly in the top 25% stat but rank relatively high in the bottom 25% because he’s been able to mitigate the damage, even when he’s pitching poorly.

What would my life be like without the '91 World Series?

by MJesser on Jul 8, 2008 9:26 PM EDT reply actions   0 recs

Thanks

If I have time, I’ll look into the quartile results. My main interest in looking at the SD’s for each pitcher was simply to get an idea of how consistent or inconsistent they were. It’s seemed to me that thus far this season, Baker has been really great, followed by Slowey and Blackburn. Finding their standard deviation tells me that.
Now, whether or not there’s anything more that can be taken from the SD is a question I simply don’t know. That’s why I like your quartile results—perhaps you can get an accurate picture or prediction of what a given pitcher’s expected ceiling could be.
Lastly, I did think it was interesting that Perkins graded out as the second most consistent pitcher on the staff. He’s really flown under the radar for me this year. I’d love to see him improve a little, as then we’d have four really solid starters, with two (Baker and Slowey) demonstrating strong ace or #2 type starter potential.

by biggity2bit on Jul 9, 2008 11:59 AM EDT up reply actions   0 recs

Excellent work

It really just underscores what most of us thought anyway, but it is very nice to see some good stats attached to those thoughts.

It will be interesting to hear some of the “pro-Livan” fans on this site respond to these numbers.

p.s. – thanks for putting all of those numbers in plain english for idiots like me.

by montanatwinsfan on Jul 8, 2008 10:12 PM EDT reply actions   0 recs

Comments For This Post Are Closed


User Tools

TT is an SB Nation blog of, by and for the fans. We strive to be the best Minnesota Twins blog by providing quality content and analysis, as well as daily news and notes on the team. We hope you'll make Twinkie Town your home for all things Twins!
Start posting about the Twins »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recommended FanPosts

Small
Organization Review (Relief Pitchers)
Small
On Roy, Ramos, and RISK

Recent FanPosts

Small
Josh Johnson
Small
Anybody want to talk revenues?
Joel87bw5_small
Signing up for the Minors
Small
Roy, Ramos, and RISK, Part II
Small
30 Cents on the Dollar = 2B Indifference
P1060527_small
New Uni Thoughts
Small
Minor League Report...November 14, 2009
Pose_small
Prediction Time (My Guess at 2010 Organizational teams)

+ New FanPost All FanPosts >

Twinkie Town On Twitter

SPONSORS


Editor-In-Chief

Twinkietown_small Jesse

Senior Writer

Hrbek_small Jon Marthaler

The_jet_small cmathewson

Gladdentwins_small Adam Peterson

Hosken_powell_autograph_small RandBall's Stu