Twinkie Town: An SB Nation Community

Navigation: Jump to content areas:


Sports blogs for fans, by fans.
New Blog: Sounder At Heart for Seattle Sounders Fans!

The Twins and Bill James' Pythagorean Theorem

Joe Posnanski thinks that Ron Gardenhire is the best manager in baseball.  He gives a few reasons for thinking this, and all of those reasons are pretty easily refuted, but one in particular got me to thinking. He mentions that the Twins under Ron Gardenhire have consistently outperformed their Pythagorean record, as given by Bill James' Pythagorean formula for expected winning percentage. He is absolutely right. Over 6 full seasons, Ron Gardenhire's teams have won, on average, three more games than James' formula would suggest. Before we read too much into this trend, let's think a bit more about this formula.

The following observations about this formula are important:

1. There is no theoretical reason why expected winning percentage should be equal to RS^2 / (RS^2 + RA^2).

2. The formula will have downward biased estimates for teams which score very few runs (e.g. the Twins).

3. The formula is known to perform poorly in the presence of very good bullpens, and is also known to be biased towards the mean (so that really good teams will have lower predicted winning percentages, while really bad teams will have higher winning percentages as predicted by the formula).

By (1) I mean that this is simply a statistic that James stumbled upon which fit the data reasonably well. This is not a description of the data generating process. As an example of (2), consider a team which scores 800 runs and gives up zero. This team's expected winning percentage, according to the Bill James' formula, will be .500. That's a pretty poor prediction. I do not know the particular reasons for (3), but they are known issues, and are especially relevant to Ron Gardenhire's tenure with the Twins since the ball club actually won fewer games than predicted in 2007 and 2005, the two losing seasons.

So is beating Bill James an indicator of managerial excellence? It doesn't look that way. If you examine the performance of the Twins versus their "expected" performance over the past 20 years, you will notice some autocorrelation (meaning that beating the formula one year makes the team more likely to beat it the next year). This probably has a lot to do with the fact that a team with a strong bullpen one year tends to have a strong on the next year. You'll also notice a lot of negative differentials during losing seasons, and positive differentials (meaning that the team won more than expected) during winning seasons (an example of the formula being biased towards the mean). And you'll notice that the differential is largest in 2002 when the Twins won 94 games and the division despite scoring only 768 runs, the fewest they have scored while winning the division during Ron Gardenhire's tenure (an example of the formula being biased against teams that score very few runs while giving up even fewer).

This formula is, in my opinion, quite useless anyways since there is no theoretical justification for its use. If a team does not live up to their Pythagorean expectation, then this is probably because of a problem with the formula and does not necessarily mean the team's luck is about to turn. In the case of the Twins, the formula may be worse than useless. It is actually quite misleading. Unless the Twins start to score and give up more runs (which I don't see happening given all of the quality young pitching and anemic young hitting), or see their bullpen become mediocre (despite losing Neshek, I think we will still have an above average bullpen this year due to Nathan, depth, Rick Anderson and the fact that we have a long line of potential major league starters in the minors right now) then don't expect the Twins to live up to their Pythagorean expectation.

As of tonight the Twins are 29-27, with 261 runs scored and 269 runs against. The next time someone says "but their Pythagorean record is actually 27-29", tell them to go to hell.

PS: As a side note, Posnanski also mentioned that under Gardy the Twins have won 56% of their one-run ball games. This is not very impressive considering that during all 6 years the bullpen has been pretty lights out, and the Twins have won 55% of their games overall. I need to think about this issue a lot more, but I think that the strong bullpen might mean that they actually should have won more than 56% of those games.

0 recs  |  Comment 14 comments

Story-email Email Printer Print

Comments

Display:

Hmm

Hmmm, I don’t think Pythagrean records are perfect, but I think they are useful.

Especially since the Twins are right around .500, where the formula is more accurate.

I think we can see right where the Twins have won those couple of extra games. The Twins are 12-10 in one run games and 15-19 in all other games. It is unusualy for a team to be so much better in those close games than they are in normal games. It is even more odd when you consider that the Twins don’t really have a very good bullpen (or fielding) this year, like they’ve had in years past, which is a possible explanation for success in close games.

I would say the Twins are a couple games above where they should be at this point for all that, but it isn’t so much that it’s all that big a deal or any statistical outlier.

"You can't sit on a lead and run a few plays into the line and just kill the clock. You've got to throw the ball over the damn plate and give the other man his chance. That's why baseball is the greatest game of them all."
~ Earl Weaver
"In God we trust. All others must provide evidence."
~ Billy Beane

by AdamOnFirst on Jun 2, 2008 1:48 AM EDT reply actions   0 recs

.500?

First, I think a team that scores 800 and gives up zero: 8002 / (0 + 8002) = 1.00, which is intuitive.

But, on the whole I think Aaron is right. Why isn’t the Formula [RS / (RS + RA)] or [RS3 / (RS3 + RA^3)]. I suspect the answer is because it happens to fit better. Because this doesn’t contain any connection between the winning/losing process and the formula, there is no reason to read into it, unless you are trying to figure out why it works.

This is akin to noting that Mars tends to orbit the earth, except that it changes direction for a short period of time every now and then. You notice that it happens to be in July (hypothetically), and decide that the war god of Mars hates American Independence, and therefore turns around out of anger every July. It makes sense, but there’s not reason to believe it to be true.

Instead, you should use the deviations (like the twins or july) to examine the things that cause teams to deviate. If you were designing a team based on runs scored and runs allowed per dollar of payroll, you would want to see which teams win more relative to that predicted by the formula. You would notice that teams with winning records do better, but that probably doesn’t help you. Then you would notice that strong bullpens increase the chance of winning relative to the simple pythagorean formula. This may indicate that each bullpen run prevented is worth more dollars compared to starting rotation, and would cause you to refine you’re formula based on starting and relieving runs.

Or, that’s what you should do.

Lastly, I don’t think I believe that records in one run games matter. I think they are the exciting (devastating?) ones to watch, but if you score 5 in the ninth, coming from 6 runs down, I don’t think it means that the loss was any more meaningful than losing by 6. I just don’t believe that teams/players are that different in how they perform in different innings, or based on how close the score is. If anything, one run games come down to how good your best reliever is. Our record in one run games is good, because Joe Nathan is good, and he tends to pitch in one run games. If you believe that one run games are important, than you should also subscribe to the theory that closers many times more important than the other relievers. Many people do, but I’m not convinced.

by snolls on Jun 2, 2008 9:00 AM EDT reply actions   0 recs

not .500

also, I am terrible at math.

http://noblingblings.blogspot.com/

by Aaron Fix on Jun 2, 2008 9:13 PM EDT up reply actions   0 recs

Theoretical basis

It is not correct that the Pythagorean Formula does not have a theoretical explanation. It is found right here:
sjmiller/math/papers/PythagWonLoss_Paper.pdf” target=”_blank”>http://www.math.brown.edu/sjmiller/math/papers/PythagWonLoss_Paper.pdf

By assuming that runs scored and runs allowed are independent for each team, and that runs follow a so-called Weibull distribution, it is fairly easy to derive the formula. As the above-mentioned paper shows, these assumptions are pretty reasonable according to the data.

Actually, the best-fit exponent in the formula is somewhere around 1.79 to 1.82 instead of two.

It is true, however, that originally Bill James devised the formula empirically.

by PhoenixV on Jun 2, 2008 2:52 PM EDT reply actions   0 recs

Link problem

Hm, my link wasn’t posted correctly. Try this instead:
http://www.tinyurl.dk/3756

by PhoenixV on Jun 2, 2008 2:54 PM EDT up reply actions   0 recs

Theoretical explainations

That does not match my criteria for a theoretical explanation. You can look at the data and see patterns and derive formulas that work most of the time. But a theoretical explanation requires cause and effect. No formula derived merely from data analysis can tell you why the formula works most of the time. Nor can it help to explain those cases where it does not work. For that, you need a real theory. If you can give me a link for the theory of baseball that explains why teams tend towards their Pythagorean records, I’d greatly appreciate it.

"You're thinking too much. Just have fun." -- Bennie "The Jet" Rodriguez in Sandlot

by cmathewson on Jun 2, 2008 3:20 PM EDT reply actions   0 recs

Guess you don't believe in gravity, then, either...

...given that Newton had no theory of gravity in mind when he derived the Universal Theorem of Gravitation, nor did Einstein when he developed general and special relativity.

Science is full of formulaic descriptions which ‘fit the observed data’ even in the absence of an explanation of why the data exists as it does. One could make a functional definition of science that has nothing whatsoever to do with ‘cause and effect’ explanations, but instead deal with deriving mathematical functions that describe and effectively predict the phenomena of the universe.

I have bigger issues with the original essay, but not enough time right now to address them—that’ll come later.

by dwintheiser on Jun 2, 2008 5:09 PM EDT up reply actions   0 recs

Um, not to digress too much

But Newton did have a theory of gravity that was based not just on variables and constants, but a basic force in the Universe. Einstein’s theory explains the basic force in terms of mass and energy, rather than just mass. But nobody would say that the Theory of Relativity was just an arbitrary formula that happens to work in every instance except for the subatomic.

If someone could prove to me that a Weibull distribution is a basic pattern in the Universe that is found in galaxies and atoms and molecules and nebulae, I would be more convinced that the Pythagorean is based in a theory as opposed to a statistical model.

I suppose this is all an argument on the semantics or ontology of science, and, thus, completely irrelevant to this site. So I’ll stop now.

"You're thinking too much. Just have fun." -- Bennie "The Jet" Rodriguez in Sandlot

by cmathewson on Jun 2, 2008 5:31 PM EDT up reply actions   0 recs

You're right...

...this is getting into areas that are off-topic for this site. However, I wanted to point out one other thing: that thing about Newton’s theory being based on a basic force in the Universe? That wasn’t Newton, that was your junior high science teacher talking. Here’s what Newton himself wrote about the theoretical underpinnings of his theory:

“I have not yet been able to discover the cause of these properties of gravity from phenomena and I feign no hypotheses… It is enough that gravity does really exist and acts according to the laws I have explained, and that it abundantly serves to account for all the motions of celestial bodies. That one body may act upon another at a distance through a vacuum without the mediation of anything else, by and through which their action and force may be conveyed from one another, is to me so great an absurdity that, I believe, no man who has in philosophic matters a competent faculty of thinking could ever fall into it.”

The Pythagorean distribution describes, to a surprising degree of accuracy, the relationship between runs and wins in baseball (and in football, and basketball, and other sports....). As pointed out by many who have studied it, it actually does a better job of predicting future performance than it does in estimating current performance.

But you’re right—this is getting off-topic, so I’ll stop now.

by dwintheiser on Jun 2, 2008 11:01 PM EDT up reply actions   0 recs

Theory is theory

As the paper shows, the formula is not merely derived from “data analysis”. It is derived from the assumption that run scoring follows a Weibull distribution – it is first later that statistical methods are employed to test the goodness of fit and the validity of the assumptions. This is no different, e.g., than stating that errors are normally distributed in physics.

I agree that we do not have a micro theory of baseball, just like we don’t have micro theory of many other things in this world which are still being analyzed (as dwintheiser states). However, the Weibull distribution is a very generic distribution, which fits a lot if the parameters are chosen wisely. Thus it isn’t strange that we have such accordance between theory and reality in this case.

by PhoenixV on Jun 2, 2008 5:13 PM EDT up reply actions   0 recs

thanks for the article

I was completely unaware of it (obviously) and it’s not the first case where something that seems to fit the data empirically is later justified theoretically. I haven’t decided if it makes me like the formula any better.

I also realize that the formula is not biased against teams which score/give up fewer runs (this will teach me not to write fanposts at 1 AM anymore). So I guess the good bullpens argument is the most reasonable explanation for the Twins outperforming their Pythagorean expectation.

NOTE: the problem with the formula is that it assumes that RS and RA are the only two things that affect expected winning percentage. The fact that the formula systematically performs poorly in the presence of a good bullpen means that this is not the case. I guess a paper that incorporates bullpen quality into a formula could be someone’s dissertation topic.

http://noblingblings.blogspot.com/

by Aaron Fix on Jun 2, 2008 9:31 PM EDT up reply actions   0 recs

I'm such a dork

But this is a great thread. First, thanks Phoenix for providing the link. I did not know that it was based on a specific distribution, which I do find interesting. However, the problems that still exist are that a) there is no reason to believe that weibull is appropriate, unless someone has shown that it provides a good fit (maybe it does), and b) the squared term is arbitrary. It turns out that a better fit is ^1.82.

Newton is a perfect example, however, about the value and danger in creating these models. It is important to understand the difference between empirical evidence and understanding the process (cause/effect). If you like those stories that Newton got hit on the head with an apple, that is a great example of emprical evidence that gravity exists. A better example is Newtons basic laws of motion (f=m*a). This was very interesting, and created a new way to understand the process of motion. However, his ideas about how motion works are based on recognizing a pattern. When Einstein (et al) started researching higher velocity physics, they discovered that Newton’s formula no longer applied. It takes more force to effect the same accelaration, given a higher present velocity. This led to the discovery of relativity (still baffles me).

Bill James has a very good reason to propose the pythagorean formula. He noticed that it seems to be a good fit. He probably backed into the weibull distribution, or tested a few distributions before deciding that Weibull had a better fit. Why couldn’t it be any other common distribution, or another distribution that is too mathematically complicated for people to actually use?

The point is that it is very cool that this fits well. Further research will likely show a number of modifications that would improve the model (who wants to be Einstein to Bill James’ Newton?)

Aaron’s suggested dissertation is a great idea. There was an Actuary (don’t remember his name) who quit his job with an insurance company to start doing fantasy baseball projections. All the stats geeks out there can take a minute to imagine that as a full time profession. The dissertation gets tricky though, as there are really so many underlying issues you have to test. First is modeling the correlation of consecutive pitches, at-bats, innings etc. Once you do that, i suggest researching the relative importance of different factors like closers, starters/relief, number of pitches thrown/taken, slugging percentage, home runs, using generalized linear modeling. Anybody in a grad-school program that would fund them for this research? If so, I’ll apply there to help out.

by snolls on Jun 2, 2008 11:41 PM EDT reply actions   0 recs

relief pitching

It makes intuitive sense that good relief pitching would help you beat your Pythagorean projections, because some runs clearly have more influence on the outcome of a game than others. Giving up 5 runs when you’re already down five doesn’t hurt as much as giving up one when you’re tied in the ninth.

Maybe one way to tweak the Pythagorean formula would be to work in Win Probability Added numbers. So instead of just comparing runs scored to runs allowed, you could weight runs allowed according to situational WPA. That is, you could multiply runs allowed times some coefficient representing how high-leverage the situation was. So runs scored in a blow out would be multiplied by some number less than one, and runs allowed close and late would be multiplied by some number higher than one. You’d add together the resulting numbers, instead of just runs scored, before popping them into the formula.

Or you could just tweak the total runs scored by some factor representing bullpen quality before applying the formula. If you could figure out the influence of the bullpen somehow -that is, if you could document that teams with 5% above average bullpens, or closers, or whatever, typically outperform their pythagorean by 2%, say - then maybe you could tweak the pythagorean projection with some standardized bullpen correction.

I don’t know any of the math for this but it seems like bullpen quality and the deviation from the pythagorean projection are quantifiable and so could be correlated, at least hypothetically.

by by jiminy on Jun 3, 2008 11:08 AM EDT reply actions   0 recs

runs allowed, that is

sorry for alternating between “runs scored” and “runs allowed”—I should have just said “runs allowed” since I was talking about pitching stats

by by jiminy on Jun 3, 2008 11:10 AM EDT reply actions   0 recs

Comments For This Post Are Closed


User Tools

TT is an SB Nation blog of, by and for the fans. We strive to be the best Minnesota Twins blog by providing quality content and analysis, as well as daily news and notes on the team. We hope you'll make Twinkie Town your home for all things Twins!
Start posting about the Twins »

Join SB Nation and dive into communities focused on all your favorite teams.

FanPosts

Community blog posts and discussion.

Recent FanPosts

Small
Implications of Picking up Cuddyers Option and Trading for Hardy
Small
Organization Review (Center fielders)
Small
Minor League Report...November 7, 2009
Joemauerjesus_small
"I am going to create a lot of RBIs for Morneau, the catcher and the other guy."
Small
Kent Hrbek as a Rookie
Joel87bw5_small
And NOW THE OFFSEASON BEGINS!
Fitch372aurora_073_small
2010 Free Agents
Small
Organization Review (Corner Outfielders)
Small
Minor League Report...October 31, 2009
Small
Gomez vs. Young

+ New FanPost All FanPosts >

Twinkie Town On Twitter

SPONSORS


Editor-In-Chief

Twinkietown_small Jesse

Senior Writer

Hrbek_small Jon Marthaler

The_jet_small cmathewson

Gladdentwins_small Adam Peterson

Hosken_powell_autograph_small RandBall's Stu