View Full Version : Baseball Statistics Glossary
Skipper Steve
11-11-2001, 09:55 PM
As requested in one of our threads, I've added a Stat Glossary to NetShrine. It can be found at:
http://www.netshrine.com/statglossary.html
Please check it out - - if you noticed anything that I got wrong (very possible! I did this quickly) or if you have something that you think should be added, post it in this thread. Thanks!
Skipper Steve
11-11-2001, 09:57 PM
BTW, yes, I know that not every formula is shown and some of the definitions are broad - - - file this effort under "something is better than nothing."
Skipper Steve
11-12-2001, 06:34 PM
No comments? :confuse2:
BuzzBuzzard
11-12-2001, 07:12 PM
I didn't get very far before being presented with a question. I had never really looked at AERA before. Based upon your example, does that imply that in 1999, Pedro was 2.4 times better than average? If so, what exactly does that tell you?
Nice work on the compilation of information. I am sure to have more questions as I work through it.:thumb:
Skipper Steve
11-12-2001, 10:40 PM
Originally posted by BuzzBuzzard
Based upon your example, does that imply that in 1999, Pedro was 2.4 times better than average? If so, what exactly does that tell you?
Nice work on the compilation of information. I am sure to have more questions as I work through it.:thumb:
Thanks Buzz - appreciate the look and the kind words.
Pedro? Hmmm - - let me put it this way, if his 1999 season was a "test" where a grade of 65 was passing, he got a 156. Not too shabby. ;)
timconnelly
11-25-2001, 01:54 AM
I thought I might give a very detailed explanation on how some of the sabermetric formulas work and more importantly, why they work.
Runs Created is a formula that Bill James worked out a number of years ago. There are a number of variations to it as it can be fine tuned to adjust for some of the extra statistical information that is not available from one year to another. But it is a far more reliable method of measuring offensive productivity than batting average, slugging average, on base average, or any other regular statistical method using even the simplest method of figuring it.
Before I give you the formula, the weakness of batting average is that it fails to take into account the amount of power that a hitter has, plus it ignores walks. Slugging average measures power but fails to address the batters ability to get on base. And on base average measures the ability to get on base by looking at walks but fails to look at power.
The purpose of the batter is to create runs for his team. So why won't RBI and runs scored adequately measure that? The answer is that both stats are team dependent. If you have a lot of runners on base, your chances for a high RBI total is greatly enhanced. With good batters batting behind you, your chances of scoring runs is greatly enhanced. So you are only measuring the productivity of the hitter a little bit .
But there is still another problem with RBI and runs scored: They fail to measure important offensive contributions that often lead to runs. The single that moves the runner from 1st to 3rd gets you nothing. The walk that loads the bases will only count if you happen to score. The problem with that is that these things have offensive value that simply isn't measured.
Runs created solves this whole problem as it looks at the 3 most important offensive things in just the right proportion: 1) Times reached base 2) The amount of power that the batter hit with 3) The number of opportunities that the batter had.
1) Take the number of times the batter got on base. So you are counting walks, HBP, and base hits.
2) Multiply that by the number of total bases. So the more power the batter had, the higher this total will be.
3) Take this number and divide it by plate appearances. This is at bats plus walks plus HBP.
A more complex Runs Created formula takes stolen bases, caught stealing, grounded into double plays, and other similar things, and factors it in.
This will give you an accurate estimation of the number of runs that this would create for a team! How do we know that it works? We can use the formula on a whole team and it will accurately estimate the number of runs that a team would score from that set of offensive factors. We are only breaking it down individually when we turn from a team to a player.
Ok, so now we have the number of runs the player created: Runs Created per game is a way of breaking that down to approximate how many runs that would add up to per 27 outs.
So you take the # of at bats and subtract base hits: That will give you the number of outs. (Again I am simplifying it, you can add grounded into double plays and caught stealing when available)
You then take the total # of outs and divide it by 27. That gives you the figure for exactly how many games the players outs added up to.
Divide the runs created by the total above and that will give you runs created per 27 outs.
There are 2 ways of looking at this: One is that it is what a team made up of players with exactly this offensive productivity would score if the entire line up was this player. But it is actually what that player is producing spread around through the entire season! Barry Bonds was actually creating more than 18 runs for his team for every 27 outs he was making!
But this still has the problem of measuring a raw total that fails to take into account the value those numbers have. Averaging 6 runs per game when the league average is 4 runs a game makes you incredibly valuable, average 6 runs a game when the league average is 6 runs makes you average.
A famous mathematician named Pythagoras came up with a famous formula. Amazingly, it works incredibly well with wins and losses relative to runs scored and runs allowed.
You take the runs created and square it and then divide this by (runs created squared plus the league average of runs squared)
Actually, Lee uses 1.83 rather than squared.
This will give you the offensive winning percentage. It is an accurate reflection of what that set of offensive contributions would mean to a team in terms of helping them to win.
That is pretty long and I'm not sure anybody will get through this. But maybe that'll help somebody a bit.
Yogi#8Fan
11-25-2001, 05:44 AM
I don't happen to see "WHIP" that you guys use all the time. I don't know all of Lee's terms that are used but are all of them in that html file?
For terms like "ASB", is that your own? The one about "BK" seemed like it was straight out of the MLB terms (see the "Lingo" thread in the "History" section).
Skipper Steve
11-25-2001, 08:50 AM
None of these are my own.
"WHIP" is more of a rotisserie term than a sabermetric. It's walks (W) plus hits (H) divided by innings pitched (IP). The closest thing in the glossary to it is:
Baserunners Per Nine Innings [BR/9]
The total number of batters reaching base against a pitcher divided by the number of innings pitched and multiplied by nine. It measures how many batters reach base on a per game basis against a pitcher.
A league best figure for this category is typically between 9 and 10.
Yogi#8Fan
11-25-2001, 01:06 PM
Presuming this is an appropriate place to discuss this (if not, please let me know), if a pitcher "scatters" 9 hits over 7 innings, gave up 5 walks but the opposing team never took advantage of this, would the pitcher who never got hurt by these hits but got the win (perhaps a 4-2 score) be seen as less capable due to a higher stat?
In both WHIP and BR/9, a pitcher's measured on how many people got on base (measured in one way or another). I guess this is like the WHIP vs ERA, but if the pitcher never let those runners hurt him, especially if no one ever got past 2B or even got that far, I'd say this pitcher controlled the situation.
Yogi#8Fan
11-25-2001, 01:45 PM
I'm trying to find out all of them and their definitions. Is this available here, or are they on Lee's site?
Skipper Steve
11-25-2001, 04:07 PM
Originally posted by Yogi#8Fan
seen as less capable due to a higher stat?
I would say that a pitcher with a high WHIP would be viewed as not having great stuff.
As far as Lee's stats, if it's not in the Stat Glossary here, let me know and I will get you the answer.
Yogi#8Fan
11-26-2001, 02:34 PM
I didn't see WHIP in your glossary. If you can let me know where it and all the other sabermetric terms and definitions are, please do so.
Skipper Steve
11-26-2001, 03:52 PM
Originally posted by Yogi#8Fan
I didn't see WHIP in your glossary. If you can let me know where it and all the other sabermetric terms and definitions are, please do so.
Re-read this entire thread.
Yogi#8Fan
11-27-2001, 12:51 AM
:read:. I simply asked where terms like "WHIP" are all found. You'd mentioned it being a rotisserie term. All I asked for was a link, not having to re-read the entire thread. Link, please, if possible.
Skipper Steve
11-27-2001, 09:26 AM
Not possible Yog.
Crash Course
08-18-2005, 09:14 PM
FYI, today, I just got this e-mail:
To Steve Lombardi:
Ron Shandler of BaseballHQ.com here. I am writing in regards to some
content appearing on your Web site, Netshrine.com, in the Baseball
Statistics Glossary (http://www.netshrine.com/statglossary.html). It
appears that there is material on this page that was copied directly from
my book, Baseball Forecaster, and/or my Web site, BaseballHQ.com.
Given that there is a copyright issue at stake here, my initial reaction
was to require that you take all of the material from my book down.
However, as you'll see below, I believe there's a better way to straighten
this out.
First, some background: I (actually, my company, Shandler Enterprises, LLC)
own the complete copyright to the content of the Baseball Forecaster and of
BaseballHQ.com. This includes the copyright to the actual text as well as
the gauges and terms I created, which I also own trademarks for. These
include such gauges as Base Performance Value (both batting and pitching
versions), Base Performance Index, Base Performance Indicators and Strand
Rate, each of which appears in your glossary.
I have to be careful in not allowing anyone to reproduce my copyright
property without my express permission, and careful to not allow anyone to
use my trademarks in a way that does not reflect that the related services
(the statistical gauges) originate with me. This is because, if I do not
defend my intellectual property (these copyrights and trademarks), it can
be deemed abandoned due to unauthorized use by others. If that happens, all
of my hard work and investment could be lost.
While, by virtue of my intellectual property rights, I could require you to
take all of my material down, I don't want to do so. Netshrine is an
important site, and your glossary serves an important purpose in the
sabermetric community.
So here is what I ask:
First, for the entries in your glossary that were taken from the Baseball
Forecaster/BaseballHQ.com but for gauges that I did NOT originally create
-- essentially those in which blocks of text were copied straight from my
products -- I ask you to reword your definitions so as to paraphrase the
language. These are items like batting eye and command ratio. I own the
copyright to the way I word things, so adequate paraphrasing of these
definitions should protect my copyright while allowing you to use the
definition you seek. I have not done the legwork to identify each of these
copied definitions, so I ask you to do so (seems fair as you'll be the one
benefiting from posting them).
Second, for each of the statistical gauges I listed above that are my
creations, will you please add an attribution that these gauges are the
work of Ron Shandler of BaseballHQ.com. You have attributed other gauges to
authors such as Bill James and Pete Palmer so I expect that this should not
be an issue with mine.
Provided you include this notation for each of my gauges, you do not need
to reword them or remove them.
To be clear, the permission I grant above is only for your glossary and
only for use on NetShrine.com. I do not grant permission to use the listed
gauges in providing baseball statistics or analysis or in specific fantasy
baseball applications. Also, my permission is contingent upon NetShrine.com
not becoming a provider of baseball statistics or individual player
analysis, and upon it not becoming an operator within the fantasy baseball
industry.
I look forward to your response, and take care.
Ron
Crash Course
08-18-2005, 09:18 PM
To which, I replied:
Ron - thanks for your note. And, sorry for any inconveniences which this issue may have caused for you. Trust me, none of it was intentional. Also, thanks for your offer to try and square this away in a manner that could benefit both of us.
I do have a problem on my side - the Stats Glossary on NetShrine.com was created in early November 2001. And, for the most part, as far as I can recall, it hasn't been touched since then - maybe sans adding Win Shares or something.
When I put it together, in what is now almost now four years ago, I tried to use whatever public sources were available to help populate it. But, that was a long time ago, and, if you were to put a gun to my head now and say "Tell me where you found what or you're dead," then I would have to say "Go ahead and pull the trigger" because I could never go back now and say "X" was from "Y" etc.
Therefore, for me to do what you have requested, it would be impossible for me to do that from memory - and would take hours and hours to try and google back on terms to see which may have come from a public website somewhere, etc.
So in the end, if I have violated something somewhere in terms of a copyright, etc., the only solution that I have is to take it down. I should have that done within a week's time - I would like to have a few days to work up something for the link to point to, now that I have to take it down.
Can you grant me a week?
Please let me know - thanks, Steve Lombardi
Crash Course
08-18-2005, 09:19 PM
I share this just to provide you all with a heads-up that the glossary will be going away in a week or so.
If you know of any good glossaries online, that I can point to, in place of the NetShrine one, please let me know. Thanks, Steve
Deep Blue
08-19-2005, 12:55 PM
The whole thing is a bummer - sorry to hear about that.
One portion of my job here has become dealing with copyrights and patents with our software development efforts - definitely not fun.
Crash Course
08-19-2005, 01:22 PM
Thanks it's no sweat.
When I did the glossary four years ago, there were not many around - that's why I did it - because many had asked for it. But, these days, there are probably a half-dozen other on-line stat glossaries, if not more. No one will even miss the one on NetShrine.com - this is not a big issue in my eyes.
mainsr
08-20-2005, 07:21 AM
FWIW, I think Shandler handled himself pretty well here. He led with a potential solution, not lawyers. I don't think what he suggested was that onerous--I'll take your word that it would've been a pain to line up your definitions against his and re-work the ones that were the same, but it seemed like a reasonable solution.
Full disclosure - I am a subscriber to Shandler's BaseballHQ.com website. I am also a strong supporter of intellectual property rights and a strong opponent of lawyers.
Crash Course
08-20-2005, 04:52 PM
I have no beef with Ron.
And, don't let mainsr fool you - I know he has good friends that are lawyers.
mainsr
08-20-2005, 09:13 PM
And, don't let mainsr fool you - I know he has good friends that are lawyers.
Touche, dude. Friends and the love of my life as well (although she isn't a member of the NY Bar). But that doesn't man I don't dislike them generically.
Crash Course
08-20-2005, 10:27 PM
Wait a minute, Mrs. mainsr is a lawyer and a Red Sox fan? You sick dude.
Crash Course
11-07-2005, 11:01 AM
Wow - I was just reading Bronx Banter at baseballtoaster.com and they had a link for RSAA and when I clicked it, it had the old glossary
http://penasonline.com/Mymvpsoftball/Copy%20of%20Statistic%20Glossary.htm
looks like this guy did a file, save as, on the old one and put it up on his site.
stealing on the internet, go figure!
mainsr
11-07-2005, 01:08 PM
Imitation is the sincerest form of plagiarism.
Crash Course
11-07-2005, 01:23 PM
:loud:
vBulletin v3.5.4, Copyright ©2000-2013, Jelsoft Enterprises Ltd.