Are there clutch hitters in baseball? The file gives performance data for all major
league hitters for the 1999 season (the most recent for which I can obtain the
information). The data file contains each player's overall record (atbats and hits) as
well as performance in clutch situations (defined two ways: "runners in scoring
position" and "late and close").
Annual records (since 1901) for every major league baseball team, giving data for home
runs, runs scored, team batting average, and team winning percentage. Do these processes
appear to be "in control"?
Data from Statistics for Experimenters, by Box, Hunter, & Hunter. Results from
an industrial experiment. Used to illustrate several approaches to analyzing data,
in chapters 2 and 3 of that book.
Data on percent bodyfat measurements for a sample of 252 men, along with various
measurements of body size. Percent body fat is normally measured by weighing the
person underwater - a cumbersome procedure. The goal is a regression model that will
allow accurate estimation of percent body fat, given easily obtainable body measurements.
The data were taken from the Journal of Statistics Education website.
Data from a (not particularly "perfect") in-class survey, intended to
illustrate some basic principles in survey coding, data cleaning, and analysis.
Can stock market professionals really do any better than simply throwing darts at pages of stock
market listings? The Wall Street Journal ran a monthly contest on this for several years.
These data are the results of their contests.
Death rates (by state) from various causes, as well as information on related factors
(such as age distribution and urbanization). From the Statistical Abstract of
the United States.
Ten years (2000-2009) of daily closing prices for the Dow Jones Industrial
Average. The data were obtained from the Yahoo! Finance historical stock
Twenty years (1991-2011) of monthly closing prices for the Dow Jones Industrial Average and
the Standard and Poor's 500 stock market indexes. The data were obtained from Yahoo! Finance's
historical stock quotations page.
Winning numbers from the Florida Lottery (through the end of 2008), as reported by the
state lottery commission. Note that from its inception through 23 October 1999, the lottery
was "pick 6 of 49." Thereafter, it was a "pick 6 of 53" lottery.
Winning time of the Kentucky Derby since its inception in 1875. The file gives the year, name
of the winning horse, and winning time in minutes and seconds.
Data sets used in the paper "Explaining Success in Baseball: The Local Correlation
Approach," by Hamrick and Rasp, under review by the Journal of Quantitative Analysis
Results from the NCAA Division I Regional Baseball Tournament, for the years 2003 through 2008.
Data from the 1998 through 2000 seasons of the National Football League, giving year and week of
the game, home and visiting teams, score, and pointspread. The same data for the 2001 through
2008 seasons, minus the pointspread data.
A small subset of data from the National Longitudinal Youth Survey.
Data on body temperature, gender, and heart rate. Taken from the article "A Critical Appraisal
of 98.6 Degrees F ...," by Mackowiak et al., in the Journal of the American
Medical Association (vol. 268, pp. 1578-80, 1992).
Air quality measurements on 41 U.S. cities. Data obtained from A Handbook of Small Data Sets,
edited by D.J. Hand, et al.
Data for 97 countries, on birth and death rates, infant mortality rates, life expectancies, and
per capita GDP. Obtained from the Journal of Statistics Education online data archive.
Original source was 1990 United Nations data.
Results for Post-World War II presidential elections, giving the total number of votes cast for
all candidates as well as the number received by the two major political parties.
Data for a mock "case." The Sirius Cybernetics Corporation is bidding on a contract to
supply parts to a defense contractor.
Data from the first (and last) voyage of the Titanic.
Population of the United States, in each decennial census.