100 Interesting Data Sets for Statistics

Sat, 06/07/2014 - 08:05 -- rprice

An archive of 216,930 past Jeopardy questions.

The last words of every Texas inmate executed since 1984.

Data on prisoners, including information about “their current offense and sentence, criminal history, family background and personal characteristics, prior drug and alcohol use and treatment programs, gun possession and use, and prison activities, programs, and services.”

The Enron corpus. It contains more than half a million emails from about 150 users, mostly senior management of Enron, organized into folders. Wikipedia calls it “unique in that it is one of the only publicly available mass collections of ‘real’ emails easily available for study.”

The top 2.5 million Reddit posts, collected and placed on GitHub.

10,000 annotated images of cats.