Data set: introductory statistics class surveys (2002-2008)
Link to the data set
The full data set in CSV format is at:
Background on the data set

Each semester, students in every Introductory Statistics section at the University of Puget Sound are given a survey during the first week of class. The survey is voluntary and is used as an example data set throughout the class.

The data set given here compiles much of the data collected from the Fall 2002 semester through the Spring 2008 semester. Values determined to be incorrect (such as 8-foot-tall students) have been removed from this data set.

Variables in the data set
The variables in the data set are as follows:
Variable | Units or coding | Description
semester | 1=Fall 2002, 2=Spring 2003, 3=Fall 2003, ..., 12=Spring 2008 | course semester
gender | F=female, M=male | student gender
collegeYear | 1=first-year student (freshman), 2=second-year student (sophomore), etc. | year number in college
height | inches | height of student
weight | pounds | student weight
pulse | beats per minute | student pulse at time of survey
hsGPA | traditional 4-point grade scale points | student grade point average in high school
collegeGPA | traditional 4-point grade scale points | student grade point average to date in college
SATM | SAT Mathematics points | student SAT Mathematics score (200-800 range)
SATV | SAT Verbal points | student SAT Verbal score (200-800 range)
shoeSize | US shoe size units | student shoe size
financialAid | N=no, Y=yes | is the student on financial aid for college?
tvHours | hours per week | average number of hours per week spent watching television during the school year
states | states | number of US states the student has been to
siblings | siblings | number of siblings the student has
motherAge | years | age of the student's mother
fatherAge | years | age of the student's father
salary | US dollars | annual salary that the student realistically expects to earn upon graduation
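
As a rough illustration of working with these variables, the Python sketch below decodes the semester codes and reads the numeric columns from CSV text. It rests on several assumptions that are not confirmed by this page: that the CSV header row uses the variable names listed above, that semester codes alternate fall and spring without gaps from 1=Fall 2002 to 12=Spring 2008, and that missing values appear as blank or non-numeric cells. The helper names (semester_label, to_number, read_survey) are hypothetical.

```python
import csv
import io

# Columns treated as numeric; assumed to match the CSV header names.
NUMERIC_COLUMNS = (
    "semester", "collegeYear", "height", "weight", "pulse", "hsGPA",
    "collegeGPA", "SATM", "SATV", "shoeSize", "tvHours", "states",
    "siblings", "motherAge", "fatherAge", "salary",
)


def semester_label(code):
    """Translate a semester code (1-12) into a label such as 'Fall 2002'."""
    if not 1 <= code <= 12:
        raise ValueError(f"semester code out of range: {code}")
    # Assumption: odd codes are fall semesters and even codes are the
    # spring semesters that follow them, starting at 1 = Fall 2002.
    fall_year = 2002 + (code - 1) // 2
    if code % 2 == 1:
        return f"Fall {fall_year}"
    return f"Spring {fall_year + 1}"


def to_number(text):
    """Return text as a float, or None if the cell is missing or non-numeric."""
    if text is None:
        return None
    try:
        return float(text.strip())
    except ValueError:
        return None


def read_survey(stream):
    """Read survey rows from an open CSV text stream into a list of dicts."""
    rows = []
    for raw in csv.DictReader(stream):
        row = dict(raw)
        # Convert only the numeric columns that are present in this file.
        for name in NUMERIC_COLUMNS:
            if name in row:
                row[name] = to_number(row[name])
        rows.append(row)
    return rows
```

With a local copy of the file, the reader could be called as read_survey(open(path, newline="")), where path is wherever the CSV was saved. Coded columns such as gender and financialAid are left as strings.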
Projects using this data set
29: assessing the suitability of predicting year in college with GPA
41: comparing female and male heights among introductory statistics students
42: comparing heights of introductory statistics students by year
71: comparing the proportion of female students in 2002 and 2008
72: confidence interval for high GPA among introductory statistics students
70: confidence interval for the proportion of students on financial aid
64: graphical summaries of the ages of students' mothers
53: investigating relationships between college year and height
43: predicting college GPA
47: regression diagnostics for a model to predict college GPA