Project 11
Project description

One way to deal with outliers in a data set is to compare models fit both with and without the outliers. (It might even be better to do this with individual outliers, rather than the whole collection at once, but we will ignore such complications here.) After removing all outliers from the data set, conduct the same analysis as in Project 9. Then compare the two analyses. (This sort of comparison could then help you decide which model to use, depending on what you intended to do with the model.)
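
The with-and-without comparison can be sketched in a few lines of Python. Project 9's analysis is not reproduced in this description, so the sketch below assumes a simple least-squares line relating two numeric variables, and it assumes outliers are flagged with the common 1.5 x IQR rule. Both choices, and the function names fit_line, iqr_outlier_mask and compare_fits, are illustrative assumptions rather than the method the project prescribes.

```python
from statistics import quantiles


def fit_line(xs, ys):
    """Ordinary least squares for y = a + b*x; returns (intercept, slope)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return mean_y - slope * mean_x, slope


def iqr_outlier_mask(values, k=1.5):
    """Flag values outside [Q1 - k*IQR, Q3 + k*IQR] (an assumed outlier rule)."""
    q1, _, q3 = quantiles(values, n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [v < low or v > high for v in values]


def compare_fits(xs, ys):
    """Fit the line on all points, then again without the flagged points."""
    # A point is dropped if it is flagged on either variable.
    flagged = [fx or fy
               for fx, fy in zip(iqr_outlier_mask(xs), iqr_outlier_mask(ys))]
    kept = [(x, y) for x, y, f in zip(xs, ys, flagged) if not f]
    kept_x, kept_y = zip(*kept)
    return {
        "all": fit_line(xs, ys),
        "trimmed": fit_line(kept_x, kept_y),
        "removed": sum(flagged),
    }
```

Calling compare_fits(area, population) on two equal-length lists returns both (intercept, slope) pairs along with the number of points removed, which makes it easy to see how much the outliers move the fitted line.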

Background on the data set

The data consist of the April 1, 2007 populations and land areas of the 281 cities and towns of Washington State, obtained from the 2007 Washington State Data Book.

There are some "incorporated and unincorporated" areas in Washington state (with a total population of approximately 5.9 million in 2000) not included in this list of cities and towns. Also, the website notes that: "Land area by city was derived from a 1980 survey of cities with annexed territory added. Some of the city provided 1980 land areas are not accurate. Some land areas have been corrected. Others will be corrected."

Data Source: Washington State Data Book website, maintained by the Office of Financial Management for the State of Washington. (Accessed August 31, 2008.)

Variables in the data set
The variables in the data set are as follows:
Name        Units                 Description
name        (municipality name)   name of Washington municipality
population  people                number of residents
area        square miles          area of municipality
Link to the data set
The full data set in csv format is at:
http://hoard.projectivespace.com/datasets/WAmunicipalities2007.csv
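
A minimal Python sketch for reading the file follows. It assumes the CSV has a header row whose column names match the variable names listed above (name, population, area); that layout has not been verified against the actual file. The helper names parse_municipalities and fetch_csv are hypothetical, and the two sample rows are made-up placeholders, not real municipalities.

```python
import csv
import io
import urllib.request


def parse_municipalities(csv_text):
    """Parse CSV text into a list of dicts with typed fields.

    Assumes a header row with the columns name, population, area.
    """
    rows = []
    for record in csv.DictReader(io.StringIO(csv_text)):
        rows.append({
            "name": record["name"].strip(),
            # int(float(...)) tolerates counts written as "1000.0"
            "population": int(float(record["population"])),
            "area": float(record["area"]),
        })
    return rows


def fetch_csv(url):
    """Download a CSV file and return its contents as text (assumes UTF-8)."""
    with urllib.request.urlopen(url) as response:
        return response.read().decode("utf-8")


# Made-up placeholder rows, used only to show the expected layout.
sample = "name,population,area\nExampleville,1000,2.5\nSampleton,250,0.75\n"
towns = parse_municipalities(sample)
```

To work with the real data, pass the text returned by fetch_csv for the URL above to parse_municipalities; if the header names in the file differ, adjust the dictionary keys accordingly.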