Project 9
Project description

Use simple linear regression to develop a method of using the area of a city or town in the state of Washington to predict its population. Assess the reliability in practice of the method you find. Also, in this model, what does the size of the residual for a city or town indicate about the population density of that city or town?

Background on the data set

The data consist of the April 1, 2007 populations and land areas of the 281 cities and towns of Washington State, obtained from the 2007 Washington State Data Book.

There are some "incorporated and unincorporated" areas in Washington state (with a total population of approximately 5.9 million in 2000) not included in this list of cities and towns. Also, the website notes that: "Land area by city was derived from a 1980 survey of cities with annexed territory added. Some of the city provided 1980 land areas are not accurate. Some land areas have been corrected. Others will be corrected."

Data Source: Washington State Data Book website, maintained by the Office of Financial Management for the State of Washington. (Accessed August 31, 2008.)

Variables in the data set
The variables in the data set are as follows:
NameUnitsDescription
name(municipality name)name of Washington municipality
populationpeoplenumber of residents
areasquare milesarea of municipality
Link to the data set
The full data set in csv format is at:
http://hoard.projectivespace.com/datasets/WAmunicipalities2007.csv