Project 32
Project description

In this project, use the model that you set up in Project 9 and used again in Projects 19 and 25. Try to find a transformation under which a simple linear regression model that predicts population from city or town area is more suitable than the untransformed model of population on area. (Of course, look for the most suitable model that you can find.)
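As a starting point, the comparison between an untransformed fit and a transformed fit might be sketched as follows. The log-log transformation used here is only one candidate and is an assumption of this sketch, not the required answer; the helper names `fit_line` and `fit_log_log` are hypothetical, and the numbers are made up for illustration rather than taken from the data set.

```python
import math


def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x.

    Returns (intercept, slope, r_squared).
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    syy = sum((yi - mean_y) ** 2 for yi in y)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return intercept, slope, r_squared


def fit_log_log(area, population):
    """Fit log(population) = a + b * log(area).

    Assumes every area and population value is strictly positive,
    since the natural log is undefined otherwise.
    """
    log_area = [math.log(a) for a in area]
    log_population = [math.log(p) for p in population]
    return fit_line(log_area, log_population)


# Made-up illustrative values, NOT taken from the Washington data set:
# population triples each time area doubles.
area = [0.5, 1.0, 2.0, 4.0, 8.0]
population = [300.0, 900.0, 2700.0, 8100.0, 24300.0]

raw_fit = fit_line(area, population)     # population on area
log_fit = fit_log_log(area, population)  # log(population) on log(area)
```

Because the two fits are on different response scales, their R-squared values are not a complete basis for choosing between them; residual plots and the other diagnostics from the earlier projects are still needed to judge which model is more suitable.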

Background on the data set

The data consist of the April 1, 2007 populations and land areas of the 281 cities and towns of Washington State, obtained from the 2007 Washington State Data Book.

Some "incorporated and unincorporated" areas in Washington State (which had a total population of approximately 5.9 million in 2000) are not included in this list of cities and towns. Also, the website notes: "Land area by city was derived from a 1980 survey of cities with annexed territory added. Some of the city provided 1980 land areas are not accurate. Some land areas have been corrected. Others will be corrected."

Data Source: Washington State Data Book website, maintained by the Office of Financial Management for the State of Washington. (Accessed August 31, 2008.)

Variables in the data set
The variables in the data set are as follows:
Name        Units                 Description
name        (municipality name)   name of Washington municipality
population  people                number of residents
area        square miles          area of municipality
Link to the data set
The full data set in csv format is at:
http://hoard.projectivespace.com/datasets/WAmunicipalities2007.csv
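One way to read the file is sketched below, using only the Python standard library. It assumes that the csv file has a header row whose column names match the variable names listed above (`name`, `population`, `area`) and that the file is UTF-8 encoded; neither assumption is confirmed here. The function names `parse_municipalities` and `download_municipalities` are hypothetical, and the sample row is invented for illustration.

```python
import csv
import io
import urllib.request

DATA_URL = "http://hoard.projectivespace.com/datasets/WAmunicipalities2007.csv"


def parse_municipalities(text):
    """Parse csv text into a list of dicts with numeric population and area.

    Assumes a header row with the columns name, population, area.
    """
    rows = []
    for record in csv.DictReader(io.StringIO(text)):
        rows.append({
            "name": record["name"],
            "population": float(record["population"]),
            "area": float(record["area"]),
        })
    return rows


def download_municipalities(url=DATA_URL):
    """Fetch the csv file over HTTP and parse it (UTF-8 encoding assumed)."""
    with urllib.request.urlopen(url) as response:
        return parse_municipalities(response.read().decode("utf-8"))


# Invented sample row, used only to show the expected shape of the result.
sample = "name,population,area\nExampleville,1000,2.5\n"
rows = parse_municipalities(sample)
```

Calling `download_municipalities()` would fetch the full file; if the actual header differs from the assumed one, the lookups in `parse_municipalities` raise a `KeyError`, which makes the mismatch easy to spot.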