Application and Reasoning Problems
9.3. Ms. Carrie Allover needs more information about the model we developed to predict the
number of weekly riders on her commuter rail system. The model equation is in example 1. Recall
that it predicts the number of weekly riders based on population, price per ride, parking rates, and
disposable income. Ms. Allover wants more explanation of what the equation means. She has asked
some very specific questions about the situation.
- Based on the model equation, which of the following will have the largest impact on
the number of weekly riders: an increase of 10,000 people in the region, a ten cent drop
in the price per ticket, a ten cent raise in parking rates, or a $100 decrease in average
disposable income? Explain your answer.
- Demographics experts suggest that the population will drop by 10% next year. The
model predicts that this will change the number of weekly riders. Ms. Allover wants
to ensure that the revenue (=price per ticket * number of tickets sold) remains about
the same for next year as it is for this year. In order to accomplish this, the price per
ticket will have to change. Should the ticket price be raised or lowered? By how much?
Use the regression model and your software to help answer this.
9.4. The data file C09 Homes.xls [.rda] contains data on 271 homes sold in a three-month period
in 2001 in the greater Rochester, NY area. A realtor has enlisted your help to develop a regression
model in order to explain which characteristics of a home influence its price. You are going to build
the regression model by adding one variable at a time and removing variables that do not seem to
be significant. At each stage of the model building process, record the equation of the model, the
R2, the adjusted R2 and the standard error of estimate. You should record all of this
information in a table like the one below in order to make it easier to compare the
results.
- Introduce a new variable for the age of the home. To do this, add a new column heading
”Age” in cell M3. In cell M4, enter the formula ”=2003 - H4” in order to calculate the
age of the home based on the year in which it was built (H4). Copy this formula to all
the cells in the column.
- Develop a series of models to predict the price of the home by adding one variable at a
time. Add them in this order: Size, Baths, Age, Acres, Rooms, and Taxes. Make sure
that each model includes all of the previous variables. (The second model will include
size and baths as explanatory variables; the third will include size, baths, and age.)
Record the model equation and the summary measures indicated in the table below.
- What do you expect to happen to each of the summary measures as you add more
variables into the model? What actually happens each time? What do the differences
tell you about some of the variables?
- Based on your observations of the summary measures eliminate the variable or variables
that you feel are not helpful in predicting the price of a home. Using the remaining
variables, develop your ”best regression model” and compare it to the others you have
developed.
Sample Table for Recording the Housing Models in Problem 2
|
|
|
|
|
Variable Added | Model equation | R2 | Adj. R2 | S
e |
|
|
|
|
|
Size | | | | |
|
|
|
|
|
Baths | | | | |
|
|
|
|
|
Age | | | | |
|
|
|
|
|
Acres | | | | |
|
|
|
|
|
Rooms | | | | |
|
|
|
|
|
Taxes | | | | |
|
|
|
|
|
Best Model | | | | |
|
|
|
|
|
|