Part IV
Analyzing Data with Nonlinear Models

In Unit One we began to see the world as data; in Unit Two we began to ask questions of data in order to find out the story it has to tell about itself, and hence about the world from which it was extracted. In Unit Three we began to make connections between sets of data, to see how the events in the situations from which the data were extracted might be related to each other. We began to analyze the relationships between sets of data by capturing those relationships in regression models, simple linear ones at first involving a dependent variable and a single explanatory variable, and then more complex linear ones with a dependent variable and several explanatory variables. This unit investigates one of the four assumptions that underlie regression modeling and at the same time seeking to develop the relationships between even more complex sets of data.

One of the main assumptions about data when you construct a regression model is that the data is sampled from a linear relationship of some sort (either two-variable or more than two-variables). If this is not true, then your resulting regression model may seem to be okay, but it will exhibit problems of one of the following types:

  1. The model may be accurate for only a small slice of data. If we apply the model to data points outside this small slice, the resulting errors from the model may become larger and larger. This is related to having too small a sample of the data to notice that it really does not exhibit linearity.
  2. The regression model consistently underestimates the data in certain regions and consistently overestimates it in other regions. This resulting pattern indicates that there is a better model for the data than a linear model.

In chapter 11 we begin dealing with data that is not proportional, that is, data that violates our first regression assumption that a linear model is an appropriate fit. We will start by focusing on two-variable data and then learn how to extend this to multivariable data. Even though most real data sets are multi-dimensional, there are solid reasons for beginning our study with two-variable nonlinear data sets:

In much the same way that straight lines have parameters that can be chosen so as to match the line closely to the data, the basic nonlinear models we will introduce have parameters that can serve the same purpose. By using these parameters to shift one of the basic models horizontally and vertically and to stretch them and flip it, we can fit this basic function to a non-proportional data set.

However, the regression routines in most software are only useful for producing linear models. In fact, we overcome this problem by transforming nonlinear data so that it becomes suitably linear and then applying our regression model to this straightened out data. Thus, chapter 12 presents the key transformations that will convert many kinds of nonlinear data into linear data. This chapter also teaches us how to evaluate the quality of models built from transformed data and then how to interpret these models. The unit closes with chapter 13 on interpreting the relationships in nonlinear models with more than one variable. We also discuss how to locate the maxima and minima of such functions.

11 Nonlinear Models Through Graphs
 11.1 What if the Data is Not Proportional
 11.2 Transformations of Graphs
 11.3 Homework
 11.4 Memo Problem: DataCon Contract
12 Modeling with Nonlinear Data
 12.1 Non-proportional Regression Models
 12.2 Interpreting a Non-proportional Model
 12.3 Homework
 12.4 Memo Problem: Insurance Costs
13 Multivariate Nonlinear Models
 13.1 Models with Numerical Interaction Terms
 13.2 Interpreting Quadratic Models in Several Variables
 13.3 Homework
 13.4 Memo Problem: Revenue Projections