In Unit One we began to see the world as data; in Unit Two we began to ask questions of data in order to find out the story it has to tell about itself, and hence about the world from which it was extracted. In Unit Three we began to make connections between sets of data, to see how the events in the situations from which the data were extracted might be related to each other. We began to analyze the relationships between sets of data by capturing those relationships in regression models, simple linear ones at first involving a dependent variable and a single explanatory variable, and then more complex linear ones with a dependent variable and several explanatory variables. This unit investigates one of the four assumptions that underlie regression modeling and at the same time seeking to develop the relationships between even more complex sets of data.
One of the main assumptions about data when you construct a regression model is that the data is sampled from a linear relationship of some sort (either two-variable or more than two-variables). If this is not true, then your resulting regression model may seem to be okay, but it will exhibit problems of one of the following types:
In chapter 11 we begin dealing with data that is not proportional, that is, data that violates our first regression assumption that a linear model is an appropriate fit. We will start by focusing on two-variable data and then learn how to extend this to multivariable data. Even though most real data sets are multi-dimensional, there are solid reasons for beginning our study with two-variable nonlinear data sets:
In much the same way that straight lines have parameters that can be chosen so as to match the line closely to the data, the basic nonlinear models we will introduce have parameters that can serve the same purpose. By using these parameters to shift one of the basic models horizontally and vertically and to stretch them and flip it, we can fit this basic function to a non-proportional data set.
However, the regression routines in most software are only useful for producing linear models. In fact, we overcome this problem by transforming nonlinear data so that it becomes suitably linear and then applying our regression model to this straightened out data. Thus, chapter 12 presents the key transformations that will convert many kinds of nonlinear data into linear data. This chapter also teaches us how to evaluate the quality of models built from transformed data and then how to interpret these models. The unit closes with chapter 13 on interpreting the relationships in nonlinear models with more than one variable. We also discuss how to locate the maxima and minima of such functions.