Handbook of Regression Analysis With Applications in R. Samprit Chatterjee

Handbook of Regression Analysis With Applications in R - Samprit  Chatterjee


Скачать книгу
alt="images"/>‐statistics, it does not have a large effect on overall measures of fit like the overall images‐test or images, since adding unneeded variables (whether or not they are collinear with predictors already in the model) cannot increase the residual sum of squares (it can only decrease it or leave it roughly the same).

Image described by caption.

      Another problem with collinearity comes from attempting to use a fitted regression model for prediction. As was noted in Chapter 1, simple models tend to forecast better than more complex ones, since they make fewer assumptions about what the future will look like. If a model exhibiting collinearity is used for future prediction, the implicit assumption is that the relationships among the predicting variables, as well as their relationship with the target variable, remain the same in the future. This is less likely to be true if the predicting variables are collinear.

      How can collinearity be diagnosed? The two‐predictor model

equation

      provides some guidance. It can be shown that in this case

equation

      and

equation
images Variance inflation
images images
images images
images images
images images
images images
images images
images images
images images
images images
images images

      A diagnostic to determine this in general is the variance inflation factor (images) for each predicting variable, which is defined as

equation

      where images is the images of the regression of the variable images on the other predicting variables. images gives the proportional increase in the variance of images compared to what it would have been if the predicting variables had been uncorrelated. There are no formal cutoffs as to what constitutes a large images, but collinearity is generally not a problem if the observed images satisfies

Скачать книгу