Handbook of Regression Analysis With Applications in R. Samprit Chatterjee

Handbook of Regression Analysis With Applications in R - Samprit  Chatterjee


Скачать книгу
of images and images. This subset model is

equation

      with images. This equality condition is called a linear restriction, because it defines a linear condition on the parameters of the regression model (that is, it only involves additions, subtractions, and equalities of coefficients and constants).

      The question about whether the total SAT score is sufficient to predict grade point average can be stated using a hypothesis test about this linear restriction. As always, the null hypothesis gets the benefit of the doubt; in this case, that is the simpler restricted (subset) model that the sum of images and images is adequate, since it says that only one predictor is needed, rather than two. The alternative hypothesis is the unrestricted full model (with no conditions on images). That is,

equation

      versus

equation

      These hypotheses are tested using a partial images‐test. The images‐statistic has the form

      An alternative form for the images‐test above might make clearer what is going on here:

equation

      That is, if the strength of the fit of the full model (measured by images) isn't much larger than that of the subset model, the images‐statistic is small, and we do not reject the subset model; if, on the other hand, the difference in images values is large (implying that the fit of the full model is noticeably stronger), we do reject the subset model in favor of the full model.

      The images‐statistic to test the overall significance of the regression is a special case of this construction (with restriction images), as is each of the individual images‐statistics that test the significance of any variable (with restriction images). In the latter case images.

      

      2.2.2 COLLINEARITY

equation

      in the top plot to

equation

      in the bottom plot; a small change in only one data point causes a major change in the estimated regression function.