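of the confidence interval from the t-version of the central limit theorem, where $(m_Y - \mathrm{E}(Y))/(s_Y/\sqrt{n})$ has an approximate t-distribution with $n-1$ degrees of freedom. In particular, suppose that we want to calculate a 95% confidence interval for the population mean, $\mathrm{E}(Y)$, for the home prices example; in other words, an interval such that there will be an area of 0.95 between the two endpoints of the interval (and an area of 0.025 to the left of the interval in the lower tail, and an area of 0.025 to the right of the interval in the upper tail). Let us consider just one side of the interval first. Since 2.045 is the 97.5th percentile of the t-distribution with 29 degrees of freedom (see the t-table in Section 1.4.2), then

$$
\begin{aligned}
\Pr(t_{29} < 2.045) &= 0.975 \\
\Pr\!\left(\frac{m_Y - \mathrm{E}(Y)}{s_Y/\sqrt{n}} < 2.045\right) &= 0.975 \\
\Pr\!\left(m_Y - \mathrm{E}(Y) < 2.045\,(s_Y/\sqrt{n})\right) &= 0.975 \\
\Pr\!\left(-\mathrm{E}(Y) < -m_Y + 2.045\,(s_Y/\sqrt{n})\right) &= 0.975 \\
\Pr\!\left(\mathrm{E}(Y) > m_Y - 2.045\,(s_Y/\sqrt{n})\right) &= 0.975
\end{aligned}
$$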
The difference from earlier calculations is that this time $\mathrm{E}(Y)$ is the focus of inference, so we have not assumed that we know its value. One consequence for the probability calculation is that in the fourth line we have "$-\mathrm{E}(Y)$." To change this to "$+\mathrm{E}(Y)$" in the fifth line, we multiply each side of the inequality sign by "$-1$" (this also has the effect of changing the direction of the inequality sign).
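This probability statement must be true for all potential values of $m_Y$ and $s_Y$. In particular, it must be true for our observed sample statistics, $m_Y$ and $s_Y$, from the sample of $n = 30$ home prices. Thus, to find the values of $\mathrm{E}(Y)$ that satisfy the probability statement, we plug in our sample statistics to find

$$
\begin{aligned}
\Pr\!\left(\mathrm{E}(Y) > m_Y - 2.045\,(s_Y/\sqrt{30})\right) &= 0.975 \\
\Pr\!\left(\mathrm{E}(Y) > m_Y - 20.1115\right) &= 0.975
\end{aligned}
$$

This shows that a population mean greater than $m_Y - 20.1115$ would satisfy the expression $\Pr(t_{29} < 2.045) = 0.975$. In other words, we have found that the lower bound of our confidence interval is $m_Y - 20.1115$, evaluated at the observed sample mean. The value 20.1115 in this calculation is the margin of error.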
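To find the upper bound, we perform a similar calculation:

$$
\begin{aligned}
\Pr(t_{29} > -2.045) &= 0.975 \\
\Pr\!\left(\frac{m_Y - \mathrm{E}(Y)}{s_Y/\sqrt{n}} > -2.045\right) &= 0.975 \\
\Pr\!\left(m_Y - \mathrm{E}(Y) > -2.045\,(s_Y/\sqrt{n})\right) &= 0.975 \\
\Pr\!\left(-\mathrm{E}(Y) > -m_Y - 2.045\,(s_Y/\sqrt{n})\right) &= 0.975 \\
\Pr\!\left(\mathrm{E}(Y) < m_Y + 2.045\,(s_Y/\sqrt{n})\right) &= 0.975
\end{aligned}
$$

To find the values of $\mathrm{E}(Y)$ that satisfy this expression, we plug in our sample statistics to find

$$
\begin{aligned}
\Pr\!\left(\mathrm{E}(Y) < m_Y + 2.045\,(s_Y/\sqrt{30})\right) &= 0.975 \\
\Pr\!\left(\mathrm{E}(Y) < m_Y + 20.1115\right) &= 0.975
\end{aligned}
$$

This shows that a population mean less than $m_Y + 20.1115$ would satisfy the expression $\Pr(t_{29} > -2.045) = 0.975$. In other words, we have found that the upper bound of our confidence interval is $m_Y + 20.1115$, evaluated at the observed sample mean. Again, the value 20.1115 in this calculation is the margin of error.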
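We can write these two calculations a little more concisely as

$$
\begin{aligned}
\Pr(-2.045 < t_{29} < 2.045) &= 0.95 \\
\Pr\!\left(-2.045 < \frac{m_Y - \mathrm{E}(Y)}{s_Y/\sqrt{n}} < 2.045\right) &= 0.95 \\
\Pr\!\left(-2.045\,(s_Y/\sqrt{n}) < m_Y - \mathrm{E}(Y) < 2.045\,(s_Y/\sqrt{n})\right) &= 0.95 \\
\Pr\!\left(m_Y - 2.045\,(s_Y/\sqrt{n}) < \mathrm{E}(Y) < m_Y + 2.045\,(s_Y/\sqrt{n})\right) &= 0.95
\end{aligned}
$$

As before, we plug in our sample statistics to find the values of $\mathrm{E}(Y)$ that satisfy this expression:

$$
\begin{aligned}
\Pr\!\left(m_Y - 2.045\,(s_Y/\sqrt{30}) < \mathrm{E}(Y) < m_Y + 2.045\,(s_Y/\sqrt{30})\right) &= 0.95 \\
\Pr\!\left(m_Y - 20.1115 < \mathrm{E}(Y) < m_Y + 20.1115\right) &= 0.95
\end{aligned}
$$

This shows that a population mean between $m_Y - 20.1115$ and $m_Y + 20.1115$ would satisfy the expression $\Pr(-2.045 < t_{29} < 2.045) = 0.95$. In other words, we have found that a 95% confidence interval for $\mathrm{E}(Y)$ for this example is $(m_Y - 20.1115,\; m_Y + 20.1115)$, evaluated at the observed sample mean. It is traditional to write confidence intervals with the lower number on the left.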
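The arithmetic behind this interval can be checked with a short Python sketch. The percentile 2.045, the 29 degrees of freedom (so a sample size of 30), and the margin of error 20.1115 come from the text. The sample mean and sample standard deviation are not restated in this passage, so the values of `M_Y` and `S_Y` below are assumptions for illustration only; `S_Y` was chosen so that the computed margin of error agrees with 20.1115. The function name `t_interval` is likewise hypothetical.

```python
import math


def t_interval(sample_mean, sample_sd, n, t_percentile):
    """Return (lower, upper, margin) for sample_mean +/- t_percentile * sample_sd / sqrt(n)."""
    margin = t_percentile * sample_sd / math.sqrt(n)
    return sample_mean - margin, sample_mean + margin, margin


# 97.5th percentile of the t-distribution with 29 degrees of freedom (from the text).
T_975_DF29 = 2.045
N = 30  # 29 degrees of freedom corresponds to a sample size of 30

# ASSUMED sample statistics (not given in this passage).
# S_Y is chosen so that the margin of error matches the 20.1115 quoted in the text.
M_Y = 278.6033
S_Y = 53.8656

lower, upper, margin = t_interval(M_Y, S_Y, N, T_975_DF29)
print(f"margin of error: {margin:.4f}")
print(f"95% confidence interval: ({lower:.3f}, {upper:.3f})")
```

Keeping the percentile as an argument means the same function can serve for other confidence levels once the appropriate percentile has been looked up.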
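More generally, using symbols, a 95% confidence interval for a univariate population mean, $\mathrm{E}(Y)$, results from the following:

$$
\begin{aligned}
\Pr\!\left(-\text{97.5th percentile} < t_{n-1} < \text{97.5th percentile}\right) &= 0.95 \\
\Pr\!\left(-\text{97.5th percentile} < \frac{m_Y - \mathrm{E}(Y)}{s_Y/\sqrt{n}} < \text{97.5th percentile}\right) &= 0.95 \\
\Pr\!\left(m_Y - \text{97.5th percentile}\,(s_Y/\sqrt{n}) < \mathrm{E}(Y) < m_Y + \text{97.5th percentile}\,(s_Y/\sqrt{n})\right) &= 0.95
\end{aligned}
$$

where the 97.5th percentile comes from the t-distribution with $n-1$ degrees of freedom. In other words, plugging in our observed sample statistics, $m_Y$ and $s_Y$, we can write the 95% confidence interval as $m_Y \pm \text{97.5th percentile}\,(s_Y/\sqrt{n})$. In this expression, $\text{97.5th percentile}\,(s_Y/\sqrt{n})$ is the margin of error.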
For a lower or higher level of confidence than 95%, the percentile used in the calculation must be changed as appropriate. For example, for a 90% interval (i.e., with 5% in each tail), the 95th percentile would be needed, whereas for a 99% interval (i.e., with 0.5% in each tail), the 99.5th percentile would be needed. These percentiles can be obtained from the table “Univariate Data” in Notation and Formulas (which is an expanded version of the table in Section 1.4.2). Instructions for using the table can be found in Notation and Formulas.
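As an alternative to the printed table, the percentiles for other confidence levels can be approximated numerically. The following is a minimal sketch using only the Python standard library: it integrates the t-density with Simpson's rule and inverts the result by bisection. The function names (`t_pdf`, `t_cdf`, `t_percentile`), the step count, and the search bracket are all choices made for this illustration, not something taken from the text, and a statistics library would normally be preferred in practice.

```python
import math


def t_pdf(x, df):
    """Density of the t-distribution with df degrees of freedom."""
    log_c = (math.lgamma((df + 1) / 2) - math.lgamma(df / 2)
             - 0.5 * math.log(df * math.pi))
    return math.exp(log_c - (df + 1) / 2 * math.log1p(x * x / df))


def t_cdf(x, df, steps=2000):
    """Area to the left of x, using Simpson's rule on [0, |x|] and symmetry."""
    a = abs(x)
    if a == 0:
        return 0.5
    h = a / steps  # steps must be even for Simpson's rule
    total = t_pdf(0.0, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(i * h, df)
    area = total * h / 3
    return 0.5 + area if x > 0 else 0.5 - area


def t_percentile(p, df):
    """Approximate the value with area p to its left, by bisection on [-1000, 1000]."""
    lo, hi = -1000.0, 1000.0
    for _ in range(60):
        mid = (lo + hi) / 2
        if t_cdf(mid, df) < p:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


# Percentiles needed for 90%, 95%, and 99% intervals with 29 degrees of freedom.
for level, p in [(90, 0.95), (95, 0.975), (99, 0.995)]:
    print(f"{level}% interval: use the {100 * p:g}th percentile = {t_percentile(p, 29):.3f}")
```

For 29 degrees of freedom, the 97.5th percentile produced this way should agree with the 2.045 used above to three decimal places.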
Thus, in general, we can write a confidence interval for a univariate mean, $\mathrm{E}(Y)$, as

$$
m_Y \pm \text{t-percentile}\,(s_Y/\sqrt{n})
$$