<< Chapter < Page | Chapter >> Page > |
When you perform a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis H _{0} and the decision to reject or not. The outcomes are summarized in the following table:
STATISTICAL DECISION | H _{0} IS ACTUALLY... | |
---|---|---|
True | False | |
Cannot reject H _{0} | Correct Outcome | Type II error |
Cannot accept H _{0} | Type I Error | Correct Outcome |
The four possible outcomes in the table are:
Each of the errors occurs with a particular probability. The Greek letters α and β represent the probabilities.
α = probability of a Type I error = P (Type I error) = probability of rejecting the null hypothesis when the null hypothesis is true.
β = probability of a Type II error = P (Type II error) = probability of not rejecting the null hypothesis when the null hypothesis is false.
α and β should be as small as possible because they are probabilities of errors.
Statistics allows us to set the probability that we are making a Type I error. The probability of making a Type I error is α. Recall that the confidence intervals in the last unit were set by choosing a value called Z _{α} (or t _{α} ) and the alpha value determined the confidence level of the estimate because it was the probability of the interval capturing the true mean (or proportion parameter p). This alpha and that one are the same.
The easiest way to see the relationship between the alpha error and the level of confidence is with the following figure.
In the center of [link] is a normally distributed probability distribution marked Ho. This is a sampling distribution of $\stackrel{\u2013}{X}$ and by the Central Limit Theorem it is normally distributed. The distribution in the center is marked H _{0} and represents the distribution for the null hypotheses H _{0} : µ = 100. This is the value that is being tested. The formal statements of the null and alternative hypotheses are listed below the figure.
The distributions on either side of the H _{0} distribution represent distributions that would be true if H _{0} is false, under the alternative hypothesis listed as H _{a} . We do not know which is true, and will never know. There are, in fact, an infinite number of distributions from which the data could have been drawn if H _{a} is true, but only two of them are on [link] representing all of the others.
Notification Switch
Would you like to follow the 'Introductory statistics' conversation and receive update notifications?