<< Chapter < Page Chapter >> Page >

When conducting a hypothesis test that compares two independent population proportions, the following characteristics should be present:

  1. The two independent samples are simple random samples that are independent.
  2. The number of successes is at least five, and the number of failures is at least five, for each of the samples.
  3. Growing literature states that the population must be at least ten or 20 times the size of the sample. This keeps each population from being over-sampled and causing incorrect results.

Comparing two proportions, like comparing two means, is common. If two estimated proportions are different, it may be due to a difference in the populations or it may be due to chance in the sampling. A hypothesis test can help determine if a difference in the estimated proportions reflects a difference in the population proportions.

Like the case of differences in sample means, we construct a sampling distribution for differences in sample proportions: ( p A ' - p B ' ) where p A ' = X A n A and p B ' = X B n B are the sample proportions for the two sets of data in question. X A and X B are the number of successes in each group respectively, and n A and n B are the respective sample sizes from the two groups. Again we go the Central Limit theorem to find the distribution of this sampling distribution for the differences in sample proportions. And again we find that this sampling distribution, like the ones past, are normally distributed as proved by the Central Limit Theorem, as seen in [link] .

Generally, the null hypothesis allows for the test of a difference of a particular value, 𝛿 0 , just as we did for the case of differences in means.

H 0 : p 1 p 2 = 𝛿 0
H 1 : p 1 p 2 𝛿 0

Most common, however, is the test that the two proportions are the same. That is,

H 0 : p A = p B
H a : p A p B

To conduct the test, we use a pooled proportion, p c .

The pooled proportion is calculated as follows:

p c = x A + x B n A + n B

The test statistic ( z -score) is:

Z c = ( p A p B ) δ 0 p c ( 1 p c ) ( 1 n A + 1 n B )

where δ 0 is the hypothesized differences between the two proportions and p c is the pooled variance from formula above.

Two types of medication for poison ivy are being tested to determine if there is a difference in the proportions of adult patient reactions. Twenty out of a random sample of 200 adults given medication A still had poison ivy 30 minutes after taking the medication. Twelve out of another random sample of 200 adults given medication B still had itching 30 minutes after taking the medication. Test at a 10% level of significance.

The problem asks for a difference in proportions, making it a test of two proportions.

Let A and B be the subscripts for medication A and medication B, respectively. Then p A and p B are the population proportions.

Random variable:

P′ A P′ B = difference in the proportions of adult patients who did not react after 30 minutes to medication A and to medication B.

H 0 : p A = p B

H a : p A p B

The words "is a difference" tell you the test is two-tailed.

Distribution for the test: Since this is a test of two binomial population proportions, the distribution is normal:

p c = x A + x B n A + n B = 20 + 12 200 + 200 = 0.08 1 p c = 0.92

( p′ A p′ B ) = 0.04 follows an approximate normal distribution.

Estimated proportion for group A: p A = x A n A = 20 200 = 0.1

Estimated proportion for group B: p B = x B n B = 12 200 = 0.06

The estimated difference between the two groups is : p′ A p′ B = 0.1 – 0.06 = 0.04.

Normal distribution curve of the difference in the percentages of adult patients who don't react to medication A and B after 30 minutes. The mean is equal to zero, and the values -0.04, 0, and 0.04 are labeled on the horizontal axis. Two vertical lines extend from -0.04 and 0.04 to the curve. The region to the left of -0.04 and the region to the right of 0.04 are each shaded to represent 1/2(p-value) = 0.0702.
Z c = ( P′ A P′ B ) δ 0 P c ( 1 P c ) ( 1 n A + 1 n B ) = 0.54

The calculated test statistic is .54 and is not in the tail of the distribution.

Make a decision: Since the calculate test statistic is not in the tail of the distribution we cannot reject H 0 .

Conclusion: At a 1% level of significance, from the sample data, there is not sufficient evidence to conclude that there is a difference in the proportions of adult patients who did not react after 30 minutes to medication A and medication B .

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Introductory statistics. OpenStax CNX. Aug 09, 2016 Download for free at http://legacy.cnx.org/content/col11776/1.26
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introductory statistics' conversation and receive update notifications?

Ask