<< Chapter < Page Chapter >> Page >
Calculations for determining the required sample sized when calculation a confidence interval for the population mean or population proportion.

Determining sample size required to estimate μ

Prior to creating a confidence interval a sample must be taken. Often the number of data values needed in a sample to obtain a particular level of confidence within a given error needs to be determined prior to taking the sample. If the sample is too small the result may not be useful and if the sample is too big both time and money are wasted in the sampling.

Let’s begin by looking at the equation for the Error Bound.

EBM = Z α 2 σ n size 12{ ital "EBM"=Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {}

To affect the size of the error, what can be changed in the formula?

Only Z α 2 size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } {} , by changing the level of confidence, or n by changing the sample size affects the error. The standard deviation is a given which one can not change. (Note if n>30 then the sample standard deviation can be used to approximate the population standard deviation.)

Given the following data: n = 64, x ¯ size 12{ {overline {x}} } {} = 36, and σ = 3, the EBM for a 80%, 90%, 95% and 99% confidence interval are:

80%: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 1.28 3 64 size 12{ { {3} over { sqrt {"64"} } } } {} = 0.48

90%: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 1.645 3 64 size 12{ { {3} over { sqrt {"64"} } } } {} = 0.616875

95%: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 1.96 3 64 size 12{ { {3} over { sqrt {"64"} } } } {} = 0.735

99%: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 2.58 3 64 size 12{ { {3} over { sqrt {"64"} } } } {} = 0.9675

Note that as the confidence increases, so also does the EBM. To ensure that the error bound is small, the confidence must be decreased. Hence changing the confidence to lower the error is not a practical solution.

What happens as the sample size is increased? Calculate the EBM for a 90% confidence interval for n = 25, 64, and 100.

n = 25: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 1.645 3 25 size 12{ { {3} over { sqrt {"25"} } } } {} = 0987

n = 64: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 1.645 3 64 size 12{ { {3} over { sqrt {"64"} } } } {} =0.616875

n = 100: EBM = Z α 2 σ n size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } } {} = 1.645 3 100 size 12{ { {3} over { sqrt {"100"} } } } {} = 0.4935

As the sample size increases, the EBM decreases. The question now becomes how large a sample is needed for a particular error?

Begin by solving the equation for the EBM in terms of n.

EBM = Z α 2 σ n n = Z α 2 σ EBM 2 size 12{ ital "EBM"=Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } { {σ} over { sqrt {n} } } drarrow n= left ( { {Z rSub { { size 6{α} } wideslash { size 6{2} } } size 12{σ}} over { ital "EBM"} } right ) rSup {2} } {}
  • Where Z size 12{Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } {} = the critical z score based on the desired confidence level
  • EBM = desired margin of error
  • σ = population standard deviation

Often the population standard deviation is unknown; hence the sample standard deviation from a previous sample of size greater than 30 may be used as an approximation to σ.

The value found by using the formula for sample size is generally not a whole number. However the sample size must be a whole number, so ALWAYS ROUND UP to the next larger whole number.

Suppose for the given information, we want to be 90% confident with an error of only ±0.5, how large should n be?

Now substitute the given values:

n = Z α 2 σ EBM 2 = 1 . 645 ( 3 ) 0 . 5 2 = 9 . 87 2 = 97 . 4169 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } σ} over { ital "EBM"} } right ) rSup {2} size 12{ {}= left ( { {1 "." "645" \( 3 \) } over {0 "." 5} } right ) rSup {2} } size 12{ {}=9 "." "87" rSup {2} } size 12{ {}="97" "." "4169"}} {}

Since you can not sample less than a whole this value n = 98.

How large a sample would you need if you wanted to be 95% confident with an error of ± 0.25?

n = Z α 2 σ EBM 2 = 1.96 ( 3 ) 0.25 2 = 23.52 2 554.19 n = 554 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } σ} over { ital "EBM"} } right ) rSup {2} size 12{ {}= left ( { {1 "." "96" \( 3 \) } over {0 "." "25"} } right ) rSup {2} } size 12{ {}="23" "." "52" rSup {2} } size 12{ approx "553" "." "19" drarrow n="554"}} {}

Suppose the scores on a statistics final are normally distributed with a standard deviation of 10 points. You have been asked to construct a 95% confidence interval with an error of no more than 2 points.

Z 0 . 025 = 1 . 645 size 12{Z rSub { size 8{0 "." "025"} } =1 "." "645"} {}

EMB = 2

σ = 10

n = Z α 2 σ EBM 2 = 1 . 645 ( 10 ) 2 2 = ( 8 . 225 ) 2 = 67 . 65 rounded up =>n = 68 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } σ} over { ital "EBM"} } right ) rSup {2} size 12{ {}= left ( { {1 "." "645" \( "10" \) } over {2} } right ) rSup {2} } size 12{ {}"= " \( 8 "." "225" \) "2 = 67" "." "65 rounded up =>n = 68"}} {} .

Hence, a sample of size 68 must be taken to create a 95% confidence interval with an error of no more than two points.

Suppose you want to be 98% confident with an error of no more than 1.5 points, how large must your sample be?

Z 0 . 01 = 2 . 33 size 12{Z rSub { size 8{0 "." "01"} } =2 "." "33"} {}

EMB = 1.5

σ = 10

n = Z α 2 σ EBM 2 = 2 . 33 ( 10 ) 1 . 5 2 = ( 15 . 533 ) 2 = 241 . 28 rounded up => n = 242 . size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } σ} over { ital "EBM"} } right ) rSup {2} size 12{ {}= left ( { {2 "." "33" \( "10" \) } over {1 "." 5} } right ) rSup {2} } size 12{ {}= \( "15" "." "533" \) "2 "=" 241" "." "28 rounded up ""=>"" n "=" 242" "." }} {}

Hence, a sample of size 242 must be taken to create a 98% confidence interval with an error of no more than 1.5 points.

Determining sample size required to estimate p.

Again, let’s look at the equation for the Error Bound.

{} EBP = Z α 2 p ' q ' n size 12{ ital "EBP"=Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } sqrt { { {p'q'} over {n} } } } {}

  • where p’ = x n size 12{ { {x} over {n} } } {} is the point estimate for the true population proportion
  • x = the number of successes
  • n = the sample size
  • q’ = 1 – p’

Solving for n, we obtain: n = Z α 2 EBP 2 p ' q ' size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'}} {}

Suppose that a previous study claimed that only 25% of people recycle on a regular basis. Determine the sample size needed to create a 95% confidence interval for the true population proportion with an error of only +/- 3%.

p’ = 0.25 =>q’ = 1 – p’ = 0.75

n = Z α 2 EBP 2 p ' q ' = 1 . 96 0 . 03 2 ( 0 . 25 ) ( 0 . 75 ) = 800 . 33 . size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'= left ( { {1 "." "96"} over {0 "." "03"} } right ) rSup {2} } size 12{ \( 0 "." "25" \) \( 0 "." "75" \) ="800" "." "33" "." }} {}

AWAYS ROUND UP, hence n = 801.

If there is no previous sample, let p’ = 0.5 and q’ = 0.5 as this gives the largest value for n.

What would n be for a 95% confidence interval with an error of only +/-3% with no sample data?

n = Z α 2 EBP 2 p ' q ' = 1 . 96 0 . 03 2 ( 0 . 5 ) ( 0 . 5 ) 1067 . 1 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'= left ( { {1 "." "96"} over {0 "." "03"} } right ) rSup {2} } size 12{ \( 0 "." 5 \) \( 0 "." 5 \) approx "1067" "." 1}} {}

AWAYS ROUND UP, hence n = 1068

Note that not having a previous sample greatly increases the number of data values needed in a sample. Often a pilot study is done to generate an approximation for p.

The Mesa College mathematics department has noticed that a number of students place in a non-transfer level course and only need a 6 week refresher rather than an entire semester long course. If it is thought that about 10% of the students fall in this category, how many must the department survey is they wish to be 95% certain that the true population proportion is within +/-5%?

n = Z α 2 EBP 2 p ' q ' = 1 . 96 0 . 05 2 ( 0 . 1 ) ( 0 . 9 ) 138 . 3 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'= left ( { {1 "." "96"} over {0 "." "05"} } right ) rSup {2} } size 12{ \( 0 "." 1 \) \( 0 "." 9 \) approx "138" "." 3}} {}

AWAYS ROUND UP, hence n = 139

Suppose the math department has no previous information. How many students should be surveyed?

n = Z α 2 EBP 2 p ' q ' = 1 . 96 0 . 05 2 ( 0 . 5 ) ( 0 . 5 ) 384 . 2 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'= left ( { {1 "." "96"} over {0 "." "05"} } right ) rSup {2} } size 12{ \( 0 "." 5 \) \( 0 "." 5 \) approx "384" "." 2}} {}

AWAYS ROUND UP, hence n = 385

Suppose Cardmart wish to know what proportion of men buy their wife a Mother’s Day Card. How many people must be sampled is they wish to be 95% certain that the proportion is within 2%? Suppose that a previous sample of 500 men reported that 421 of them bought their wife a Mother’s Day Card.

p' = 421 500 = 0.842
, hence
q' = 1 - p' = 1 - 0.842 = 0.158

E = 0.02

n = Z α 2 EBP 2 p ' q ' = 1 . 645 0 . 02 2 ( 0 . 842 ) ( 0 . 158 ) 899 . 997 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'= left ( { {1 "." "645"} over {0 "." "02"} } right ) rSup {2} } size 12{ \( 0 "." "842" \) \( 0 "." "158" \) approx "899" "." "997"}} {}

AWAYS ROUND UP, n = 900. Hence 900 people need to be surveyed to ensure a 95% confidence interval with an error of at most 2%.

Suppose there is no previous sample. How many men will need to be surveyed?

n = Z α 2 EBP 2 p ' q ' = 1 . 645 0 . 05 2 ( 0 . 5 ) ( 0 . 5 ) 1691 . 27 size 12{n= left ( { {Z rSub { size 8{ { size 6{α} } wideslash { size 6{2} } } } } over { ital "EBP"} } right ) rSup {2} size 12{p'q'= left ( { {1 "." "645"} over {0 "." "05"} } right ) rSup {2} } size 12{ \( 0 "." 5 \) \( 0 "." 5 \) approx "1691" "." "27"}} {}

AWAYS ROUND UP, n = 1692. Hence 1692 people need to be surveyed to ensure a 95% confidence interval with an error of at most 2%.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics (with edits: teegarden). OpenStax CNX. Jul 20, 2009 Download for free at http://legacy.cnx.org/content/col10561/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics (with edits: teegarden)' conversation and receive update notifications?

Ask