<< Chapter < Page Chapter >> Page >
This module states the factors associated with F Distributions and provides students with some examples to help further understand the concept. Students will be given the opportunity to see F Distributions in action through participation in an optional classroom exercise.
  1. The curve is not symmetrical but skewed to the right.
  2. There is a different curve for each set of dfs .
  3. The F statistic is greater than or equal to zero.
  4. As the degrees of freedom for the numerator and for the denominator get larger, the curve approximates the normal.
  5. Other uses for the F distribution include comparing two variances and Two-Way Analysis of Variance. Comparing two variances is discussed at the end of the chapter.Two-Way Analysis is mentioned for your information only.

Nonsymmetrical F distribution curve skewed to the right, more values in the right tail and the peak is closer to the left. This curve is different from the graph on the right because of the different dfs. Nonsymmetrical F distribution curve skewed to the right, more values in the right tail and the peak is closer to the left. This curve is different from the graph on the left because of the different dfs. Because its dfs are larger, it is closer in resemblance to a normal distribution curve.

One-Way ANOVA: Four sororities took a random sample of sisters regarding their grade means for the past term. The results are shown below:

MEAN GRADES FOR FOUR SORORITIES
Sorority 1 Sorority 2 Sorority 3 Sorority 4
2.17 2.63 2.63 3.79
1.85 1.77 3.78 3.45
2.83 3.25 4.00 3.08
1.69 1.86 2.55 2.26
3.33 2.21 2.45 3.18

Using a significance level of 1%, is there a difference in mean grades among the sororities?

Let μ 1 , μ 2 , μ 3 , μ 4 be the population means of the sororities. Remember that the null hypothesis claims that the sorority groups are from the same normal distribution.The alternate hypothesis says that at least two of the sorority groups come from populations with different normal distributions. Notice that the four sample sizes areeach size 5.

This is an example of a balanced design , since each factor (i.e. Sorority) has the same number of observations.

H o : μ 1 = μ 2 = μ 3 = μ 4

H a : Not all of the means μ 1 , μ 2 , μ 3 , μ 4 are equal.

Distribution for the test: F 3 , 16

where k = 4 groups and n = 20 samples in total

df(num) = k - 1 = 4 - 1 = 3

df(denom) = n - k = 20 - 4 = 16

Calculate the test statistic: F = 2.23

Graph:

Nonsymmetrical F distribution curve with values of 0 and 2.23 on the x-axis representing the test statistic of sorority grade averages. A vertical upward line extends from 2.23 to the curve and the area to the right of this is equal to the p-value.

Probability statement: p-value = P ( F > 2.23 ) = 0.1241

Compare α and the p-value : α = 0.01 p-value = 0.1241 α < p-value

Make a decision: Since α < p-value , you cannot reject H o .

Conclusion: There is not sufficient evidence to conclude that there is a difference among the mean grades for the sororities.

TI-83+ or TI 84: Put the data into lists L1, L2, L3, and L4. Press STAT and arrow over to TESTS . Arrow down to F:ANOVA . Press ENTER and Enter ( L1,L2,L3,L4 ). The F statistic is 2.2303 and the p-value is 0.1241. df(numerator) = 3 (under "Factor" ) and df(denominator) = 16 (under Error ).

Got questions? Get instant answers now!
Got questions? Get instant answers now!

A fourth grade class is studying the environment. One of the assignments is to grow bean plants in different soils. Tommy chose to grow his beanplants in soil found outside his classroom mixed with dryer lint. Tara chose to grow her bean plants in potting soil bought at the local nursery. Nick chose to grow his beanplants in soil from his mother's garden. No chemicals were used on the plants, only water. They were grown inside the classroom next to a large window. Each childgrew 5 plants. At the end of the growing period, each plant was measured, producing the following data (in inches):

Tommy's Plants Tara's Plants Nick's Plants
24 25 23
21 31 27
23 23 22
30 20 30
23 28 20

Does it appear that the three media in which the bean plants were grown produce the same mean height? Test at a 3% level of significance.

This time, we will perform the calculations that lead to the F' statistic. Notice that each group has the same number of plants so we will use the formula F' = n s x _ 2 s 2 pooled .

First, calculate the sample mean and sample variance of each group.

Tommy's Plants Tara's Plants Nick's Plants
Sample Mean 24.2 25.4 24.4
Sample Variance 11.7 18.3 16.3

Next, calculate the variance of the three group means (Calculate the variance of 24.2, 25.4, and 24.4). Variance of the group means = 0.413 = s x 2

Then MS between = n s x 2 = ( 5 ) ( 0.413 ) where n = 5 is the sample size (number of plants each child grew).

Calculate the mean of the three sample variances (Calculate the mean of 11.7, 18.3, and 16.3). Mean of the sample variances = 15.433 = s 2 pooled size 12{ \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } ={}} {}

Then MS within = s 2 pooled size 12{ \( s rSub { size 8{ ital "pooled"} } \) rSup { size 8{2} } ={}} {} = 15.433 .

The F statistic (or F ratio) is F = MS between MS within = n s x _ 2 s 2 pooled = ( 5 ) ( 0.413 ) 15.433 = 0.134

The dfs for the numerator = the number of groups - 1 = 3 - 1 = 2

The dfs for the denominator = the total number of samples - the number of groups = 15 - 3 = 12

The distribution for the test is F 2 , 12 and the F statistic is F = 0.134

The p-value is P ( F > 0.134 ) = 0.8759 .

Decision: Since α = 0.03 and the p-value = 0.8759 , do not reject H o . (Why?)

Conclusion: With a 3% the level of significance, from the sample data, the evidence is not sufficient to conclude that the mean heights of the bean plants are different.

(This experiment was actually done by three classmates of the son of one of the authors.)

Got questions? Get instant answers now!

Another fourth grader also grew bean plants but this time in a jelly-like mass. The heights were (in inches) 24, 28, 25, 30, and 32.

Do a One-Way ANOVA test on the 4 groups. You may use your calculator or computer to perform the test. Are the heights of the bean plants different? Use a solution sheet .

  • F = 0.9496
  • p-value = 0.4402

From the sample data, the evidence is not sufficient to conclude that the mean heights of the bean plants are different.

Got questions? Get instant answers now!

Optional classroom activity

From the class, create four groups of the same size as follows: men under 22, men at least 22, women under 22, women at least 22. Have each member of each group record the number of states in the United States he or she has visited.Run an ANOVA test to determine if the average number of states visited in the four groups are the same. Test at a 1% level of significance. Use one of the solution sheets at the end of the chapter (after the homework).

Questions & Answers

how do you get the 2/50
Abba Reply
number of sport play by 50 student construct discrete data
Aminu Reply
width of the frangebany leaves on how to write a introduction
Theresa Reply
Solve the mean of variance
Veronica Reply
Step 1: Find the mean. To find the mean, add up all the scores, then divide them by the number of scores. ... Step 2: Find each score's deviation from the mean. ... Step 3: Square each deviation from the mean. ... Step 4: Find the sum of squares. ... Step 5: Divide the sum of squares by n – 1 or N.
kenneth
what is error
Yakuba Reply
Is mistake done to something
Vutshila
Hy
anas
hy
What is the life teble
anas
hy
Jibrin
statistics is the analyzing of data
Tajudeen Reply
what is statics?
Zelalem Reply
how do you calculate mean
Gloria Reply
diveving the sum if all values
Shaynaynay
let A1,A2 and A3 events be independent,show that (A1)^c, (A2)^c and (A3)^c are independent?
Fisaye Reply
what is statistics
Akhisani Reply
data collected all over the world
Shaynaynay
construct a less than and more than table
Imad Reply
The sample of 16 students is taken. The average age in the sample was 22 years with astandard deviation of 6 years. Construct a 95% confidence interval for the age of the population.
Aschalew Reply
Bhartdarshan' is an internet-based travel agency wherein customer can see videos of the cities they plant to visit. The number of hits daily is a normally distributed random variable with a mean of 10,000 and a standard deviation of 2,400 a. what is the probability of getting more than 12,000 hits? b. what is the probability of getting fewer than 9,000 hits?
Akshay Reply
Bhartdarshan'is an internet-based travel agency wherein customer can see videos of the cities they plan to visit. The number of hits daily is a normally distributed random variable with a mean of 10,000 and a standard deviation of 2,400. a. What is the probability of getting more than 12,000 hits
Akshay
1
Bright
Sorry i want to learn more about this question
Bright
Someone help
Bright
a= 0.20233 b=0.3384
Sufiyan
a
Shaynaynay
How do I interpret level of significance?
Mohd Reply
It depends on your business problem or in Machine Learning you could use ROC- AUC cruve to decide the threshold value
Shivam
how skewness and kurtosis are used in statistics
Owen Reply
yes what is it
Taneeya
Got questions? Join the online conversation and get instant answers!
Jobilize.com Reply

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Collaborative statistics. OpenStax CNX. Jul 03, 2012 Download for free at http://cnx.org/content/col10522/1.40
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Collaborative statistics' conversation and receive update notifications?

Ask