<< Chapter < Page Chapter >> Page >

Assignment checklist

Turn in the following typed (12 point) and stapled packet for your final project:
____ Cover sheet containing your name(s), class time, and the name of your study
____ Summary , which includes all items listed on summary checklist
____ Solution sheet neatly and completely filled out. The solution sheet does not need to be typed.
____ Graphic representation of your data , created following the guidelines previously discussed; include only graphs which are appropriate and useful.
____ Raw data collected AND a table summarizing the sample data ( n , x ¯ and s ; or x , n , and p ’, as appropriate for your hypotheses); the raw data does not need to be typed, but the summary does. Hand in the data as you collected it. (Either attach your tally sheet or an envelope containing your questionnaires.)

Bivariate data, linear regression, and univariate data

Student learning objectives

  • The students will collect a bivariate data sample through the use of appropriate sampling techniques.
  • The student will attempt to fit the data to a linear model.
  • The student will determine the appropriateness of linear fit of the model.
  • The student will analyze and graph univariate data.

Instructions

  1. As you complete each task below, check it off. Answer all questions in your introduction or summary.
  2. Check your course calendar for intermediate and final due dates.
  3. Graphs may be constructed by hand or by computer, unless your instructor informs you otherwise. All graphs must be neat and accurate.
  4. All other responses must be done on the computer.
  5. Neatness and quality of explanations are used to determine your final grade.

Part i: bivariate data

Introduction

____State the bivariate data your group is going to study.

Here are two examples, but you may NOT use them: height vs. weight and age vs. running distance.


____Describe your sampling technique in detail. Use cluster, stratified, systematic, or simple random sampling (using a random number generator) sampling. Convenience sampling is NOT acceptable.
____Conduct your survey. Your number of pairs must be at least 30.
____Print out a copy of your data.

Analysis

____On a separate sheet of paper construct a scatter plot of the data. Label and scale both axes.
____State the least squares line and the correlation coefficient.
____On your scatter plot, in a different color, construct the least squares line.
____Is the correlation coefficient significant? Explain and show how you determined this.
____Interpret the slope of the linear regression line in the context of the data in your project. Relate the explanation to your data, and quantify what the slope tells you.
____Does the regression line seem to fit the data? Why or why not? If the data does not seem to be linear, explain if any other model seems to fit the data better.
____Are there any outliers? If so, what are they? Show your work in how you used the potential outlier formula in the Linear Regression and Correlation chapter (since you have bivariate data) to determine whether or not any pairs might be outliers.

Part ii: univariate data

In this section, you will use the data for ONE variable only. Pick the variable that is more interesting to analyze. For example: if your independent variable is sequential data such as year with 30 years and one piece of data per year, your x -values might be 1971, 1972, 1973, 1974, …, 2000. This would not be interesting to analyze. In that case, choose to use the dependent variable to analyze for this part of the project.
_____Summarize your data in a chart with columns showing data value, frequency, relative frequency, and cumulative relative frequency.
_____Answer the following question, rounded to two decimal places:

  1. Sample mean = ______
  2. Sample standard deviation = ______
  3. First quartile = ______
  4. Third quartile = ______
  5. Median = ______
  6. 70th percentile = ______
  7. Value that is 2 standard deviations above the mean = ______
  8. Value that is 1.5 standard deviations below the mean = ______
_____Construct a histogram displaying your data. Group your data into six to ten intervals of equal width. Pick regularly spaced intervals that make sense in relation to your data. For example, do NOT group data by age as 20-26,27-33,34-40,41-47,48-54,55-61 . . . Instead, maybe use age groups 19.5-24.5, 24.5-29.5, . . . or 19.5-29.5, 29.5-39.5, 39.5-49.5, . . .
_____In complete sentences, describe the shape of your histogram.
_____Are there any potential outliers? Which values are they? Show your work and calculations as to how you used the potential outlier formula in Descriptive Statistics (since you are now using univariate data) to determine which values might be outliers.
_____Construct a box plot of your data.
_____Does the middle 50% of your data appear to be concentrated together or spread out? Explain how you determined this.
_____Looking at both the histogram AND the box plot, discuss the distribution of your data. For example: how does the spread of the middle 50% of your data compare to the spread of the rest of the data represented in the box plot; how does this correspond to your description of the shape of the histogram; how does the graphical display show any outliers you may have found; does the histogram show any gaps in the data that are not visible in the box plot; are there any interesting features of your data that you should point out.

Due dates

  • Part I, Intro: __________ (keep a copy for your records)
  • Part I, Analysis: __________ (keep a copy for your records)
  • Entire Project, typed and stapled: __________

    ____ Cover sheet: names, class time, and name of your study

    ____ Part I: label the sections “Intro” and “Analysis.”

    ____ Part II:

    ____ Summary page containing several paragraphs written in complete sentences describing the experiment, including what you studied and how you collected your data. The summary page should also include answers to ALL the questions asked above.

    ____ All graphs requested in the project

    ____ All calculations requested to support questions in data

    ____ Description: what you learned by doing this project, what challenges you had, how you overcame the challenges

Note

Include answers to ALL questions asked, even if not explicitly repeated in the items above.

Questions & Answers

how to use grouped and ungrouped data
Hassan Reply
Just a test from gplay
Lucy Reply
how come 5.67
Mano Reply
by dividing 11.37 on 2
saifuddin
by dividing 11.34 on 2
saifuddin
what is index number?
vinayak
What is the differences between quota an lottery system of sampling
EGBE
What are the are the characteristics that are critically expedients in selecting the sample size
EGBE
fit a binomial distribution for the following data and test the goodness of fit x: 0 1 2 3 4 5 6 f: 5 18 28 12 7 6 4
Mikki Reply
solution
Mano
solve the question please
Mano
can some please help solve so that we learn some
Simonsakala
It is a square chi
Nelson
But can't be a binomial because, the x numbers are 0 to 6, instead those would be "0" or "1" in a straight way
Nelson
You can do a chi-square test, but the assumption has to be a normal distribution, and the last f's number need to be "64"
Nelson
sorry the last f's numbers : "6 and 4" which are the observed values for 5 and 6 (expected values)
Nelson
hi
rajendra
can't understand basic of statistics ..
rajendra
Sorry I see my mistake, we have to calculate the expected values
Nelson
So we need this equation: P= (X=x)=(n to x) p^x(1-p)^n-x
Nelson
why it is not possible brother
ibrar
were n= 2 ( binomial) x= number of makes (0 to 6) and p= probability, could be 0.8.
Nelson
so after we calculate the expected values for each observed value (f) we do the chi-square. x^2=summatory(observed-expected)^2 / expected and compare with x^2 in table with 0.8
Nelson
tomorrow I'll post the answer, I'm so tired today, sorry for my mistake in the first messages.
Nelson
It is possible, sorry for my mistake
Nelson
two trader shared investment and buoght Cattle.Mr.Omer bought 255 cows & rented the farm for a period of 32 days. Mr. Ahmed grazed his Cattle for 25 days. Mr. Ahmed's cattle was 180 cows.Together they profited $ 7800. the rent of the farm is $ 3000 so divide the profit per gows/day for grazing day
Mohamed
how to start this book, who is reading thins first time
Nissar Reply
It is my first time reading this book
Qader
Good one
ihsan
from were did you get 2/50?m
Chico Reply
People living longer
Qader Reply
Why do you think that is?
Jazzy
because there is an increase in number of people with age more than 30.
Qader
Ok. And what do you think is the driving factor behind that hypothesis?
Jazzy
fewer birth and increase in # of years living or fewer dying
Qader
What about the improvement of technology and medicine?
Jazzy
please help me, with this.... three coins are tossed let be the number of head(h)obtain,construct a probability distribution for X
godwin
technology and medicine is improving but they are limited
Ayunku
If those conscience of their health, one will live longer periods of life.
Montrae
hi,why the mean =sum(xi)/n but the variance =sum(xi-xbar)/ n-1 what is the difference between (n or n-1)
Soran Reply
This is hard to type, so I'll use "m" for "x bar", and a few other notations that I hope will be clear: Definition: sqrt(SUM[(x - m)^2] / (n-1)) where m = SUM[x] / n Desired formula: sqrt((SUM[x^2] - SUM[x]^2)/n / (n-1)) Now let's do what you started to do, and see if we can manipulate the definitio
Michael
what is the difference between (n ) and (n-1) in the mean and variance
Soran
Definition: sqrt(SUM[(x - m)^2] / (n-1)) where m = SUM[x] / n what is the difference between (n and n-1)
Soran
Hi, the diference is tha when we estimate parameters in a sample (not in the total population) we need to consider the degrees of liberty for the estimation.
Nelson
Hie guys, am analysing rainfall data for different stations and i got kurtosis values of 0.7 for one station and 0.4 for another, what can i say about this?
Kudakwashe
hi
ujjal
difference in degrees of freedom
vinayak
who introduced the statistics in england
Sunday Reply
what year is data first used
Sunday
Gottfried achievell is a professor at
Sunday Reply
hi
Soran
hello
mskinne4
hi
Raja
hello
Tshepo
freakin annoying chat. uninstalled
KickingtheDonksNuts
Hi
Sunday
hello
Inoxent
Hello rumiex
Abimbola
How is ur day going
Abimbola
Pls I hv a question 🙋 to ask
Abimbola
I want to ask, as a student who wants to study mechatronic engineering in the University. What are the necessary subject he or she needed, and after ur service were can or she work as a graduate
Abimbola
?
Abimbola
hello
Gh
Write a short note on skewness
Saran Reply
and on kurtosis too
Hiren
What is events
Manish Reply
IT occurs in probability
Peter
yes
Preeti
any experiment done
Preeti
who introduced statistics in england
Sunday
who is a strong man?
Desmond Reply
who never gives up
Preeti
Who eats lots of food 😂
abhinav
A man with great physical strength
Nucks
I still fetch confused about this statistics
Mustapha
hi
Samuel
hlw
Suman
hy
ibrar
One who withstand everything that was meant to break him.
Jazzy
"The stronger is not he who puts down his opponent on the ground but the stronger is he who controls himself in a pit of anger" (Bukhari Sharif)
saifuddin
still contemplating about analyzing statistical methods
solomon
Can you sir plz provide all the multiple choice questions related to Index numbers.?
Hiren Reply
about probabilty i have some questions and i want the solution
asad Reply

Get the best Introductory statistics course in your pocket!





Source:  OpenStax, Introductory statistics. OpenStax CNX. May 06, 2016 Download for free at http://legacy.cnx.org/content/col11562/1.18
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introductory statistics' conversation and receive update notifications?

Ask