<< Chapter < Page Chapter >> Page >

Assignment checklist

Turn in the following typed (12 point) and stapled packet for your final project:
____ Cover sheet containing your name(s), class time, and the name of your study
____ Summary , which includes all items listed on summary checklist
____ Solution sheet neatly and completely filled out. The solution sheet does not need to be typed.
____ Graphic representation of your data , created following the guidelines previously discussed; include only graphs which are appropriate and useful.
____ Raw data collected AND a table summarizing the sample data ( n , x ¯ and s ; or x , n , and p ’, as appropriate for your hypotheses); the raw data does not need to be typed, but the summary does. Hand in the data as you collected it. (Either attach your tally sheet or an envelope containing your questionnaires.)

Bivariate data, linear regression, and univariate data

Student learning objectives

  • The students will collect a bivariate data sample through the use of appropriate sampling techniques.
  • The student will attempt to fit the data to a linear model.
  • The student will determine the appropriateness of linear fit of the model.
  • The student will analyze and graph univariate data.

Instructions

  1. As you complete each task below, check it off. Answer all questions in your introduction or summary.
  2. Check your course calendar for intermediate and final due dates.
  3. Graphs may be constructed by hand or by computer, unless your instructor informs you otherwise. All graphs must be neat and accurate.
  4. All other responses must be done on the computer.
  5. Neatness and quality of explanations are used to determine your final grade.

Part i: bivariate data

Introduction

____State the bivariate data your group is going to study.

Here are two examples, but you may NOT use them: height vs. weight and age vs. running distance.


____Describe your sampling technique in detail. Use cluster, stratified, systematic, or simple random sampling (using a random number generator) sampling. Convenience sampling is NOT acceptable.
____Conduct your survey. Your number of pairs must be at least 30.
____Print out a copy of your data.

Analysis

____On a separate sheet of paper construct a scatter plot of the data. Label and scale both axes.
____State the least squares line and the correlation coefficient.
____On your scatter plot, in a different color, construct the least squares line.
____Is the correlation coefficient significant? Explain and show how you determined this.
____Interpret the slope of the linear regression line in the context of the data in your project. Relate the explanation to your data, and quantify what the slope tells you.
____Does the regression line seem to fit the data? Why or why not? If the data does not seem to be linear, explain if any other model seems to fit the data better.
____Are there any outliers? If so, what are they? Show your work in how you used the potential outlier formula in the Linear Regression and Correlation chapter (since you have bivariate data) to determine whether or not any pairs might be outliers.

Part ii: univariate data

In this section, you will use the data for ONE variable only. Pick the variable that is more interesting to analyze. For example: if your independent variable is sequential data such as year with 30 years and one piece of data per year, your x -values might be 1971, 1972, 1973, 1974, …, 2000. This would not be interesting to analyze. In that case, choose to use the dependent variable to analyze for this part of the project.
_____Summarize your data in a chart with columns showing data value, frequency, relative frequency, and cumulative relative frequency.
_____Answer the following question, rounded to two decimal places:

  1. Sample mean = ______
  2. Sample standard deviation = ______
  3. First quartile = ______
  4. Third quartile = ______
  5. Median = ______
  6. 70th percentile = ______
  7. Value that is 2 standard deviations above the mean = ______
  8. Value that is 1.5 standard deviations below the mean = ______
_____Construct a histogram displaying your data. Group your data into six to ten intervals of equal width. Pick regularly spaced intervals that make sense in relation to your data. For example, do NOT group data by age as 20-26,27-33,34-40,41-47,48-54,55-61 . . . Instead, maybe use age groups 19.5-24.5, 24.5-29.5, . . . or 19.5-29.5, 29.5-39.5, 39.5-49.5, . . .
_____In complete sentences, describe the shape of your histogram.
_____Are there any potential outliers? Which values are they? Show your work and calculations as to how you used the potential outlier formula in Descriptive Statistics (since you are now using univariate data) to determine which values might be outliers.
_____Construct a box plot of your data.
_____Does the middle 50% of your data appear to be concentrated together or spread out? Explain how you determined this.
_____Looking at both the histogram AND the box plot, discuss the distribution of your data. For example: how does the spread of the middle 50% of your data compare to the spread of the rest of the data represented in the box plot; how does this correspond to your description of the shape of the histogram; how does the graphical display show any outliers you may have found; does the histogram show any gaps in the data that are not visible in the box plot; are there any interesting features of your data that you should point out.

Due dates

  • Part I, Intro: __________ (keep a copy for your records)
  • Part I, Analysis: __________ (keep a copy for your records)
  • Entire Project, typed and stapled: __________

    ____ Cover sheet: names, class time, and name of your study

    ____ Part I: label the sections “Intro” and “Analysis.”

    ____ Part II:

    ____ Summary page containing several paragraphs written in complete sentences describing the experiment, including what you studied and how you collected your data. The summary page should also include answers to ALL the questions asked above.

    ____ All graphs requested in the project

    ____ All calculations requested to support questions in data

    ____ Description: what you learned by doing this project, what challenges you had, how you overcame the challenges

Note

Include answers to ALL questions asked, even if not explicitly repeated in the items above.

Questions & Answers

what is permutation
Rodlett Reply
how to construct a histogram
Baalisi Reply
I really appreciate that
umar Reply
I want to test linear regression data such as maintenance fees vs house size. Can I use R square, F test to test the relationship? Is the good condition of R square greater than 0.5
Mok Reply
yes of course must have use f test and also use t test individually multple coefficients
rishi
Alright
umar
hi frnd I'm akeem by name, I wanna study economics and statistics wat ar d thing I must do to b a great economist
akeem
Is R square cannot analysis linear regression of X vs Y relationship?
Mok
To be an economist you have to be professional in maths
umar
hi frnds
Shehu
what is random sampling what is sample error
Nistha Reply
@Nistha Kashyap Random sampling is the selection of random items (or random numbers) from the group. A sample error occurs when the selected samples do not truely represent the whole group. The can happen when most or all of the selected samples are taken from only one section of the group;
Ron
Thus the sample is not truely random.
Ron
What is zero sum game?
Hassan Reply
A game in which there is no profit & no loss to any of the both player.
Milan
Differences between sample mean & population mean
mohammed Reply
***keydifferences.com/difference-between-sample-mean-and-population-mean.html
Lucien
Not difference in the formula except the notation, sample mean is denoted by x bar and population mean is denoted by mu symbol. There is formula as well as notation between difference variance and standard deviations
Akash
Likely the difference would be in the result, unless the sample is an exact representation of the population (which is unlikely.)
Ron
what is data
Nii
Nii Avin - Data is just a simple way to refer to the numbers in the population, or in the sample used in your calculations.
Ron
what are the types of data
Nii
Data is the very pale android from the Star Trek Enterprise
Andrew
Am Emmanuel from Nigeria
Emmanuel
Am Qudus from Nigeria
Rasak
am Handson from Cameroon
Handson
what is a mode?
Handson
Nii - data is whatever you are sampling. Such as the number of students in each classroom.
Ron
Handson Ndintek - the mode is the number appearing most frequently. Example: 7 9 11 7 4 6 3 7 2. 7 is the mode. In a group such as 7 9 1 4 6 3, there is no mode because no number appears more often than any other.
Ron
hi I want to know how to find class boundary
Baalisi
give me the two types of data
Neddy Reply
qualitative and quantitative
phoenix
primary and secondary data
Peace
qualitative and quantitative
Prince
Using Cauchy Schwartz inequality,or prove that b2-b1-1=0
Md Reply
what is the ongoing probability that President Trump will remain in the position he has chosen as his viability of his cabinet as he runs for reelection in the primaries of 2020 election year
Terry
what is statistic?
Jhasaketan Reply
it's a science of collection, organization, analysis and summarizing data to get useful information to make several types of conclusions.which can be used in real life.
anshika
what is the statistical probability that president Trump will remain in the white house after the election of 2020?
Terry
i agree with anshika is right but let me add that such decisions are made in face of uncertainty
Maureen
yes
Stephen
classification of statistic
Jhasaketan Reply
statistic can classified into many types eassy to understand future values effect
Narendra
what is mean?
Jhasaketan
average value
Narendra
İ want to understand what is t test or neyma. Pearson test ans difference
Yasin
to test the hypotesis ho follws h1 l1/lo
Narendra
Hope this helps. There are three main types of averages. *mean -> average -> (X1+X2+X3+...+Xn) / n *mode -> the element within a set which occurs most. {3,4,5,8,12,3,4,3,3,56} mode = 3 *median -  {3,3,4,5,8,12,56} median = 5 OR {3,4,5,8,12,56} median = 6.5
Jack
conceptual approach to limits
lameck Reply
how are limits derived?
lameck
an entire section of calculus is devoted to that explanation.
Pitior
what is statistics?
Martin Reply
statistics :- can be defined as the branches of mathematics that deals with the summarizing, analysing,organization and interpretation of data.
Usman
well said
Venkat
can we find Z value on calculator with out using Z table
Maham Reply
no
Pitior
why
Maham
can another way is possible ?
Maham
Well you could make a table. And as the function you use the one used at the z table
Luca
The normal function is only one way, so you can only try using different numbers until you get the probability that you have. So that is easier if you have a table
Luca
me don't know nothing about z table and don't know how to see the z value on table can you tell me please how see the value on table
Maham
The z table is the table of the standard normal distribution
Luca
You can look it up on internet, its easier than writing down the normal distribution function (with an integral) and doing a table in the calculator
Luca
OK thanks luca
Maham
yes use pnorm in r
Venkat
pnorm(2.3,mean=0,sd=1)
Venkat
pnorm?
Maham
do u have r software
Venkat
no
Maham
its with tht u will get
Venkat
or type in google
Venkat
z mathportal calculator
Venkat
calculator
Venkat
OK venkat thanks
Maham
welcome
Venkat
have calculator but don't know how find z value
Maham
ti83
Venkat
hey guys I'm from computer background so what are the concepts I supposed to prepare for interview in statistics
Alwin
descriptive stats
Venkat
inferential stats
Venkat
outlier treatment
Venkat
boxplot
Venkat
ok
Alwin
assumption of linear regression
Venkat
logistic regression
Venkat
k means clustering
Venkat
exact syllabus?
Alwin
type. analytics vidya interview questions statistics
Venkat
listen. data also
Venkat
like this forum
Jameel
My question is "is it only stats?"
Jameel
wer is the problem
Venkat
how find straight line equation in regression
Maham Reply
u can find using excel
Venkat
or r studio
Venkat
for regression
Venkat
shall i help
Venkat
im an expert
Venkat
by giving a value to x,y
Ibrokhim
first provide data
Venkat
ill solve and guve
Venkat
ive
Venkat
yeah please
Maham
maham you posted data
Venkat
please post data
Venkat
ok
Maham
x:1,2,3,4,5 y:2,5,6,8,9
Maham
regredsion equation is
Venkat
y=0.9+1.7x
Venkat
reg eq is y=0.9+1.7x
Venkat
slope = 1.7
Venkat
yintercept = 0.9
Venkat
answered
Venkat
thanx venkat naveen😊
Maham
welcome
Venkat

Get the best Introductory statistics course in your pocket!





Source:  OpenStax, Introductory statistics. OpenStax CNX. May 06, 2016 Download for free at http://legacy.cnx.org/content/col11562/1.18
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introductory statistics' conversation and receive update notifications?

Ask