# 0.4 Group and partner projects  (Page 4/4)

 Page 4 / 4

## Assignment checklist

Turn in the following typed (12 point) and stapled packet for your final project:
____ Cover sheet containing your name(s), class time, and the name of your study
____ Summary , which includes all items listed on summary checklist
____ Solution sheet neatly and completely filled out. The solution sheet does not need to be typed.
____ Graphic representation of your data , created following the guidelines previously discussed; include only graphs which are appropriate and useful.
____ Raw data collected AND a table summarizing the sample data ( n , $\overline{x}$ and s ; or x , n , and p ’, as appropriate for your hypotheses); the raw data does not need to be typed, but the summary does. Hand in the data as you collected it. (Either attach your tally sheet or an envelope containing your questionnaires.)

## Student learning objectives

• The students will collect a bivariate data sample through the use of appropriate sampling techniques.
• The student will attempt to fit the data to a linear model.
• The student will determine the appropriateness of linear fit of the model.
• The student will analyze and graph univariate data.

## Instructions

1. As you complete each task below, check it off. Answer all questions in your introduction or summary.
2. Check your course calendar for intermediate and final due dates.
3. Graphs may be constructed by hand or by computer, unless your instructor informs you otherwise. All graphs must be neat and accurate.
4. All other responses must be done on the computer.
5. Neatness and quality of explanations are used to determine your final grade.

## Introduction

____State the bivariate data your group is going to study.

Here are two examples, but you may NOT use them: height vs. weight and age vs. running distance.

____Describe your sampling technique in detail. Use cluster, stratified, systematic, or simple random sampling (using a random number generator) sampling. Convenience sampling is NOT acceptable.
____Conduct your survey. Your number of pairs must be at least 30.
____Print out a copy of your data.

## Analysis

____On a separate sheet of paper construct a scatter plot of the data. Label and scale both axes.
____State the least squares line and the correlation coefficient.
____On your scatter plot, in a different color, construct the least squares line.
____Is the correlation coefficient significant? Explain and show how you determined this.
____Interpret the slope of the linear regression line in the context of the data in your project. Relate the explanation to your data, and quantify what the slope tells you.
____Does the regression line seem to fit the data? Why or why not? If the data does not seem to be linear, explain if any other model seems to fit the data better.
____Are there any outliers? If so, what are they? Show your work in how you used the potential outlier formula in the Linear Regression and Correlation chapter (since you have bivariate data) to determine whether or not any pairs might be outliers.

## Part ii: univariate data

In this section, you will use the data for ONE variable only. Pick the variable that is more interesting to analyze. For example: if your independent variable is sequential data such as year with 30 years and one piece of data per year, your x -values might be 1971, 1972, 1973, 1974, …, 2000. This would not be interesting to analyze. In that case, choose to use the dependent variable to analyze for this part of the project.
_____Summarize your data in a chart with columns showing data value, frequency, relative frequency, and cumulative relative frequency.
_____Answer the following question, rounded to two decimal places:

1. Sample mean = ______
2. Sample standard deviation = ______
3. First quartile = ______
4. Third quartile = ______
5. Median = ______
6. 70th percentile = ______
7. Value that is 2 standard deviations above the mean = ______
8. Value that is 1.5 standard deviations below the mean = ______
_____Construct a histogram displaying your data. Group your data into six to ten intervals of equal width. Pick regularly spaced intervals that make sense in relation to your data. For example, do NOT group data by age as 20-26,27-33,34-40,41-47,48-54,55-61 . . . Instead, maybe use age groups 19.5-24.5, 24.5-29.5, . . . or 19.5-29.5, 29.5-39.5, 39.5-49.5, . . .
_____In complete sentences, describe the shape of your histogram.
_____Are there any potential outliers? Which values are they? Show your work and calculations as to how you used the potential outlier formula in Descriptive Statistics (since you are now using univariate data) to determine which values might be outliers.
_____Construct a box plot of your data.
_____Does the middle 50% of your data appear to be concentrated together or spread out? Explain how you determined this.
_____Looking at both the histogram AND the box plot, discuss the distribution of your data. For example: how does the spread of the middle 50% of your data compare to the spread of the rest of the data represented in the box plot; how does this correspond to your description of the shape of the histogram; how does the graphical display show any outliers you may have found; does the histogram show any gaps in the data that are not visible in the box plot; are there any interesting features of your data that you should point out.

## Due dates

• Part I, Intro: __________ (keep a copy for your records)
• Part I, Analysis: __________ (keep a copy for your records)
• Entire Project, typed and stapled: __________

____ Cover sheet: names, class time, and name of your study

____ Part I: label the sections “Intro” and “Analysis.”

____ Part II:

____ Summary page containing several paragraphs written in complete sentences describing the experiment, including what you studied and how you collected your data. The summary page should also include answers to ALL the questions asked above.

____ All graphs requested in the project

____ All calculations requested to support questions in data

____ Description: what you learned by doing this project, what challenges you had, how you overcame the challenges

## Note

Include answers to ALL questions asked, even if not explicitly repeated in the items above.

It should be a Machine learning terms。
Mok
it is a term used in linear regression
Saurav
what are the differences between standard deviation and variancs?
Enhance
what is statistics
statistics is the collection and interpretation of data
Enhance
the science of summarization and description of numerical facts
Enhance
Is the estimation of probability
Zaini
mr. zaini..can u tell me more clearly how to calculated pair t test
Haai
do you have MG Akarwal Statistics' book Zaini?
Enhance
Haai how r u?
Enhance
maybe .... mathematics is the science of simplification and statistics is the interpretation of such values and its implications.
Miguel
can we discuss about pair test
Haai
what is outlier?
outlier is an observation point that is distant from other observations.
Gidigah
what is its effect on mode?
Usama
Outlier  have little effect on the mode of a given set of data.
Gidigah
How can you identify a possible outlier(s) in a data set.
Daniel
The best visualisation method to identify the outlier is box and wisker method or boxplot diagram. The points which are located outside the max edge of wisker(both side) are considered as outlier.
Akash
@Daniel Adunkwah - Usually you can identify an outlier visually. They lie outside the observed pattern of the other data points, thus they're called outliers.
Ron
what is completeness?
I am new to this. I am trying to learn.
Dom
I am also new Dom, welcome!
Nthabi
thanks
Dom
please my friend i want same general points about statistics. say same thing
alex
outliers do not have effect on mode
Meselu
also new
yousaf
I don't get the example
ways of collecting data at least 10 and explain
Example of discrete variable
Gbenga
I am new here, can I get someone to guide up?
alayo
dies outcome is 1, 2, 3, 4, 5, 6 nothing come outside of it. it is an example of discrete variable
jainesh
continue variable is any value value between 0 to 1 it could be 4digit values eg 0.1, 0.21, 0.13, 0.623, 0.32
jainesh
hi
Kachalla
what's up here ... am new here
Kachalla
sorry question a bit unclear...do you mean how do you analyze quantitative data? If yes, it depends on the specific question(s) you set in the beginning as well as on the data you collected. So the method of data analysis will be dependent on the data collecter and questions asked.
Bheka
how to solve for degree of freedom
saliou
Quantitative data is the data in numeric form. For eg: Income of persons asked is 10,000. This data is quantitative data on the other hand data collected for either make or female is qualitative data.
Rohan
*male
Rohan
Degree of freedom is the unconditionality. For example if you have total number of observations n, and you have to calculate variance, obviously you will need mean for that. Here mean is a condition, without which you cannot calculate variance. Therefore degree of freedom for variance will be n-1.
Rohan
data that is best presented in categories like haircolor, food taste (good, bad, fair, terrible) constitutes qualitative data
Bheka
vegetation types (grasslands, forests etc) qualitative data
Bheka
I don't understand how you solved it can you teach me
solve what?
Ambo
mean
Vanarith
What is the end points of a confidence interval called?
lower and upper endpoints
Bheka
Class members write down the average time (in hours, to the nearest half-hour) they sleep per night.
how we make a classes of this(170.3,173.9,171.3,182.3,177.3,178.3,174.175.3)
Sarbaz
6.5
phoenix
11
Shakir
7.5
Ron
why is always lower class bundry used
Caleb
Assume you are in a class where quizzes are 20% of your grade, homework is 20%, exam _1 is 15%,exam _2 is 15%, and the final exam is 20%.Suppose you are in the fifth week and you just found out that you scored a 58/63 on the fist exam. You also know that you received 6/9,8/10,9/9 on the first
quizzes as well as a 9/11,10/10,and 4.5/7 on the first three homework assignment. what is your current grade in the course?
Diamatu
Abdul
if putting y=3x examine that correlation coefficient between x and y=3x is 1.
what is permutation
how to construct a histogram
You have to plot the class midpoint and the frequency
Wydny
ok so you use those two to draw the histogram right.
Amford
yes
Wydny
ok can i be a friend so you can be teaching me small small
Amford
how do you calculate cost effectiveness?
George
Hi everyone, this is a very good statistical group and am glad to be part of it. I'm just not sure how did I end up here cos this discussion just popes on my screen so if I wanna ask something in the future, how will I find you?
Bheka
To make a histogram, follow these steps: On the vertical axis, place frequencies. Label this axis "Frequency". On the horizontal axis, place the lower value of each interval. ... Draw a bar extending from the lower value of each interval to the lower value of the next interval.
Divya
I really appreciate that