# 0.2 Practice tests (1-4) and final exams  (Page 28/36)

 Page 28 / 36

## 12.4: the regression equation

13 . $r\left(\frac{{s}_{y}}{{s}_{x}}\right)=0.73\left(\frac{9.6}{4.0}\right)=1.752\approx 1.75$

14 . $a=\overline{y}-b\overline{x}=141.6-1.752\left(68.4\right)=21.7632\approx 21.76$

15 . $\stackrel{^}{y}=21.76+1.75\left(68\right)=140.76$

## 12.5: correlation coefficient and coefficient of determination

16 . The coefficient of determination is the square of the correlation, or r 2 .
For this data, r 2 = (–0.56)2 = 0.3136 ≈ 0.31 or 31%. This means that 31 percent of the variation in fuel efficiency can be explained by the bodyweight of the automobile.

17 . The coefficient of determination = 0.32 2 = 0.1024. This is the amount of variation in freshman college GPA that can be explained by high school GPA. The amount that cannot be explained is 1 – 0.1024 = 0.8976 ≈ 0.90. So about 90 percent of variance in freshman college GPA in this data is not explained by high school GPA.

18 . $r=\sqrt{{r}^{2}}$
$\sqrt{0.5}=0.707106781\approx 0.71$
You need a correlation of 0.71 or higher to have a coefficient of determination of at least 0.5.

## 12.6: testing the significance of the correlation coefficient

19 . H 0 : ρ = 0
H a : ρ ≠ 0

20 . $t=\frac{r\sqrt{n-2}}{\sqrt{1-{r}^{2}}}=\frac{0.33\sqrt{30-2}}{\sqrt{1-{0.33}^{2}}}=1.85$
The critical value for α = 0.05 for a two-tailed test using the t 29 distribution is 2.045. Your value is less than this, so you fail to reject the null hypothesis and conclude that the study produced no evidence that the variables are significantly correlated.
Using the calculator function tcdf, the p -value is 2tcdf(1.85, 10^99, 29) = 0.0373. Do not reject the null hypothesis and conclude that the study produced no evidence that the variables are significantly correlated.

21 . $t=\frac{r\sqrt{n-2}}{\sqrt{1-{r}^{2}}}=\frac{0.45\sqrt{25-2}}{\sqrt{1-{0.45}^{2}}}=2.417$
The critical value for α = 0.05 for a two-tailed test using the t 24 distribution is 2.064. Your value is greater than this, so you reject the null hypothesis and conclude that the study produced evidence that the variables are significantly correlated.
Using the calculator function tcdf, the p-value is 2tcdf(2.417, 10^99, 24) = 0.0118. Reject the null hypothesis and conclude that the study produced evidence that the variables are significantly correlated.

## 12.7: prediction

22 . $\stackrel{^}{y}=25+16\left(5\right)=105$

23 . Because the intercept appears in both predicted values, you can ignore it in calculating a predicted difference score. The difference in grams of fiber per serving is 6 – 3 = 3 and the predicted difference in grams of potassium per serving is (16)(3) = 48.

## 12.8: outliers

24 . An outlier is an observed value that is far from the least squares regression line. A rule of thumb is that a point more than two standard deviations of the residuals from its predicted value on the least squares regression line is an outlier.

25 . An influential point is an observed value in a data set that is far from other points in the data set, in a horizontal direction. Unlike an outlier, an influential point is determined by its relationship with other values in the data set, not by its relationship to the regression line.

26 . The predicted value for y is: $\stackrel{^}{y}=5+0.3x=5.6$ . The value of 6.2 is less than two standard deviations from the predicted value, so it does not qualify as an outlier.
Residual for (2, 6.2): 6.2 – 5.6 = 0.6 (0.6<2(0.4))

conceptual approach to limits
how are limits derived?
lameck
an entire section of calculus is devoted to that explanation.
Pitior
what is statistics?
statistics :- can be defined as the branches of mathematics that deals with the summarizing, analysing,organization and interpretation of data.
Usman
well said
Venkat
can we find Z value on calculator with out using Z table
no
Pitior
why
Maham
can another way is possible ?
Maham
Well you could make a table. And as the function you use the one used at the z table
Luca
The normal function is only one way, so you can only try using different numbers until you get the probability that you have. So that is easier if you have a table
Luca
me don't know nothing about z table and don't know how to see the z value on table can you tell me please how see the value on table
Maham
The z table is the table of the standard normal distribution
Luca
You can look it up on internet, its easier than writing down the normal distribution function (with an integral) and doing a table in the calculator
Luca
OK thanks luca
Maham
yes use pnorm in r
Venkat
pnorm(2.3,mean=0,sd=1)
Venkat
pnorm?
Maham
do u have r software
Venkat
no
Maham
its with tht u will get
Venkat
Venkat
z mathportal calculator
Venkat
calculator
Venkat
OK venkat thanks
Maham
welcome
Venkat
have calculator but don't know how find z value
Maham
ti83
Venkat
hey guys I'm from computer background so what are the concepts I supposed to prepare for interview in statistics
Alwin
descriptive stats
Venkat
inferential stats
Venkat
outlier treatment
Venkat
boxplot
Venkat
ok
Alwin
assumption of linear regression
Venkat
logistic regression
Venkat
k means clustering
Venkat
exact syllabus?
Alwin
type. analytics vidya interview questions statistics
Venkat
listen. data also
Venkat
like this forum
Jameel
My question is "is it only stats?"
Jameel
wer is the problem
Venkat
how find straight line equation in regression
u can find using excel
Venkat
or r studio
Venkat
for regression
Venkat
shall i help
Venkat
im an expert
Venkat
by giving a value to x,y
Ibrokhim
first provide data
Venkat
ill solve and guve
Venkat
ive
Venkat
Maham
maham you posted data
Venkat
Venkat
ok
Maham
x:1,2,3,4,5 y:2,5,6,8,9
Maham
regredsion equation is
Venkat
y=0.9+1.7x
Venkat
reg eq is y=0.9+1.7x
Venkat
slope = 1.7
Venkat
yintercept = 0.9
Venkat
Venkat
thanx venkat naveen😊
Maham
welcome
Venkat
the tenth percentile for land selling at jabi is 35,000 and the nineteenth percentile for the land price in the same area is 225,what is the 10_90 percentile range
what is statistics
statistics is the beach of mathematics which deals with collection ,organisation, presentation, analysis and interpretation of numerical data
Saeed
oh but interpretation of data, like what and how? 🤔
Bhavani
interpretation: Think in a way that you have given a company year turnover and you have a record of 100years and data set is like (Year,Turnover). Now with that data you can interpret many thing how was the company growth, when were the losses and other things
Akash
interpretation: it is a process in which we make a decision about a population on the basis of sample data . example: if we want to interpret the average income of employees for upcoming year so we have to interpret the income of employees on the basis of previous year's income of those employees
Saeed
thank you saeed, Akash. I understood.
Bhavani
how to remember all this formulas easy ly
no easy way
Pitior
best way is to do as many problems as possible
Pitior
Oh
Bhavani
is this the only one room? or separate room for separate users? 🤔
Bhavani
Bhavani
Finding correlation and regression
explain statistics whether it is a science or arts or both
I would say art is a creation. A chef is an artist. They create new dishes just like the painters. I believe one who creates something new, is an artist. So, Statistics is also an art, if you know it, you can create some new formula, theory, law, etcetera. It is also Science. So yes, it is both.
Rohan
how do you use the normal distribution table when testing the hypothesis
Davia
Rohan
percentages of all the possible outcomes are measured. This is so simple and bases on the questionnaire or interview schedule. It's just measuring the probability chances of high %age of the either part of the hypothesis ... dependent ..independent. data is classified on the basis of respondents
saifuddin
how find CV(x) and CV(y) if X: 3,7,5,4,6,9 &Y:4,9,6,4,7,8 please tell
Maham
what percent of the students would be expected to score above 95?
inferential statistics is what?
in which we make infrences (hypothsis)
surpose a data set of 2,3,5,6,1,4 are given find median
lucy
Mean (average) 4... Median (middle term) 3.5.. Mode (frequency) every element in a set has 1 frequrncy
Akash
i arrange the data set in ascending order. that is, 1,2,3,4,5,6. then find the data set that falls in the middle. in this case, 3 & 4 fall in the middle. you then sum and obtain the average. that is, (3+4)/2=3.5. therefore, 3.5 is the median.
Gbenga
both of you are correct.
Joseph
hello guys
Abasikponke
thanks
lucy
great to be here
King
how does a line graph look
King
hi
Davia
hello
lucy
pls who knows how line graph look like
King
line graph usually have a straight line running through axis
Dike
am new here anyone willing to orient me?
Timothy
find the media of the following numbers 61,64,67,70,73
my body pls
lucy
67
Benmike
Benmike
what is the percentile for the set of data in the class C and frequency F(c,f)given by (9.3-9.7,2) (9.8-10.2,5) (10.3-10.7,12) (10.8-11.2,17) (11.3-11.7,14) (11.8-12.2,6) (12.3-12.7,3) (12.8-13.2,1)
how to find median
arrange ascending and desending order than the mid value is Median
rajendra
ok
Hrishe
what if it is a group data
Oloyede
mean/ medium/ mode
Michelle
n\2 and n+1\2
An operational manager at a manufacturing company is interested in the level of satisfaction of computer buyers. The manager has developed a satisfaction scale of 1-10 to mark their level of understanding with the company.What is the population of the interest?
Any clues
Virtual