<< Chapter < Page Chapter >> Page >

Try it

Construct a frequency polygon of U.S. Presidents’ ages at inauguration shown in [link] .

Age at Inauguration Frequency
41.5–46.5 4
46.5–51.5 11
51.5–56.5 14
56.5–61.5 9
61.5–66.5 4
66.5–71.5 2

The first label on the x -axis is 39. This represents an interval extending from 36.5 to 41.5. Since there are no ages less than 41.5, this interval is used only to allow the graph to touch the x -axis. The point labeled 44 represents the next interval, or the first “real” interval from the table, and contains four scores. This reasoning is followed for each of the remaining intervals with the point 74 representing the interval from 71.5 to 76.5. Again, this interval contains no data and is only used so that the graph will touch the x -axis. Looking at the graph, we say that this distribution is skewed because one side of the graph does not mirror the other side.

This figure shows a graph entitled, 'President's Age at Inauguration.' The x-axis is labeled 'Ages' and is marked off at 39, 44, 49, 54, 59, 64, 69 and 74. The y-axis is labeled, 'Frequency,' and is marked off in intervals of 1 from 0 to 15. The following points are plotted and a line connects one to the other to create the frequency polygon: (39, 0), (44, 4), (49, 11), (54, 14), (59, 9), (64, 4), (69, 2), (74, 0).
Got questions? Get instant answers now!

Frequency polygons are useful for comparing distributions. This is achieved by overlaying the frequency polygons drawn for different data sets.

We will construct an overlay frequency polygon comparing the scores from [link] with the students’ final numeric grade.

Frequency Distribution for Calculus Final Test Scores
Lower Bound Upper Bound Frequency Cumulative Frequency
49.5 59.5 5 5
59.5 69.5 10 15
69.5 79.5 30 45
79.5 89.5 40 85
89.5 99.5 15 100
Frequency Distribution for Calculus Final Grades
Lower Bound Upper Bound Frequency Cumulative Frequency
49.5 59.5 10 10
59.5 69.5 10 20
69.5 79.5 30 50
79.5 89.5 45 95
89.5 99.5 5 100
This is an overlay frequency polygon that matches the supplied data. The x-axis shows the grades, and the y-axis shows the frequency.
Got questions? Get instant answers now!

Suppose that we want to study the temperature range of a region for an entire month. Every day at noon we note the temperature and write this down in a log. A variety of statistical studies could be done with this data. We could find the mean or the median temperature for the month. We could construct a histogram displaying the number of days that temperatures reach a certain range of values. However, all of these methods ignore a portion of the data that we have collected.

One feature of the data that we may want to consider is that of time. Since each date is paired with the temperature reading for the day, we don‘t have to think of the data as being random. We can instead use the times given to impose a chronological order on the data. A graph that recognizes this ordering and displays the changing temperature as the month progresses is called a time series graph.

Constructing a time series graph

To construct a time series graph, we must look at both pieces of our paired data set . We start with a standard Cartesian coordinate system. The horizontal axis is used to plot the date or time increments, and the vertical axis is used to plot the values of the variable that we are measuring. By doing this, we make each point on the graph correspond to a date and a measured quantity. The points on the graph are typically connected by straight lines in the order in which they occur.

The following data shows the Annual Consumer Price Index, each month, for ten years. Construct a time series graph for the Annual Consumer Price Index data only.

Year Jan Feb Mar Apr May Jun Jul
2003 181.7 183.1 184.2 183.8 183.5 183.7 183.9
2004 185.2 186.2 187.4 188.0 189.1 189.7 189.4
2005 190.7 191.8 193.3 194.6 194.4 194.5 195.4
2006 198.3 198.7 199.8 201.5 202.5 202.9 203.5
2007 202.416 203.499 205.352 206.686 207.949 208.352 208.299
2008 211.080 211.693 213.528 214.823 216.632 218.815 219.964
2009 211.143 212.193 212.709 213.240 213.856 215.693 215.351
2010 216.687 216.741 217.631 218.009 218.178 217.965 218.011
2011 220.223 221.309 223.467 224.906 225.964 225.722 225.922
2012 226.665 227.663 229.392 230.085 229.815 229.478 229.104
Year Aug Sep Oct Nov Dec Annual
2003 184.6 185.2 185.0 184.5 184.3 184.0
2004 189.5 189.9 190.9 191.0 190.3 188.9
2005 196.4 198.8 199.2 197.6 196.8 195.3
2006 203.9 202.9 201.8 201.5 201.8 201.6
2007 207.917 208.490 208.936 210.177 210.036 207.342
2008 219.086 218.783 216.573 212.425 210.228 215.303
2009 215.834 215.969 216.177 216.330 215.949 214.537
2010 218.312 218.439 218.711 218.803 219.179 218.056
2011 226.545 226.889 226.421 226.230 225.672 224.939
2012 230.379 231.407 231.317 230.221 229.601 229.594
This is a times series graph that matches the supplied data. The x-axis shows years from 2003 to 2012, and the y-axis shows the annual CPI.
Got questions? Get instant answers now!
Got questions? Get instant answers now!

Questions & Answers

Example of discrete variable
Bada Reply
sales made monthly.
Gbenga
How to answer quantitative data
Alhassan Reply
hi
Kachalla
what's up here ... am new here
Kachalla
sorry question a bit unclear...do you mean how do you analyze quantitative data? If yes, it depends on the specific question(s) you set in the beginning as well as on the data you collected. So the method of data analysis will be dependent on the data collecter and questions asked.
Bheka
how to solve for degree of freedom
saliou
Quantitative data is the data in numeric form. For eg: Income of persons asked is 10,000. This data is quantitative data on the other hand data collected for either make or female is qualitative data.
Rohan
*male
Rohan
Degree of freedom is the unconditionality. For example if you have total number of observations n, and you have to calculate variance, obviously you will need mean for that. Here mean is a condition, without which you cannot calculate variance. Therefore degree of freedom for variance will be n-1.
Rohan
data that is best presented in categories like haircolor, food taste (good, bad, fair, terrible) constitutes qualitative data
Bheka
vegetation types (grasslands, forests etc) qualitative data
Bheka
I don't understand how you solved it can you teach me
Caleb Reply
solve what?
Ambo
What is the end points of a confidence interval called?
ZIMKHITHA Reply
lower and upper endpoints
Bheka
Class members write down the average time (in hours, to the nearest half-hour) they sleep per night.
William Reply
how we make a classes of this(170.3,173.9,171.3,182.3,177.3,178.3,174.175.3)
Sarbaz
6.5
phoenix
11
Shakir
7.5
Ron
why is always lower class bundry used
Caleb
Assume you are in a class where quizzes are 20% of your grade, homework is 20%, exam _1 is 15%,exam _2 is 15%, and the final exam is 20%.Suppose you are in the fifth week and you just found out that you scored a 58/63 on the fist exam. You also know that you received 6/9,8/10,9/9 on the first
Diamatu Reply
quizzes as well as a 9/11,10/10,and 4.5/7 on the first three homework assignment. what is your current grade in the course?
Diamatu
the answer is 2.6
Abdul
if putting y=3x examine that correlation coefficient between x and y=3x is 1.
Aadrsh Reply
what is permutation
Rodlett Reply
how to construct a histogram
Baalisi Reply
You have to plot the class midpoint and the frequency
Wydny
ok so you use those two to draw the histogram right.
Amford
yes
Wydny
ok can i be a friend so you can be teaching me small small
Amford
how do you calculate cost effectiveness?
George
Hi everyone, this is a very good statistical group and am glad to be part of it. I'm just not sure how did I end up here cos this discussion just popes on my screen so if I wanna ask something in the future, how will I find you?
Bheka
To make a histogram, follow these steps: On the vertical axis, place frequencies. Label this axis "Frequency". On the horizontal axis, place the lower value of each interval. ... Draw a bar extending from the lower value of each interval to the lower value of the next interval.
Divya
I really appreciate that
umar Reply
I want to test linear regression data such as maintenance fees vs house size. Can I use R square, F test to test the relationship? Is the good condition of R square greater than 0.5
Mok Reply
yes of course must have use f test and also use t test individually multple coefficients
rishi
Alright
umar
hi frnd I'm akeem by name, I wanna study economics and statistics wat ar d thing I must do to b a great economist
akeem
Is R square cannot analysis linear regression of X vs Y relationship?
Mok
To be an economist you have to be professional in maths
umar
hi frnds
Shehu
what is random sampling what is sample error
Nistha Reply
@Nistha Kashyap Random sampling is the selection of random items (or random numbers) from the group. A sample error occurs when the selected samples do not truely represent the whole group. The can happen when most or all of the selected samples are taken from only one section of the group;
Ron
Thus the sample is not truely random.
Ron
What is zero sum game?
Hassan Reply
A game in which there is no profit & no loss to any of the both player.
Milan
Differences between sample mean & population mean
mohammed Reply
***keydifferences.com/difference-between-sample-mean-and-population-mean.html
Lucien
Not difference in the formula except the notation, sample mean is denoted by x bar and population mean is denoted by mu symbol. There is formula as well as notation between difference variance and standard deviations
Akash
Likely the difference would be in the result, unless the sample is an exact representation of the population (which is unlikely.)
Ron
what is data
Nii
Nii Avin - Data is just a simple way to refer to the numbers in the population, or in the sample used in your calculations.
Ron
what are the types of data
Nii
Data is the very pale android from the Star Trek Enterprise
Andrew
Am Emmanuel from Nigeria
Emmanuel
Am Qudus from Nigeria
Rasak
am Handson from Cameroon
Handson
what is a mode?
Handson
Nii - data is whatever you are sampling. Such as the number of students in each classroom.
Ron
Handson Ndintek - the mode is the number appearing most frequently. Example: 7 9 11 7 4 6 3 7 2. 7 is the mode. In a group such as 7 9 1 4 6 3, there is no mode because no number appears more often than any other.
Ron
hi I want to know how to find class boundary
Baalisi
give me the two types of data
Neddy Reply
qualitative and quantitative
phoenix
primary and secondary data
Peace
qualitative and quantitative
Prince

Get the best Introductory statistics course in your pocket!





Source:  OpenStax, Introductory statistics. OpenStax CNX. May 06, 2016 Download for free at http://legacy.cnx.org/content/col11562/1.18
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introductory statistics' conversation and receive update notifications?

Ask