
The simplest probability density function is the hypergeometric. This is the most basic one because it is created by combining our knowledge of probabilities from Venn diagrams, the addition and multiplication rules, and the combinatorial counting formula.

To find the number of ways to get 2 aces from the four in the deck we computed:

$$\binom{4}{2} = \frac{4!}{2!\,(4-2)!} = 6$$

And if we did not care what else we had in our hand for the other three cards we would compute:

$$\binom{48}{3} = \frac{48!}{3!\,45!} = 17{,}296$$

Putting this together, we can compute the probability of getting exactly two aces in a 5 card poker hand as:

$$\frac{\binom{4}{2}\binom{48}{3}}{\binom{52}{5}} = 0.0399$$
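As a quick check of the arithmetic above, the same calculation can be sketched in Python using the standard library's `math.comb`. The variable names are only illustrative.

```python
from math import comb

# Ways to choose 2 of the 4 aces, and 3 of the 48 non-aces.
ways_aces = comb(4, 2)       # 6
ways_others = comb(48, 3)    # 17,296

# Total number of possible 5-card hands from a 52-card deck.
total_hands = comb(52, 5)    # 2,598,960

# Probability of exactly two aces in a 5-card hand.
p_two_aces = ways_aces * ways_others / total_hands
print(round(p_two_aces, 4))  # 0.0399
```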

This solution is really just the probability distribution known as the Hypergeometric. The generalized formula is:

$$h(x) = \frac{\binom{A}{x}\binom{N-A}{n-x}}{\binom{N}{n}}$$

where x = the number we are interested in coming from the group with A objects.

h(x) is the probability of x successes, in n attempts, when A successes (aces in this case) are in a population that contains N elements. The hypergeometric distribution is an example of a discrete probability distribution because there is no possibility of partial success, that is, there can be no poker hands with 2 1/2 aces. Said another way, a discrete random variable has to be a whole, or counting, number only. This probability distribution works in cases where the probability of a success changes with each draw. Another way of saying this is that the events are NOT independent. In using a deck of cards, we are sampling WITHOUT replacement. If we put each card back after it was drawn then the hypergeometric distribution would not work.
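The general formula can be sketched as a small Python function. The name `hypergeom_pmf` and its argument order are assumptions made for this illustration and are not part of the text. The example evaluates h(x) for every possible number of aces in a 5 card hand and checks that the probabilities add up to 1.

```python
from math import comb

def hypergeom_pmf(x, A, N, n):
    """h(x): probability of x successes in n draws without replacement
    from a population of N elements, A of which are successes."""
    # Outcomes that cannot occur have probability zero.
    if x < 0 or x > n or x > A or n - x > N - A:
        return 0.0
    return comb(A, x) * comb(N - A, n - x) / comb(N, n)

# Number of aces (A = 4) in a 5-card hand (n = 5) from a 52-card deck (N = 52).
probs = [hypergeom_pmf(x, A=4, N=52, n=5) for x in range(6)]
for x, p in enumerate(probs):
    print(x, round(p, 4))

# The probabilities over all possible x sum to 1 (up to rounding error).
print(sum(probs))
```

Only whole-number values of x are evaluated, which reflects the discreteness of the random variable.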

For the hypergeometric to work,

  1. the population must be divisible into two and only two mutually exclusive subsets (aces and non-aces in our example). The random variable X = the number of items from the group of interest.
  2. the probability of success must change with each draw (the fact that cards are not replaced after each draw makes this true in our example). Another way to say this is that you sample without replacement and therefore the picks are not independent.
  3. the random variable must be discrete, rather than continuous.
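To see why the probability of success changes from draw to draw, here is a minimal sketch using exact fractions for the card example. The variable names are illustrative only.

```python
from fractions import Fraction

aces, cards = 4, 52

# Probability that the first card drawn is an ace.
p_first = Fraction(aces, cards)

# Probability that the second card is an ace, given that the first card
# was an ace and was not put back: one fewer ace and one fewer card remain.
p_second_given_first = Fraction(aces - 1, cards - 1)

print(p_first, p_second_given_first)  # 1/13 1/17
```

Because the second probability depends on what happened on the first draw, the draws are not independent.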

A candy dish contains 30 jelly beans and 20 gumdrops. Ten candies are picked at random. What is the probability that 5 of the 10 are gumdrops? The two groups are jelly beans and gumdrops. Since the question asks for the probability of picking gumdrops, the group of interest (the first group, A in the formula) is gumdrops. The size of the group of interest (first group) is 20. The size of the second group is 30. The size of the sample is 10 (jelly beans or gumdrops). Let X = the number of gumdrops in the sample of 10. X takes on the values x = 0, 1, 2, ..., 10.
a. What is the probability statement written mathematically?
b. What is the hypergeometric probability density function written out to solve this problem?
c. What is the answer to the question "What is the probability of drawing 5 gumdrops in 10 picks from the dish?"

a. $P(x = 5)$
b. $P(x = 5) = \dfrac{\binom{20}{5}\binom{30}{5}}{\binom{50}{10}}$
c. $P(x = 5) = 0.215$
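The answer to part c can be verified numerically. This is a short sketch with illustrative variable names, taking the 20 gumdrops as the group of interest.

```python
from math import comb

gumdrops = 20      # size of the group of interest (A)
jelly_beans = 30   # size of the second group (N - A)
sample = 10        # number of candies picked (n)
x = 5              # number of gumdrops we want in the sample

population = gumdrops + jelly_beans  # N = 50
p = comb(gumdrops, x) * comb(jelly_beans, sample - x) / comb(population, sample)
print(round(p, 3))  # 0.215
```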

Try it

A bag contains letter tiles. Forty-four of the tiles are vowels, and 56 are consonants. Seven tiles are picked at random. You want to know the probability that four of the seven tiles are vowels. What is the group of interest, the size of the group of interest, and the size of the sample?

The group of interest is the vowel letter tiles. The size of the group of interest is 44. The size of the sample is seven.

Chapter review

The combinatorial formula can provide the number of unique subsets of size x that can be created from n unique objects to help us calculate probabilities. The combinatorial formula is $\binom{n}{x} = {}_nC_x = \dfrac{n!}{x!\,(n-x)!}$.
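The combinatorial formula can be written directly from its factorial definition. In this sketch the helper name `n_choose_x` is an assumption for illustration, and its results are compared with Python's built-in `math.comb`.

```python
from math import comb, factorial

def n_choose_x(n, x):
    """Number of unique subsets of size x from n unique objects."""
    # Integer division is exact here because the quotient is always a whole number.
    return factorial(n) // (factorial(x) * factorial(n - x))

print(n_choose_x(4, 2))    # 6
print(n_choose_x(48, 3))   # 17296
print(n_choose_x(48, 3) == comb(48, 3))  # True
```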

A hypergeometric experiment is a statistical experiment with the following properties:

  1. You take samples from two groups.
  2. You are concerned with a group of interest, called the first group.
  3. You sample without replacement from the combined groups.
  4. Each pick is not independent, since sampling is without replacement.

The outcomes of a hypergeometric experiment fit a hypergeometric probability distribution. The random variable X = the number of items from the group of interest. $h(x) = \dfrac{\binom{A}{x}\binom{N-A}{n-x}}{\binom{N}{n}}$.

Formula review

$$h(x) = \frac{\binom{A}{x}\binom{N-A}{n-x}}{\binom{N}{n}}$$

Use the following information to answer the next five exercises: Suppose that a group of statistics students is divided into two groups: business majors and non-business majors. There are 16 business majors in the group and seven non-business majors in the group. A random sample of nine students is taken. We are interested in the number of business majors in the sample.

In words, define the random variable X.

X = the number of business majors in the sample.

What values does X take on?

2, 3, 4, 5, 6, 7, 8, 9
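The smallest value is 2 because only seven non-business majors are available, so a sample of nine must contain at least two business majors. A minimal sketch of this reasoning follows; the function name `possible_values` is hypothetical.

```python
def possible_values(A, N, n):
    """Values X can take when drawing n items without replacement
    from N items, A of which belong to the group of interest."""
    # At least n - (N - A) draws must come from the group of interest
    # once the other group is exhausted; never fewer than 0.
    smallest = max(0, n - (N - A))
    # At most n draws, and no more than the A items available.
    largest = min(n, A)
    return list(range(smallest, largest + 1))

# 16 business majors, 7 non-business majors, sample of 9 students.
print(possible_values(A=16, N=23, n=9))  # [2, 3, 4, 5, 6, 7, 8, 9]
```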

Source:  OpenStax, Introductory statistics. OpenStax CNX. Aug 09, 2016 Download for free at http://legacy.cnx.org/content/col11776/1.26