<< Chapter < Page Chapter >> Page >
This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by EwaPaszek and Marek Kimmel. The development of this course has been supported by NSF 0203396 grant.

Maximum likelihood estimation - examples

Exponential distribution

Let X 1 , X 2 , ... , X n be a random sample from the exponential distribution with p.d.f.

f ( x ; θ ) = 1 θ e x / θ , 0 < x < , θ Ω = { θ ; 0 < θ < } .

The likelihood function is given by

L ( θ ) = L ( θ ; x 1 , x 2 , ... , x n ) = ( 1 θ e x 1 / θ ) ( 1 θ e x 2 / θ ) · · · ( 1 θ e x n / θ ) = 1 θ n exp ( i = 1 n x i θ ) , 0 < θ < .

The natural logarithm of L ( θ ) is ln L ( θ ) = ( n ) ln ( θ ) 1 θ i = 1 n x i , 0 < θ < .

Thus, d [ ln L ( θ ) ] d θ = n θ + i = 1 n x i θ 2 = 0. The solution of this equation for θ is θ = 1 n i = 1 n x i = x ¯ .

Note that, d [ ln L ( θ ) ] d θ = 1 θ ( n + n x ¯ θ ) > 0 , θ < x ¯ ,

d [ ln L ( θ ) ] d θ = 1 θ ( n + n x ¯ θ ) = 0 , θ = x ¯ ,

d [ ln L ( θ ) ] d θ = 1 θ ( n + n x ¯ θ ) < 0 , θ > x ¯ ,

Hence, ln L ( θ ) does have a maximum at x ¯ , and thus the maximum likelihood estimator for θ is θ ^ = X ¯ = 1 n i = 1 n X i . This is both an unbiased estimator and the method of moments estimator for θ .

Geometric distribution

Let X 1 , X 2 , ... , X n be a random sample from the geometric distribution with p.d.f. f ( x ; p ) = ( 1 p ) x 1 p , x = 1 , 2 , 3 , ....

The likelihood function is given by L ( p ) = ( 1 p ) x 1 1 p ( 1 p ) x 2 1 p · · · ( 1 p ) x n 1 p = p n ( 1 p ) x i n , 0 p 1.

The natural logarithm of L ( θ ) is ln L ( p ) = n ln p + ( i = 1 n x i n ) ln ( 1 p ) , 0 < p < 1.

Thus restricting p to 0 < p < 1 so as to be able to take the derivative, we have d ln L ( p ) d p = n p i = 1 n x i n 1 p = 0.

Solving for p , we obtain p = n i = 1 n x i = 1 x ¯ . So the maximum likelihood estimator of p is p ^ = n i = 1 n X i = 1 X

Again this estimator is the method of moments estimator, and it agrees with the intuition because, in n observations of a geometric random variable, there are n successes in the i = 1 n x i trials. Thus the estimate of p is the number of successes divided by the total number of trials.

Normal distribution

Let X 1 , X 2 , ... , X n be a random sample from N ( θ 1 , θ 2 ) , where Ω = ( ( θ 1 , θ 2 ) : < θ 1 < , 0 < θ 2 < ) . That is, here let θ 1 = μ and θ 2 = σ 2 . Then L ( θ 1 , θ 2 ) = i 1 n ( 1 2 π θ 2 exp [ ( x i θ 1 ) 2 2 θ 2 ] ) , or equivalently, L ( θ 1 , θ 2 ) = ( 1 2 π θ 2 ) n exp [ i = 1 n ( x i θ 1 ) 2 2 θ 2 ] , ( θ 1 , θ 2 ) Ω . The natural logarithm of the likelihood function is ln L ( θ 1 , θ 2 ) = n 2 ln ( 2 π θ 2 ) i = 1 n ( x i θ 1 ) 2 2 θ 2 .

The partial derivatives with respect to θ 1 and θ 2 are ( ln L ) θ 1 = 1 θ 2 i = 1 n ( x i θ 1 ) and ( ln L ) θ 2 = n 2 θ 2 + 1 2 θ 2 2 i = 1 n ( x i θ 1 ) 2 .

The equation ( ln L ) θ 1 = 0 has the solution θ 1 = x ¯ . Setting ( ln L ) θ 2 = 0 and replacing θ 1 by x ¯ yields θ 2 = 1 n i = 1 n ( x i x ¯ ) 2 .

By considering the usual condition on the second partial derivatives, these solutions do provide a maximum. Thus the maximum likelihood estimators μ = θ 1 and σ 2 = θ 2 are θ ^ 1 = X ¯ and θ ^ 2 = 1 n i = 1 n ( X i X ¯ ) 2 .

Where we compare the above example with the introductory one, we see that the method of moments estimators and the maximum likelihood estimators for μ and σ 2 are the same. But this is not always the case. If they are not the same, which is better? Due to the fact that the maximum likelihood estimator of θ has an approximate normal distribution with mean θ and a variance that is equal to a certain lower bound, thus at least approximately, it is unbiased minimum variance estimator. Accordingly, most statisticians prefer the maximum likelihood estimators than estimators found using the method of moments.

Binomial distribution

Observations: k successes in n Bernoulli trials.

f ( x ) = n ! x ! ( n x ) ! p x ( 1 p ) n x

L ( p ) = i = 1 n f ( x i ) = i = 1 n ( n ! x i ! ( n x i ) ! p x i ( 1 p ) n x i ) = ( i = 1 n n ! x i ! ( n x i ) ! ) p x i ( 1 p ) n i = 1 n x i

ln L ( p ) = i = 1 n x i ln p + ( n i = 1 n x i ) ln ( 1 p )

d ln L ( p ) d p = 1 p i = 1 n x i ( n i = 1 n x i ) 1 1 p = 0

( 1 p ^ ) i = 1 n x i ( n i = 1 n x i ) p ^ p ^ ( 1 p ^ ) = 0

i = 1 n x i p ^ i = 1 n x i n p ^ + i = 1 n x i p ^ = 0

p ^ = i = 1 n x i n = k n

Poisson distribution

Observations: x 1 , x 2 , ... , x n , f ( x ) = λ x e λ x ! , x = 0 , 1 , 2 , ...

L ( λ ) = i = 1 n ( λ x i e λ x i ! ) = e λ n λ i = 1 n x i i = 1 n x i

ln L ( λ ) = λ n + i = 1 n x i ln λ ln ( i = 1 n x i )

d l d λ = n + i = 1 n x i 1 λ

n + i = 1 n x i 1 λ = 0

λ ^ = i = 1 n x i n

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Introduction to statistics. OpenStax CNX. Oct 09, 2007 Download for free at http://cnx.org/content/col10343/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introduction to statistics' conversation and receive update notifications?

Ask