<< Chapter < Page Chapter >> Page >
This course is a short series of lectures on Introductory Statistics. Topics covered are listed in the Table of Contents. The notes were prepared by EwaPaszek and Marek Kimmel. The development of this course has been supported by NSF 0203396 grant.

The t distribution

In probability and statistics, the t-distribution or Student's distribution arises in the problem of estimating the mean of a normally distributed population when the sample size is small, as well as when (as in nearly all practical statistical work) the population standard deviation is unknown and has to be estimated from the data.

    Textbook problems treating the standard deviation as if it were known are of two kinds:

  • those in which the sample size is so large that one may treat a data-based estimate of the variance as if it were certain,
  • those that illustrate mathematical reasoning, in which the problem of estimating the standard deviation is temporarily ignored because that is not the point that the author or instructor is then explaining.

The t distribution

t Distribution
If Z is a random variable that is N ( 0,1 ) , if U is a random variable that is χ 2 ( r ) , and if Z and U are independent, then
T = Z U / r = X ¯ μ S / n
has a t distribution with r degrees of freedom.

Where μ is the population mean, x ¯ is the sample mean and s is the estimator for population standard deviation (i.e., the sample variance) defined by

s 2 = 1 N 1 i = 1 N ( x i x ¯ ) 2 .

If σ = s , t = z , the distribution becomes the normal distribution. As N increases, Student’s t distribution approaches the normal distribution . It can be derived by transforming student’s z -distribution using z x ¯ μ s and then defining t = z n 1 .

The resulting probability and cumulative distribution functions are:

f ( t ) = Γ [ ( r + 1 ) / 2 ] π r Γ ( r / 2 ) ( 1 + t 2 / r ) ( r + 1 ) / 2 ,
F ( t ) = 1 2 + 1 2 [ I ( 1 ; 1 2 r , 1 2 ) I ( r r + t 2 , 1 2 r , 1 2 ) ] sgn ( t ) = 1 2 i t B ( t 2 r ; 1 2 , 1 2 ( 1 r ) ) Γ ( 1 2 ( r + 1 ) ) 2 π | t | Γ ( 1 2 r )

where,

  • r = n 1 is the number of degrees of freedom,
  • < t < ,
  • Γ ( z ) is the gamma function,
  • B ( a , b ) is the bets function,
  • I ( z ; a , b ) is the regularized beta function defined by I ( z ; a , b ) = B ( z ; a , b ) B ( a , b ) .

The effect of degree of freedom on the t distribution is illustrated in the four t distributions on the Figure 1 .

p.d.f. of the t distribution for degrees of freedom r =3, r =6, r = .

In general, it is difficult to evaluate the distribution function of T . Some values are usually given in the tables. Also observe that the graph of the p.d.f. of T is symmetrical with respect to the vertical axis t =0 and is very similar to the graph of the p.d.f. of the standard normal distribution N ( 0,1 ) . However the tails of the t distribution are heavier that those of a normal one; that is, there is more extreme probability in the t distribution than in the standardized normal one. Because of the symmetry of the t distribution about t =0, the mean (if it exists) must be equal to zero. That is, it can be shown that E ( T ) = 0 when r 2 . When r =1 the t distribution is the Cauchy distribution , and thus both the variance and mean do not exist.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Introduction to statistics. OpenStax CNX. Oct 09, 2007 Download for free at http://cnx.org/content/col10343/1.3
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Introduction to statistics' conversation and receive update notifications?

Ask