Estimating the mean of a Normal distribution when the distribution's standard deviation is unknown

 Download a complete copy of this risk analysis resource for free here.


For a given set of n data values randomly sampled from an assumed Normal distribution, with unknown mean m and unknown standard deviation s, the distribution of uncertainty of the true mean is calculated from a Student-t distribution:

                                     (1)

where t(n-1) is a Student-t distribution with (n-1) degrees of freedom.

[This page provides an explanation of the derivation of Equation 1]

 is the unbiased single point estimate of the true standard deviation (calculated by STDEV( ) in Excel), given by:

The Student-t distribution is unimodal and symmetric about zero. The formula therefore centres the uncertainty distribution of the value of the true mean m around the sample mean x which is the 'best guess'. It also has a spread that increases with the standard deviation  and decreases with the square root of the sample size n. The Student-t distribution looks quite like a unit Normal distribution but flatter, with greater spread than the unit Normal distribution: a Student(n) distribution has a standard deviation of  compared with a standard deviation of 1 for the unit Normal distribution:

Figure 1 Examples of the Student-t distribution

In fact, the larger n gets, the closer the Student-t distribution approaches a unit Normal distribution (i.e. Normal(0, 1)). So, for large n (greater than 20 is usually fine), Equation 1 is very well approximated by:

This spreadsheet example model lets you generate values for the above uncertainty distribution for m for a data set.

Comparison with the Bayesian approach

The Bayesian derivation of Equation 2 is given here.