Standard Error of the Mean and Confidence Intervals for the Mean

The precision of the mean of a sample of data, as an estimate of some unknown or “true” vlaue of the mean of the population, can be described using the standard error of the mean, or by describing or plotting confidence intervals for the mean.

The standard error of the mean

The standard error of the mean is given by

                                             

where  is the standard deviation of the population and  is the sample size.  In practice, because the standard deviation of the population is not known, the standard deviation of the sample, , is used in place of .  The standard error of the mean (which is sometimes referred to as the standard deviation of the mean), has the nice property of increasing as the variability of the data increases, and decreasing as the sample size increases.

Confidence interval for the mean

The confidence interval for the mean is the range of values that is likely to enclose the “true” value of the mean some particular proportion of the time (e.g. as in repeated sampling of the same processor or phenomena).  The confidence interval for the mean can be described by noting that a) means are normally distributed (which is implied by the central limit theorem) and consequently b) the normal distribution can be used to describe the probability of observing different values of the mean.  The  confidence interval for the mean is written as

                                            

Where  is the sample mean,  is the standard error of the sample mean, and  is a value (obtained using the cumulative density function (cdf) of the normal distribution, such that  of the area under the pdf lies to the left of  (i.e.  of the values of  observed in practice are less than  ), and  of the area lies to the right of  (i.e.  of the values of  observed in practice are less than  ).  The width of the confidence interval depends on the magnitude of the standard error, and the desired precision.