As the mean depends on the first moment of variance and standard deviation depend on the second moment. It is not uncommon, in real life, to be dealing with a distribution whose second moment does not exist (i.e., is infinite). In this case, the variance or standard deviation is useless as a measure of the dataâs width around its central value: The values obtained from equations (14.1.2 - Variance) or (14.1.3- Standard Deviation) will not converge with increased numbers of points, nor show any consistency from data set to data set drawn from the same distribution. This can occur even when the width of the peak looks, by eye, perfectly finite. A more robust estimator of thewidth is the average deviation or mean absolute deviation, defined by [ ... formula didn't copy, sorry].
One often substitutes the sample median xmed for x in equation (14.1.4 - Average Deviation). For any fixed sample, the median in fact minimizes the mean absolute deviation.
Statisticians have historically sniffed at the use of (14.1.4 - Average Deviation) instead of (14.1.2 - Standard Deviation), since the absolute value brackets in (14.1.4) are ânonanalyticâ and make theorem-proving difficult. In recent years, however, the fashion has changed, and the subject of robust estimation (meaning estimation for broad distributions with significant numbers of âoutlierâ points) has become a popular and important one. Higher moments, or statistics involving higher powers of the input data, are almost always less robust than lower moments or statistics that involve only linear sums or (the lowest moment of all) counting.