askvity

What is the difference between standard deviation and MAD?

Published in Statistics 3 mins read

The core difference between standard deviation and Mean Absolute Deviation (MAD) lies in how they measure the spread of data around the mean; MAD calculates the average of the absolute differences from the mean, while standard deviation calculates the square root of the average of the squared differences from the mean.

In-Depth Comparison

To understand the nuances, let's break down each concept and highlight their differences:

Mean Absolute Deviation (MAD)

  • Definition: MAD calculates the average of the absolute differences between each data point and the mean of the dataset. In simpler terms, it measures the average distance each data point is from the center (mean) of the data.
  • Calculation:
    1. Calculate the mean of the dataset.
    2. Find the absolute difference between each data point and the mean. (Absolute value ignores negative signs)
    3. Calculate the average of these absolute differences.
  • Formula: MAD = Σ |xᵢ - μ| / n , where xᵢ is each data point, μ is the mean, and n is the number of data points.
  • Advantages: Easy to understand and calculate. Gives equal weight to all deviations.
  • Disadvantages: Less sensitive to extreme values than standard deviation. Mathematically less tractable than standard deviation, making it less useful for some statistical inferences.

Standard Deviation

  • Definition: Standard deviation also measures the spread of data around the mean, but it uses squared differences. It represents the square root of the variance.
  • Calculation:
    1. Calculate the mean of the dataset.
    2. Find the difference between each data point and the mean.
    3. Square each of these differences.
    4. Calculate the average of these squared differences (this is the variance).
    5. Take the square root of the variance.
  • Formula: σ = √[Σ (xᵢ - μ)² / (n-1)] for a sample standard deviation (using n-1 for degrees of freedom), or σ = √[Σ (xᵢ - μ)² / n] for a population standard deviation. xᵢ is each data point, μ is the mean, and n is the number of data points.
  • Advantages: More sensitive to extreme values. More mathematically tractable (easier to work with in statistical calculations). Fundamental to many statistical tests and models.
  • Disadvantages: More complex to calculate than MAD. Squaring the differences can inflate the impact of outliers.

Table Summarizing the Differences

Feature Mean Absolute Deviation (MAD) Standard Deviation
Calculation Average of absolute deviations Square root of variance
Sensitivity to Outliers Less Sensitive More Sensitive
Mathematical Properties Less Tractable More Tractable
Complexity Simpler More Complex
Interpretation Average absolute distance from the mean Typical deviation from the mean

Example

Let's consider a dataset: {2, 4, 6, 8, 10}

  • Mean: (2+4+6+8+10)/5 = 6
  • MAD: (|2-6| + |4-6| + |6-6| + |8-6| + |10-6|) / 5 = (4 + 2 + 0 + 2 + 4) / 5 = 2.4
  • Standard Deviation (sample): √[((2-6)² + (4-6)² + (6-6)² + (8-6)² + (10-6)²) / (5-1)] = √( (16 + 4 + 0 + 4 + 16) / 4) = √(40/4) = √10 ≈ 3.16

In this example, the standard deviation (3.16) is larger than the MAD (2.4), which is typical because squaring amplifies the larger deviations.

Conclusion

In summary, while both MAD and standard deviation measure data spread, standard deviation gives more weight to outliers and is mathematically more useful, making it the more common choice in statistical analysis. MAD is easier to understand and calculate, making it suitable for simpler applications.

Related Articles