Skewness describes how a distribution departs from perfect symmetry. It helps students interpret whether data cluster evenly around the center or stretch farther on one side. Understanding shape matters because mean, median, and mode behave differently in skewed data. It also affects which summary statistics and models give the clearest picture of a dataset.

A symmetric distribution has similar left and right sides, while a positively skewed distribution has a long tail to the right and a negatively skewed distribution has a long tail to the left. In skewed data, extreme values pull the mean toward the tail more strongly than they pull the median. This is why the order of mean, median, and mode can reveal the direction of skew. Recognizing distribution shape is useful in fields such as test scores, income data, reaction times, and measurement errors.

Key Facts

  • Symmetric distribution: left and right sides are approximately mirror images, so mean ≈ median ≈ mode.
  • Positively skewed distribution: long tail to the right, and typically mode < median < mean.
  • Negatively skewed distribution: long tail to the left, and typically mean < median < mode.
  • Skewness measures asymmetry of a distribution around its center.
  • A common sample skewness formula is g1=[n(n1)(n2)](xixˉs)3.g_1 = \left[\frac{n}{(n - 1)(n - 2)}\right] \sum \left(\frac{x_i - \bar{x}}{s}\right)^3.
  • Outliers in the tail can strongly affect the mean and increase the magnitude of skewness.

Vocabulary

Skewness
Skewness is a numerical or visual measure of how asymmetric a distribution is.
Symmetric distribution
A symmetric distribution has similar shape on both sides of its center.
Positive skew
Positive skew means the distribution has a longer tail on the right side.
Negative skew
Negative skew means the distribution has a longer tail on the left side.
Outlier
An outlier is a data value far from most of the other values and it can distort the shape of a distribution.

Common Mistakes to Avoid

  • Confusing the tail with the peak, because students often label skew by the side where most data are piled up instead of the side where the long tail extends. Skew direction is named by the tail, not by the tallest part.
  • Assuming mean, median, and mode stay together in skewed data, because that is only true for a perfectly symmetric distribution. In skewed distributions the mean is pulled toward the tail.
  • Treating every uneven histogram as skewed, because small samples can look lumpy just from random variation. Check the overall pattern rather than one or two bars.
  • Ignoring outliers when judging shape, because a few extreme values can create or exaggerate skewness. Always inspect unusual values before summarizing the distribution.

Practice Questions

  1. 1 A dataset has mean 18, median 15, and mode 12. Is the distribution most likely symmetric, positively skewed, or negatively skewed? Explain briefly.
  2. 2 For the data 2, 3, 3, 4, 5, 12, calculate the mean and median. Based on these values, state whether the distribution appears skewed and in which direction.
  3. 3 Two histograms have the same center and spread, but one has a long right tail while the other is nearly mirror symmetric. Explain how the mean and median would compare in each case and why.