The Empirical Rule is a quick way to estimate how data are spread out when the distribution is approximately normal, or bell shaped. It tells you what percent of values lie within 11, 22, and 33 standard deviations of the mean. This matters because many real measurements, test scores, and natural variations are modeled well by a normal distribution. With one mean and one standard deviation, you can make fast predictions about typical and unusual values.

In a normal distribution, the mean sits at the center of the bell curve and the curve is symmetric on both sides. The Empirical Rule says about 68%68\% of values fall within 11 standard deviation of the mean, about 95%95\% within 22, and about 99.7%99.7\% within 33. These percentages help you estimate probabilities, identify outliers, and interpret data without calculating every area exactly. The rule is most useful when the histogram of the data looks roughly mound shaped and symmetric.

Key Facts

  • About 68% of data in a normal distribution lie between μ - σ and μ + σ.
  • About 95% of data in a normal distribution lie between μ - 2σ and μ + 2σ.
  • About 99.7% of data in a normal distribution lie between μ - 3σ and μ + 3σ.
  • Standard score formula: z = (x - μ)/σ
  • For a normal distribution, mean=median=mode\text{mean} = \text{median} = \text{mode}.
  • About 5% of data lie outside μ ± 2σ, and about 0.3% lie outside μ ± 3σ.

Vocabulary

Normal distribution
A symmetric, bell-shaped distribution where values cluster around the center and become less common farther away.
Mean
The average value of a data set, which is the center of a normal distribution.
Standard deviation
A measure of how spread out the data are from the mean.
Empirical Rule
A rule stating that about 68%, 95%, and 99.7% of normal data fall within 11, 22, and 33 standard deviations of the mean.
z-score
The number of standard deviations a value is above or below the mean\text{mean}.

Common Mistakes to Avoid

  • Using the Empirical Rule for any data set, even when the distribution is skewed or irregular. This is wrong because the rule only works well for data that are approximately normal.
  • Confusing the mean\text{mean} with the standard deviation\text{standard deviation}, then placing interval boundaries at the wrong values. This is wrong because the mean\text{mean} gives the center, while the standard deviation\text{standard deviation} gives the distance from the center.
  • Thinking 95% means exactly 95 out of every 100 values must fall within 22 standard deviations. This is wrong because the rule gives an approximate percentage for a normal model, not a guaranteed count in every sample.
  • Forgetting that the intervals are symmetric around the mean\text{mean}. This is wrong because in a normal distribution you must go the same number of standard deviations to the left and right of μ\mu.

Practice Questions

  1. 1 A test score distribution is approximately normal with mean\text{mean} 7070 and standard deviation\text{standard deviation} 88. According to the Empirical Rule, about what percent of students scored between 6262 and 7878?
  2. 2 The heights of a plant species are approximately normal with mean\text{mean} 4545 cm and standard deviation\text{standard deviation} 33 cm. Give the interval that contains about 95%95\% of the plants.
  3. 3 A data set is strongly right-skewed. Explain whether the Empirical Rule should be used and why.