Histograms Explained
Bins, Shape, Skew & Distribution
Related Labs
Related Worksheets
Related Cheat Sheets
A histogram is a graph used to display the distribution of numerical data by grouping values into intervals called bins. It helps students see where data are concentrated, how spread out they are, and whether the shape is symmetric, skewed, or clustered. Histograms are important because they turn long lists of numbers into patterns that can be interpreted quickly. They are widely used in science, economics, psychology, and quality control.
To build a histogram, first divide the data range into equal-width bins, then count how many data values fall in each bin. Each bar represents a bin, and the height of the bar shows the frequency or sometimes the relative frequency. Unlike a bar chart, the bars in a histogram touch because the intervals are continuous numerical ranges. The overall shape can reveal useful features such as peaks, gaps, outliers, and possible center.
Key Facts
- A histogram displays quantitative data grouped into bins.
- Frequency = .
- Relative frequency = .
- Bin width = (maximum value - minimum value) / number of bins.
- In a histogram, bars touch because the data intervals are continuous.
- The area of a bar represents the amount of data in that interval when bin widths are equal.
Vocabulary
- Histogram
- A graph that shows how numerical data are distributed across intervals.
- Bin
- A bin is an interval of values used to group data in a histogram.
- Frequency
- Frequency is the number of data points that fall within a given bin.
- Relative frequency
- Relative frequency is the fraction or percent of the total data that falls in a bin.
- Distribution
- A distribution describes how data values are spread across possible values or intervals.
Common Mistakes to Avoid
- Using categories instead of numerical intervals, which is wrong because histograms are for quantitative data, not separate labels like favorite colors or car brands.
- Leaving gaps between bars, which is wrong because histogram bins represent continuous intervals and should touch unless there is an actual empty interval.
- Choosing unequal bin widths without noting it, which is wrong because it can distort the visual comparison of frequencies across intervals.
- Reading bar height as the exact data value, which is wrong because the bar height shows how many data points fall in an interval, not the individual values themselves.
Practice Questions
- 1 A class recorded these quiz scores: 52, 55, 57, 61, 64, 66, 68, 71, 73, 74, 78, 82. Using bins 50 to 59, 60 to 69, 70 to 79, and 80 to 89, find the frequency in each bin.
- 2 A data set has minimum 12 and maximum 42, and you want 5 equal-width bins. Use bin width = (maximum value - minimum value) / number of bins to calculate the bin width.
- 3 A histogram has most bars clustered on the left and a long tail extending to the right. Explain what this says about the shape of the distribution and what it suggests about the data.