Exploratory Data & Distribution Lab
Paste or type a dataset, and instantly see descriptive statistics, five-number summaries, outlier detection, and interactive visualizations. Compare two groups side by side with histograms, box plots, and dot plots.
Guided Experiment: Shape, Center, and Spread
How do mean, median, and standard deviation change when a dataset is symmetric versus skewed? Will the mean always be close to the median?
Write your hypothesis in the Lab Report panel, then click Next.
Histogram
Controls
Descriptive Statistics
| Measure | Value |
|---|---|
| Count (n) | 15 |
| Mean (x̄) | 84.6 |
| Median | 85 |
| Mode | None |
| Range | 22 |
| Q1 | 81 |
| Q3 | 88.5 |
| IQR | 7.5 |
| Std Dev (s) | 6.1272 |
| Variance (s²) | 37.5429 |
| Skewness | -0.2402 |
| Kurtosis (excess) | -0.4249 |
| Shape | roughly symmetric |
Data Table
(0 rows)| # | Trial | Dataset | n | Mean | Median | Std Dev | Skewness | Outliers | Shape |
|---|
Reference Guide
Measures of Center
The mean, median, and mode each describe the "center" of a dataset in different ways. The mean uses every value, the median splits the data in half, and the mode is the most frequent value.
When data is symmetric, mean and median are close. When data is skewed, the mean gets pulled toward the tail.
Measures of Spread
Range, IQR, variance, and standard deviation quantify how spread out the data is. The sample standard deviation uses n-1 in the denominator (Bessel's correction).
IQR = Q3 - Q1 is resistant to outliers, while standard deviation is sensitive to extreme values.
Five-Number Summary and Box Plots
The five-number summary (min, Q1, median, Q3, max) is the foundation of the box-and-whisker plot. Whiskers extend to the farthest non-outlier data point.
Outlier fences are at Q1 - 1.5(IQR) and Q3 + 1.5(IQR). Any data point beyond these fences is flagged as a potential outlier.
Shape and Outliers
Distribution shape (symmetric, skewed, bimodal) reveals patterns that summary statistics alone may miss. Skewness measures the asymmetry of the distribution.
A z-score tells how many standard deviations a value is from the mean. Values with |z| > 2 are unusual. Always pair numerical summaries with a visual display.