Statistics: Linear Regression and Line of Best Fit
Modeling relationships with scatterplots, slope, residuals, and predictions
Modeling relationships with scatterplots, slope, residuals, and predictions
Statistics - Grade 9-12
- 1
A student records the number of hours studied and the score earned on a test. The data are: (1, 68), (2, 72), (3, 78), (4, 83), (5, 87). Identify the explanatory variable, the response variable, and the general direction of the association.
- 2
A regression model for predicting a quiz score from hours of studying is y = 4.5x + 62, where x is hours studied and y is the predicted quiz score. Find the predicted quiz score for a student who studies for 6 hours.
- 3
A line of best fit for predicting monthly savings from monthly income is y = 0.18x + 45, where x is monthly income in dollars and y is monthly savings in dollars. Interpret the slope in context.
- 4
A regression equation is y = 2.3x + 15. A data point has x = 10 and an actual y-value of 41. Find the predicted value and the residual.
- 5
A regression equation for predicting plant height from days after planting is y = 1.7x + 4.2. Explain what the y-intercept means in context, and state whether it is reasonable.
- 6
Use the data points (1, 2), (2, 4), (3, 5), (4, 7), and (5, 8). The mean of x is 3 and the mean of y is 5.2. The sum of the products of deviations is 15, and the sum of squared x-deviations is 10. Find the least-squares regression line.
- 7
A data set includes x-values from 2 to 12. A regression model from this data is used to predict y when x = 9 and when x = 20. Classify each prediction as interpolation or extrapolation.
- 8
A scatterplot has correlation coefficient r = -0.86. Describe the direction and strength of the linear relationship.
- 9
For a regression model, the correlation coefficient is r = 0.75. Find r squared and explain its meaning in context.
- 10
Two possible lines are used to model the same data. Line A has residuals 2, -1, 3, and -2. Line B has residuals 1, -1, 1, and -1. Compare the sums of squared residuals and identify which line fits better by the least-squares criterion.
- 11
A scatterplot shows a strong positive linear trend, but one point is far from the rest of the data with a very large x-value. Explain how this point could affect the regression line.
- 12
A regression model shows that students who spend more time on a homework app tend to have higher course grades. Explain why this does not prove that the app causes higher grades.
Related Tools
Related Labs
Related Infographics
Related Cheat Sheets
More Statistics Worksheets
Statistics: Chi-Square Tests
Grade 9-12 · 12 problems
Statistics: Confidence Intervals and Margin of Error
Grade 9-12 · 12 problems
Statistics: Frequency Tables and Histograms
Grade 6-8 · 12 problems
Statistics: Hypothesis Testing: Null and Alternate
Grade 9-12 · 12 problems
More Grade 9-12 Worksheets
Linear Equations
Math · 8 problems
Cell Biology
Biology · 8 problems
Reading Comprehension
Language Arts · 8 problems
Historical Thinking & Evidence
Social Studies · 8 problems