Statistics
Grade 11-12
Logistic Regression Reference Cheat Sheet
A printable reference covering logit models, logistic probability, odds ratios, maximum likelihood, classification thresholds, and model evaluation for grades 11-12.
Related Tools
Related Labs
Related Worksheets
Logistic regression is used when the response variable has two outcomes, such as yes or no, pass or fail, or disease or no disease. This cheat sheet helps students connect a linear predictor to a probability using the logit link. It is useful because ordinary linear regression can give impossible probabilities below or above . Logistic regression keeps predictions between and and supports classification decisions.
Key Facts
- For binary logistic regression, the response is often coded as for success and for failure.
- The logistic model is for one predictor.
- The logit form is , where is the odds of success.
- The odds can be found from the logit by .
- A one-unit increase in multiplies the odds by when all other predictors are held constant.
- In multiple logistic regression, .
- A common classification rule predicts if and predicts if .
- Maximum likelihood chooses the coefficients that make the observed outcomes most likely under the model.
Vocabulary
- Binary response
- A variable with two possible outcomes, usually coded as and .
- Probability
- The long-run chance that an event occurs, written as with .
- Odds
- The ratio of the probability of success to the probability of failure, written as .
- Logit
- The natural logarithm of the odds, written as .
- Odds ratio
- The factor by which the odds change for a one-unit increase in a predictor, often written as .
- Classification threshold
- A cutoff value such as used to turn a predicted probability into a predicted class.
Common Mistakes to Avoid
- Treating as the change in probability is wrong because changes the log-odds, not directly.
- Forgetting to convert from logit to probability is wrong because can be any real number, while a probability must be between and .
- Interpreting as an added amount is wrong because an odds ratio multiplies the odds rather than adding to them.
- Using accuracy alone to judge the model can be misleading because a model may predict the majority class well while missing many important minority cases.
- Assuming a threshold of is always best is wrong because the best cutoff depends on the costs of false positives and false negatives.
Practice Questions
- 1 For the model , find the predicted probability when .
- 2 If a logistic regression coefficient is , calculate the odds ratio and interpret it for a one-unit increase in .
- 3 A model gives for one student and for another. Using the threshold , classify each student as or .
- 4 Explain why logistic regression is more appropriate than ordinary linear regression when the response variable is binary.