Sign in to save

Bookmark this page so you can find it later.

Sign in to save

Bookmark this page so you can find it later.

Choosing the right statistical test helps students match a research question to the correct method of analysis. This cheat sheet focuses on deciding whether data involve means, proportions, counts, categories, or relationships. It is useful when planning an investigation, checking assumptions, or interpreting results. Students can use it as a quick reference before performing calculations or using technology. The main ideas are to identify the response variable, the explanatory variable, the number of groups, and whether samples are independent or paired. Mean-based tests often use test statistics such as t=xˉμ0s/nt=\frac{\bar{x}-\mu_0}{s/\sqrt{n}} or compare several means with ANOVA. Category-based tests often use chi-square statistics such as χ2=(OE)2E\chi^2=\sum \frac{(O-E)^2}{E}. Relationship questions may use correlation, regression, or tests of association depending on the data type.

Key Facts

  • Use a one-sample t test for one sample mean when the population standard deviation is unknown, with t=xˉμ0s/nt=\frac{\bar{x}-\mu_0}{s/\sqrt{n}}.
  • Use a two-sample t test to compare two independent means, with t=xˉ1xˉ2s12n1+s22n2t=\frac{\bar{x}_1-\bar{x}_2}{\sqrt{\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2}}}.
  • Use a paired t test when the same subjects are measured twice or matched pairs are used, and test the mean difference with t=dˉsd/nt=\frac{\bar{d}}{s_d/\sqrt{n}}.
  • Use a one-proportion z test when testing one population proportion, with z=p^p0p0(1p0)nz=\frac{\hat{p}-p_0}{\sqrt{\frac{p_0(1-p_0)}{n}}}.
  • Use a two-proportion z test when comparing two independent proportions, with z=p^1p^2p^(1p^)(1n1+1n2)z=\frac{\hat{p}_1-\hat{p}_2}{\sqrt{\hat{p}(1-\hat{p})\left(\frac{1}{n_1}+\frac{1}{n_2}\right)}}.
  • Use a chi-square goodness-of-fit test for one categorical variable, with χ2=(OE)2E\chi^2=\sum \frac{(O-E)^2}{E}.
  • Use a chi-square test of independence when testing whether two categorical variables are related in a two-way table.
  • Use ANOVA to compare 33 or more group means, where the test statistic is F=variation between groupsvariation within groupsF=\frac{\text{variation between groups}}{\text{variation within groups}}.

Vocabulary

Null hypothesis
The null hypothesis, written H0H_0, is the claim that there is no effect, no difference, or no relationship in the population.
Alternative hypothesis
The alternative hypothesis, written HaH_a, is the claim that an effect, difference, or relationship exists.
P-value
The p-value is the probability of getting results at least as extreme as the sample results if H0H_0 is true.
Significance level
The significance level, written α\alpha, is the cutoff probability for rejecting H0H_0, often α=0.05\alpha=0.05.
Independent samples
Independent samples are groups where the data values in one group are not naturally paired with values in another group.
Paired data
Paired data occur when two measurements are linked, such as before-and-after measurements on the same person.

Common Mistakes to Avoid

  • Using a two-sample t test for before-and-after data is wrong because the observations are paired, so the test should analyze the differences dd.
  • Using a z test for a mean when σ\sigma is unknown is wrong because the sample standard deviation ss requires a t distribution.
  • Using a chi-square test when expected counts are too small is wrong because the approximation may be unreliable, especially when any expected count is below 55.
  • Choosing a test only from the sample size is wrong because the type of variable, number of groups, and independence matter first.
  • Rejecting H0H_0 because the p-value is large is wrong because a large p-value means the data do not give strong evidence against H0H_0.

Practice Questions

  1. 1 A class wants to test whether the mean score on a standardized test differs from 7575. A sample of n=36n=36 students has xˉ=78\bar{x}=78 and s=9s=9. Which test should be used, and what is the test statistic?
  2. 2 A survey finds that 6464 out of 100100 juniors and 7272 out of 120120 seniors support a schedule change. Which test should be used to compare the two proportions?
  3. 3 A restaurant records customer ratings as poor, fair, good, or excellent for dine-in and takeout orders. Which test should be used to determine whether rating category is related to order type?
  4. 4 A researcher compares plant growth under 44 different fertilizers. Explain why ANOVA is more appropriate than running many separate two-sample t tests.