Sign in to save

Bookmark this page so you can find it later.

Sign in to save

Bookmark this page so you can find it later.

This AP Statistics formula sheet summarizes the major tools students use to describe data, model chance, and make conclusions from samples. It is useful for homework, unit review, and exam preparation because AP Statistics requires choosing the right formula for each situation. The sheet connects one-variable statistics, probability rules, inference procedures, and regression in one organized reference. The most important ideas include using center and spread to describe distributions, applying probability rules to random events, and using sampling distributions to justify inference. Confidence intervals estimate unknown population values with a statistic plus a margin of error. Hypothesis tests compare observed results to a null model using a test statistic and a PP-value. Regression formulas summarize linear relationships and support prediction when the conditions are appropriate.

Key Facts

  • The sample mean is xˉ=xin\bar{x}=\frac{\sum x_i}{n}, and it measures the average value in a sample.
  • The sample standard deviation is s=(xixˉ)2n1s=\sqrt{\frac{\sum (x_i-\bar{x})^2}{n-1}}, and it measures typical distance from the sample mean.
  • For independent events, P(AB)=P(A)P(B)P(A \cap B)=P(A)P(B), and for conditional probability, P(AB)=P(AB)P(B)P(A\mid B)=\frac{P(A \cap B)}{P(B)}.
  • For a binomial random variable, μX=np\mu_X=np and σX=np(1p)\sigma_X=\sqrt{np(1-p)} when there are nn independent trials with success probability pp.
  • For a sampling distribution of a sample proportion, μp^=p\mu_{\hat{p}}=p and σp^=p(1p)n\sigma_{\hat{p}}=\sqrt{\frac{p(1-p)}{n}} when the independence and large counts conditions are met.
  • A one-sample confidence interval has the general form statistic±critical value×standard error\text{statistic} \pm \text{critical value}\times \text{standard error}.
  • A common one-sample zz interval for a proportion is p^±zp^(1p^)n\hat{p}\pm z^*\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}.
  • The least-squares regression line is y^=a+bx\hat{y}=a+bx, where b=rsysxb=r\frac{s_y}{s_x} and a=yˉbxˉa=\bar{y}-b\bar{x}.

Vocabulary

Parameter
A parameter is a number that describes an entire population, such as μ\mu, pp, or σ\sigma.
Statistic
A statistic is a number calculated from sample data, such as xˉ\bar{x}, p^\hat{p}, or ss.
Sampling Distribution
A sampling distribution is the distribution of a statistic over many random samples of the same size.
Standard Error
A standard error is an estimated standard deviation of a statistic, often used in confidence intervals and hypothesis tests.
P-value
A PP-value is the probability, assuming the null hypothesis is true, of getting a result at least as extreme as the observed result.
Correlation
Correlation rr measures the strength and direction of a linear relationship between two quantitative variables.

Common Mistakes to Avoid

  • Using σ\sigma when only ss is known is wrong because σ\sigma is a population standard deviation and ss is the sample estimate used in most real AP Statistics problems.
  • Forgetting to check conditions before inference is wrong because formulas such as p^±zp^(1p^)n\hat{p}\pm z^*\sqrt{\frac{\hat{p}(1-\hat{p})}{n}} require random sampling, independence, and large counts.
  • Interpreting a confidence level as a probability about one fixed interval is wrong because a 95%95\% confidence level describes the long-run success rate of the method.
  • Using correlation to prove causation is wrong because a strong value of rr only describes linear association and does not show that one variable causes the other.
  • Mixing up P(AB)P(A\mid B) and P(BA)P(B\mid A) is wrong because the condition after the vertical bar defines the group whose probability is being measured.

Practice Questions

  1. 1 A sample has values 44, 77, 99, 1010, and 1515. Find xˉ\bar{x} and identify whether xˉ\bar{x} is a statistic or a parameter.
  2. 2 In a sample of n=200n=200 students, p^=0.62\hat{p}=0.62 support a new schedule. Find the standard error p^(1p^)n\sqrt{\frac{\hat{p}(1-\hat{p})}{n}}.
  3. 3 A regression analysis gives r=0.80r=0.80, sx=5s_x=5, sy=12s_y=12, xˉ=30\bar{x}=30, and yˉ=70\bar{y}=70. Find the slope b=rsysxb=r\frac{s_y}{s_x} and intercept a=yˉbxˉa=\bar{y}-b\bar{x}.
  4. 4 A study finds that students who spend more hours studying tend to earn higher test scores. Explain why this association alone does not prove that studying time caused the higher scores.