PrepGo

Sampling Distributions for Differences in Sample Proportions - AP Statistics Study Guide

Written by AP Content Team, Verified for 2026 AP Exams, Last updated: May 2026

Learn with study guides reviewed by top AP teachers. This guide takes about 18 minutes to read.

Quick Summary

This guide covers the sampling distribution of the difference between two sample proportions (). You will learn how to describe the shape, center, and spread of this distribution and how to verify the necessary conditions—Random, 10% Condition, and Large Counts Condition—for both samples. Mastering this topic is essential for comparing two populations based on categorical data and forms the foundation for two-sample confidence intervals and significance tests for proportions.

Key Concepts

When we want to compare the proportions of successes in two different populations (e.g., the proportion of voters favoring a candidate in two different states), we take independent random samples from each. We then calculate the difference between the sample proportions, . The sampling distribution of the difference in sample proportions describes the distribution of this statistic if we were to take every possible pair of samples and calculate the difference.

To properly use this model, we must check three conditions and then describe its shape, center, and spread.

[Image: A normal curve labeled "Sampling Distribution of p̂₁ - p̂₂". The center of the curve is labeled with the mean, μ = p₁ - p₂. The spread is indicated by the standard deviation, σ = √[ (p₁(1-p₁)/n₁) + (p₂(1-p₂)/n₂) ].]

1. Conditions for the Sampling Distribution of

Before we can describe the distribution, we must verify three critical conditions:

  • Random Condition: The data must come from two independent random samples or two groups in a randomized experiment. The "independent" part is crucial; the selection of one sample cannot influence the selection of the other.

  • 10% Condition (for Independence within samples): When sampling without replacement, the sample size should be no more than 10% of the population size for each sample. This ensures that individual observations within each sample can be treated as independent.

    • and
  • Large Counts Condition (for Normality): The number of expected successes and failures must be at least 10 in each sample. This condition allows us to assume the sampling distribution of is approximately Normal.

    • , , , and

    • In practice, if the true population proportions and are unknown, we check this condition using the sample proportions: , , , and .

2. Describing the Sampling Distribution of

Once the conditions are met, we can describe the distribution's characteristics:

  • Shape: The sampling distribution of is approximately Normal. This is justified by the Large Counts Condition.

  • Center: The mean of the sampling distribution is the true difference between the population proportions. This means is an unbiased estimator of .

    • Mean Formula:
  • Spread: The standard deviation of the sampling distribution measures the typical distance of the statistic from the true difference . To calculate it, we add the variances of the individual sampling distributions.

    • Standard Deviation Formula:

    • This formula is only valid if the Random and 10% conditions are met for both samples.

Key Vocabulary

  • Sampling Distribution for the Difference in Sample Proportions: The theoretical distribution of all possible values of the statistic that would be obtained from all possible pairs of independent random samples of size and from their respective populations.

  • Independent Samples: Two samples are independent if the selection of individuals or objects for one sample does not affect or influence the selection of individuals or objects for the other sample.

  • Unbiased Estimator: A statistic whose sampling distribution has a mean that is exactly equal to the population parameter it is intended to estimate. is an unbiased estimator of .

  • Large Counts Condition: The guideline ( and for both samples) used to check if the shape of a sampling distribution for a proportion or difference in proportions is approximately Normal.

  • 10% Condition: The rule used when sampling without replacement to ensure observations within a sample are reasonably independent. It requires the sample size to be no more than 10% of the population size ().

Calculator Tech (TI-84)

No major calculator functions are required for describing the sampling distribution itself. The calculations for the mean and standard deviation are straightforward arithmetic. Later, in Unit 6, you will use and for inference, which builds upon the concepts learned here.

How to Show Work on the FRQ

On the AP exam, you will be asked to describe the sampling distribution of or to calculate a probability based on it. Your response must be a clear, written description that addresses shape, center, and spread, explicitly checking all necessary conditions.

Template for Describing the Sampling Distribution of :

  1. State the Distribution: Identify the sampling distribution of , where is the sample proportion of [context for group 1] and is the sample proportion of [context for group 2].

  2. Check Conditions:

    • Random: "The problem states we have two independent random samples: a random sample of [subjects from population 1] and a random sample of [subjects from population 2]."

    • 10% Condition: "Assuming the population of [population 1] is at least and the population of [population 2] is at least , the 10% condition is met. This allows us to calculate the standard deviation."

    • Large Counts Condition: "We check the Large Counts Condition to ensure a normal approximation is appropriate:

      • = [calculation] \ge 10

      • = [calculation] \ge 10

      • = [calculation] \ge 10

      • = [calculation] \ge 10

      Since all expected counts are at least 10, the sampling distribution is approximately normal."

  3. Describe Shape, Center, and Spread:

    • Shape: "The sampling distribution of is approximately Normal."

    • Center: "The mean of the sampling distribution is ."

    • Spread: "The standard deviation of the sampling distribution is ."

Practice Problems

Problem 1:

A large university wants to compare the proportion of in-state and out-of-state students who are satisfied with the campus dining services. The true proportion of satisfied in-state students is , and the true proportion of satisfied out-of-state students is . The university plans to take a simple random sample of 200 in-state students and a separate simple random sample of 150 out-of-state students. Let be the difference in the sample proportions of satisfied students. Describe the sampling distribution of .

Solution:

The sampling distribution of , the difference in sample proportions of satisfied in-state and out-of-state students, can be described by its shape, center, and spread after checking the necessary conditions.

  • Conditions: The problem states we have two independent simple random samples. We must assume the university has at least 10(200) = 2000 in-state students and at least 10(150) = 1500 out-of-state students to satisfy the 10% condition. For the Large Counts Condition: , , , and . All conditions are met.

  • Shape, Center, and Spread: Because the Large Counts Condition is met, the sampling distribution of is approximately Normal. The mean of the distribution is . The standard deviation is .

Problem 2:

A national polling agency reports that 44% of adults in the Eastern U.S. approve of the current president, while 52% of adults in the Western U.S. approve. Assume these are the true population proportions. The agency takes a simple random sample of 100 adults from the East and an independent simple random sample of 120 adults from the West. What is the probability that the sample proportion of approvers from the West is at least 10 percentage points higher than the sample proportion from the East?

Solution:

We want to find . First, we must describe the sampling distribution of the difference .

  • Conditions: We have two independent random samples. We assume there are at least 10(120) = 1200 adults in the West and 10(100) = 1000 adults in the East (10% condition). For Large Counts: , , , and . All conditions are met.

  • Shape, Center, and Spread: The sampling distribution of is approximately Normal with a mean of and a standard deviation of .

  • Probability Calculation: We standardize the value 0.10 to find the z-score: . The probability is , which is approximately 0.383. There is about a 38.3% chance that the sample proportion of approvers from the West will be at least 10 percentage points higher than that from the East.

Common Mistakes to Avoid

  • Adding Standard Deviations: You cannot add standard deviations. You must add the variances and then take the square root. Notice the formula . A common mistake is to calculate and separately and add them together.

  • Forgetting to Check Conditions for Both Samples: The Large Counts Condition and the 10% Condition must be explicitly checked and verified for and independently. It is not sufficient to check them for just one group.

  • Confusing and : When describing the sampling distribution, use the true population proportions ( and ) for the mean and standard deviation formulas. Only use sample proportions ( and ) to check the Large Counts Condition if the true parameters are unknown.

  • Saying the "Sample" is Normal: The data in the samples are categorical (e.g., yes/no, satisfied/not satisfied). The samples themselves are not Normal. It is the sampling distribution of the statistic that is approximately Normal. Be precise with your language.

  • Ignoring Sample Independence: The formulas for the center and spread of the difference rely on the two samples being independent of each other. If the samples are paired or dependent (e.g., a pre-test and post-test on the same group), these methods are not appropriate.