PrepGo

Introduction to Random Variables and Probability Distributions - AP Statistics Study Guide

Written by AP Content Team, Verified for 2026 AP Exams, Last updated: May 2026

Learn with study guides reviewed by top AP teachers. This guide takes about 20 minutes to read.

Quick Summary

This guide introduces the foundational concept of a random variable, which is a way to assign a numerical value to the outcome of a random event. You will learn to distinguish between the two main types of random variables: discrete (countable outcomes) and continuous (measurable outcomes). Furthermore, you will master the construction and verification of probability distributions, which are models that describe the likelihood of each possible value of a random variable.

Key Concepts

A random variable is a core concept in statistics that connects random outcomes to numerical values. It's a variable whose value is a numerical outcome of a random phenomenon. We use a capital letter, like X or Y, to denote a random variable.

Types of Random Variables

There are two primary types of random variables, and distinguishing between them is a critical skill.

  • Discrete Random Variable (DRV)

    • A discrete random variable has a finite or countably infinite number of possible values. You can list out all possible outcomes without skipping any.

    • Think of it as something you count.

    • Examples:

      • The number of heads in 4 coin flips (Possible values: 0, 1, 2, 3, 4).

      • The sum of the dots when rolling two standard dice (Possible values: 2, 3, 4, ..., 12).

      • The number of defective lightbulbs in a sample of 50 (Possible values: 0, 1, 2, ..., 50).

      • The number of cars that pass through an intersection in one hour.

  • Continuous Random Variable (CRV)

    • A continuous random variable can take on any value within a given interval. There are infinite possible values between any two given values.

    • Think of it as something you measure.

    • Examples:

      • The height of a randomly selected student (Could be 68 inches, 68.1 inches, 68.11 inches...).

      • The exact time it takes to run a mile.

      • The weight of a newborn baby.

      • The precise volume of coffee in a "12-ounce" cup.

Probability Distributions for Discrete Random Variables

A probability distribution for a discrete random variable X provides the probability for each possible value of X. It can be represented as a table, a formula, or a graph (like a histogram).

[Image: A probability histogram for the sum of two dice. The x-axis is labeled "Sum of Two Dice" with values 2 through 12. The y-axis is labeled "Probability" and has bars corresponding to the probability of each sum.]

Structure of a Probability Distribution Table:

The table lists all possible values of the random variable (x) and their corresponding probabilities (P(X=x)).

Value of X (x)Probability P(X=x)
x₁p₁
x₂p₂
......
xₖpₖ

Two Conditions for a Valid Probability Distribution (CRITICAL):

For any distribution to be a valid probability distribution, it must satisfy two rules:

  1. The Probability Rule: The probability for each value of X must be between 0 and 1, inclusive.

    • Formula: 0 \le P(X=x) \le 1 for all possible values of x.

    • A probability can't be negative or greater than 100%.

  2. The Summation Rule: The sum of all probabilities for all possible values of X must equal exactly 1.

    • Formula: Σ P(X=x) = 1

    • This means the distribution accounts for 100% of all possible outcomes.

Key Vocabulary

  • Random Variable: A variable whose value is a numerical outcome of a random process. Typically denoted by a capital letter (e.g., X).

  • Discrete Random Variable: A random variable that can take on a finite or countable number of distinct values.

  • Continuous Random Variable: A random variable that can take on any value within a given interval of real numbers.

  • Probability Distribution: A table, graph, or formula that gives the probability for each possible value of a random variable.

  • Sample Space: The set of all possible outcomes of a random phenomenon. A random variable assigns a number to each outcome in the sample space.

Calculator Tech (TI-84)

While this topic is primarily conceptual, you can use your calculator to quickly verify the second condition for a valid probability distribution (that the probabilities sum to 1).

Task: Check if the probabilities in a distribution sum to 1.

  1. Enter the probabilities into a list:

    • Press STAT.

    • Select 1:Edit....

    • Clear a list if necessary (arrow up to the list name, press CLEAR, then ENTER).

    • Enter each probability from the distribution into a list, for example, L1.

  2. Calculate the sum of the list:

    • Return to the home screen by pressing 2nd -> MODE [QUIT].

    • Press 2nd -> STAT [LIST].

    • Arrow over to the MATH menu.

    • Select 5:sum(.

    • Tell the calculator which list to sum. For L1, press 2nd -> 1 [L1].

    • Your screen should show .

    • Press ENTER. The result should be exactly 1 for a valid probability distribution.

How to Show Work on the FRQ

Free Response Questions on this topic test your conceptual understanding and communication. You will not use the State-Plan-Do-Conclude framework here. Instead, you'll be asked to define, construct, or verify.

Template for Verifying a Valid Probability Distribution:

To receive full credit for verifying that a table represents a valid probability distribution, you must explicitly check and comment on BOTH conditions.

Question: "Show that the probability distribution for X is a valid distribution."

Your Response:

"To be a valid probability distribution, two conditions must be met:

  1. All individual probabilities must be between 0 and 1, inclusive.

    • Check this condition: "In the given table, all probabilities [list them, e.g., 0.1, 0.2, 0.3, 0.4] are between 0 and 1."
  2. The sum of all probabilities must equal 1.

    • Show the sum: "The sum of the probabilities is 0.1 + 0.2 + 0.3 + 0.4 = 1.0."

Since both conditions are met, this is a valid probability distribution."

Important: Simply stating "the sum is 1" is not enough. You must write out the sum to show your work.

Practice Problems

Problem 1:

A board game involves rolling a special four-sided die. Let the random variable Y be the number of dots showing on the top face. The probability of rolling a 1 is 0.4, the probability of rolling a 2 is 0.3, and the probability of rolling a 3 is 0.2.

(a) The die has faces numbered 1, 2, 3, and 4. What is the probability of rolling a 4?

(b) Construct the probability distribution table for the random variable Y.

(c) Is Y a discrete or continuous random variable? Justify your answer.

Solution:

(a) To find the probability of rolling a 4, we use the Summation Rule. The sum of all probabilities must be 1.

Let P(Y=4) = k.

P(1) + P(2) + P(3) + P(4) = 1

0.4 + 0.3 + 0.2 + k = 1

0.9 + k = 1

k = 1 - 0.9 = 0.1

The probability of rolling a 4 is 0.1.

(b) The probability distribution table for Y is:

Value of Y (y)Probability P(Y=y)
10.4
20.3
30.2
40.1

(c) Y is a discrete random variable. The possible values of Y are {1, 2, 3, 4}, which is a finite, countable set of outcomes. We cannot roll a 2.5 or any other value between the integers.


Problem 2:

A local pet store has a tank with 10 fish: 5 are red, 3 are blue, and 2 are yellow. A customer randomly selects 2 fish without replacement. Let the random variable X be the number of red fish selected.

(a) List the possible values for the random variable X.

(b) Construct the probability distribution for X. Show your work for calculating each probability.

(c) Verify that your distribution from part (b) is a valid probability distribution.

Solution:

(a) The customer selects 2 fish. The number of red fish selected, X, can be 0, 1, or 2.

(b) We calculate the probability for each value of X using combinations. The total number of ways to choose 2 fish from 10 is C(10, 2) = (109)/(21) = 45.

  • P(X=0): This means 0 red fish and 2 non-red fish are chosen. There are 5 non-red fish.

    • Ways to choose 0 red from 5: C(5, 0) = 1

    • Ways to choose 2 non-red from 5: C(5, 2) = 10

    • P(X=0) = [C(5, 0) * C(5, 2)] / C(10, 2) = (1 * 10) / 45 = 10/45

  • P(X=1): This means 1 red fish and 1 non-red fish are chosen.

    • Ways to choose 1 red from 5: C(5, 1) = 5

    • Ways to choose 1 non-red from 5: C(5, 1) = 5

    • P(X=1) = [C(5, 1) * C(5, 1)] / C(10, 2) = (5 * 5) / 45 = 25/45

  • P(X=2): This means 2 red fish and 0 non-red fish are chosen.

    • Ways to choose 2 red from 5: C(5, 2) = 10

    • Ways to choose 0 non-red from 5: C(5, 0) = 1

    • P(X=2) = [C(5, 2) * C(5, 0)] / C(10, 2) = (10 * 1) / 45 = 10/45

The probability distribution for X is:

Value of X (x)Probability P(X=x)
010/45
125/45
210/45

(c) To verify this is a valid probability distribution, we check both conditions:

  1. All probabilities are between 0 and 1: The probabilities 10/45, 25/45, and 10/45 are all positive and less than 1. This condition is met.

  2. The sum of probabilities is 1: We show the sum: 10/45 + 25/45 + 10/45 = 45/45 = 1. This condition is met.

Since both conditions are satisfied, this is a valid probability distribution.

Common Mistakes to Avoid

  • Confusing Discrete vs. Continuous: A common mistake is to classify a variable based on how it's recorded rather than its true nature. For example, age is technically continuous, but we often treat it as discrete (e.g., "17 years old"). For the AP exam, think about what is being measured. If it's a measurement (time, weight, height, length), it's continuous. If it's a count (number of items, number of occurrences), it's discrete.

  • Forgetting to Check BOTH Conditions: When asked to verify a probability distribution, many students only check that the probabilities sum to 1. You MUST also state that all individual probabilities are between 0 and 1. Forgetting this first check will cost you points.

  • Not Showing the Sum: On an FRQ, do not just write "The sum of the probabilities is 1." You must write out the actual addition that leads to 1 (e.g., "0.2 + 0.5 + 0.3 = 1") to receive full credit for communication.

  • Imprecise Definition of the Random Variable: When defining a random variable, be specific. Don't just say "the outcome." Say "the number of heads" or "the sum of the dice." The variable must be numerical.