AP Statistics Practice Quiz: Analyzing Departures from Linearity
Written by AP Content Team, Verified for 2026 AP Exams, Last updated: May 2026
Test your understanding with short quizzes. This quiz has 16 questions to check your progress.
Question 1 of 16
All Questions (16)
A) A point that, if removed, substantially changes the regression model.
B) A point with a substantially larger or smaller x-value than other observations.
C) A point that does not follow the general trend and has a large residual.
D) A point that is created by transforming a variable.
Correct Answer: C
Based on the provided content, 'An outlier in regression does not follow the general trend and has a large residual.'
A) An outlier
B) A high-leverage point
C) An influential point
D) A residual
Correct Answer: B
The provided content states, 'A high-leverage point has a substantially larger or smaller x-value than other observations.'
A) A transformed point
B) A high-leverage point
C) An outlier
D) An influential point
Correct Answer: D
According to the content, 'An influential point, if removed, substantially changes the regression model.' A dramatic change in the slope is a substantial change.
A) To increase the number of outliers.
B) To create a data set that is more linear.
C) To decrease the r-squared value to a more realistic level.
D) To ensure all residuals are exactly zero.
Correct Answer: B
The content explicitly states, 'Transforming variables can create a data set that is more linear.'
A) The original model was better because it was simpler.
B) The transformation was unsuccessful because it created influential points.
C) The transformed model is more appropriate for the data.
D) The transformation should be reversed to analyze the residuals.
Correct Answer: C
The content indicates that 'Increased randomness in residual plots and an r-squared value closer to 1 after transformation suggest a more appropriate model.' Both of these conditions were met.
A) 4.1
B) 125.89
C) 12589.25
D) 4.0
Correct Answer: C
First, calculate the predicted transformed value: log(ŷ) = 2.1 + 0.5(4) = 2.1 + 2.0 = 4.1. To find ŷ, you must reverse the log transformation: ŷ = 10^4.1 ≈ 12589.25. This tests the ability to calculate a predicted response from a transformed model.
A) A point with an extreme x-value that follows the general trend of the data.
B) A point with an average x-value that has a very large residual.
C) A point that, when removed, changes the y-intercept but not the slope.
D) A point with an extreme x-value that has the largest residual in the data set.
Correct Answer: A
A high-leverage point has an extreme x-value. For it to not be an outlier, it must follow the general trend, which means it will have a small residual.
A) The original model, because its r-squared value is higher.
B) The transformed model, because the random residual plot indicates it is more appropriate.
C) Neither model, because the r-squared values are contradictory.
D) The original model, because a curved pattern in residuals is acceptable if r-squared is high.
Correct Answer: B
The content states that 'Increased randomness in residual plots...suggest a more appropriate model.' A clear pattern in the residuals, like a curve, indicates that the linear model is not appropriate, regardless of a high r-squared value.
A) $90
B) $30
C) $8,100
D) $1,800
Correct Answer: C
First, substitute x = 5 into the transformed equation: √ŷ = 150 - 12(5) = 150 - 60 = 90. To find the predicted response ŷ, you must square both sides: ŷ = (90)^2 = 8,100.
A) It has the largest possible x-value in the data set.
B) It has a residual that is greater than two standard deviations from zero.
C) Its exclusion from the analysis leads to a substantial change in the regression model.
D) It is always located at the mean of the x and y values.
Correct Answer: C
This is a direct application of the definition provided in the content: 'An influential point, if removed, substantially changes the regression model.'
A) If its residual is positive.
B) If its x-value is substantially larger or smaller than the other x-values.
C) If removing it increases the r-squared value.
D) If it is also an influential point.
Correct Answer: B
The classifications are based on different criteria. An outlier is defined by its large residual, while a high-leverage point is defined by its extreme x-value. A point can be one, both, or neither.
A) A large residual.
B) A significant influence on the regression line's slope.
C) An x-value far from the mean of the x-values.
D) A y-value that does not match the model's prediction.
Correct Answer: C
The definition of a high-leverage point is based solely on its x-position relative to the other data points. The provided content states it has a 'substantially larger or smaller x-value than other observations.'
A) The r-squared value decreases and the residual plot shows a clear pattern.
B) The slope of the regression line becomes positive.
C) The r-squared value increases and the residual plot becomes more random.
D) The number of high-leverage points is reduced to zero.
Correct Answer: C
The content lists two key indicators of a successful transformation: 'Increased randomness in residual plots and an r-squared value closer to 1 after transformation suggest a more appropriate model.'
A) 64
B) 35
C) 4096
D) 8
Correct Answer: D
First, calculate the value of ŷ^2 by substituting x = 11: ŷ^2 = 20 + 4(11) = 20 + 44 = 64. To find the predicted response ŷ, take the square root of both sides: ŷ = √64 = 8.
A) As a high-leverage point only.
B) As an outlier only.
C) As both an outlier and a high-leverage point.
D) As neither an outlier nor a high-leverage point.
Correct Answer: C
It is a high-leverage point because its x-value (hours studied) is substantially larger than others. It is an outlier because it has a large residual and does not follow the general trend of more studying leading to higher scores.
A) Remove all points with negative residuals.
B) Transform one or both of the variables to achieve linearity.
C) Assume the correlation is zero and stop the analysis.
D) Add more data points until the pattern disappears.
Correct Answer: B
The provided content suggests that when data is not linear, 'Transforming variables can create a data set that is more linear.' This is a standard procedure for dealing with non-linear relationships.