PrepGo

AP Statistics Practice Quiz: Representing a Categorical Variable with Graphs

Written by AP Content Team, Verified for 2026 AP Exams, Last updated: May 2026

Test your understanding with short quizzes. This quiz has 16 questions to check your progress.

Question 1 of 16

Which of the following best describes the primary purpose of a bar chart in statistics?

All Questions (16)

Which of the following best describes the primary purpose of a bar chart in statistics?

A) To show the relationship between two quantitative variables.

B) To display frequencies or relative frequencies for categorical data.

C) To represent the distribution of a single quantitative variable over time.

D) To calculate the mean and median of a dataset.

Correct Answer: B

According to the provided content, 'Bar charts display frequencies or relative frequencies for categorical data.' This is their main function.

In a bar chart representing the favorite pets of students in a class, what does the length of the bar for the 'Dog' category correspond to?

A) The average number of pets owned by the students.

B) The total number of students in the class.

C) The count or proportion of students who chose 'Dog' as their favorite.

D) The variety of dog breeds mentioned by the students.

Correct Answer: C

The content states that 'The length of a bar in a bar graph corresponds to the count or proportion of observations in a category.' Therefore, the length of the 'Dog' bar represents how many students selected that category.

A market researcher creates a bar chart showing the number of customers who prefer four different brands of coffee. The bar for Brand A is the tallest. What is the most accurate description of this data?

A) Brand A is the highest quality coffee.

B) Brand A had the highest frequency of preference among the customers surveyed.

C) Brand A is the most expensive coffee.

D) All customers surveyed drink Brand A coffee.

Correct Answer: B

The length of a bar represents the count or frequency for that category. A taller bar means a higher frequency. The graph only shows preference, not quality or price. This aligns with the principle of describing categorical data represented graphically.

A school administrator wants to compare the primary mode of transportation (Bus, Car, Walk, Bike) for students at the high school versus the middle school. Which of the following would be the most effective way to graphically represent this comparison?

A) A single pie chart showing the combined data from both schools.

B) A scatterplot with high school on one axis and middle school on the other.

C) Two separate bar graphs, one for each school, using the same transportation categories and scale.

D) A histogram showing the total number of students for each mode of transportation.

Correct Answer: C

The content states that 'Frequency tables and bar graphs can be used to compare two or more data sets for the same categorical variable.' Creating two bar graphs (or a side-by-side bar graph) allows for a direct comparison of the frequencies for each category between the two schools.

A company claims, 'Our new advertisement has significantly increased customer preference for our product.' To justify this claim using graphical representations, what would they need to present?

A) A single bar chart showing high current customer preference.

B) A bar chart showing customer preferences from before the advertisement compared to a bar chart showing preferences after the advertisement.

C) A frequency table listing the number of times the advertisement was aired.

D) A bar chart showing the cost of the advertisement compared to the product's sales.

Correct Answer: B

To justify a claim about a change over time, one must 'compare multiple sets of categorical data.' A single graph only shows the current state. Comparing data from before and after the event (the advertisement) provides the evidence needed to justify the claim, as stated in the content.

While bar charts are commonly used to display frequencies for categorical data, what does the provided content suggest about other graphical methods?

A) Bar charts are the only acceptable method for representing categorical data.

B) Histograms should be used interchangeably with bar charts for categorical data.

C) Many other types of graphs can also be used to represent frequencies for categorical data.

D) Graphical representations are not useful for categorical data.

Correct Answer: C

The content explicitly states, 'Many other types of graphs can represent frequencies for categorical data.' This acknowledges that while bar charts are a primary tool, they are not the only one.

A researcher creates a bar chart of survey responses for 'Favorite Social Media Platform' from a sample of 100 teenagers and another from a sample of 200 adults. The bar for 'Platform X' is the same height in both charts. What can be concluded?

A) The number of people who prefer Platform X is the same in both groups.

B) The proportion of people who prefer Platform X is the same in both groups.

C) The proportion of teenagers who prefer Platform X is likely half the proportion of adults who prefer it, assuming the y-axis represents frequency.

D) The proportion of teenagers who prefer Platform X is likely double the proportion of adults who prefer it, assuming the y-axis represents frequency.

Correct Answer: D

If the y-axis is frequency (count) and the bars are the same height, the number of people is the same. Let's say the count is 'N'. For teenagers, the proportion is N/100. For adults, the proportion is N/200. The proportion for teenagers (N/100) is double the proportion for adults (N/200). This requires comparing multiple sets of data and understanding how bar length (count) relates to proportion.

Which of these is a categorical variable that could be represented by a bar chart?

A) The height of students in a classroom.

B) The time it takes for students to complete a test.

C) The favorite subject (Math, Science, English, History) of students.

D) The final exam scores of students in a statistics class.

Correct Answer: C

The content focuses on representing categorical data. Favorite subject is a categorical variable because the values fall into distinct, non-numerical categories. Height, time, and exam scores are all quantitative variables.

A frequency table shows the number of cars sold by color at a dealership. How would this data be represented in a bar graph?

A) Each car sold would be a separate bar.

B) The x-axis would list the car colors, and the y-axis would represent the frequency (number of cars sold).

C) The x-axis would represent the frequency, and the y-axis would list the car colors.

D) A single bar would represent the total number of cars sold.

Correct Answer: B

The content specifies that bar charts display frequencies for categorical data. The categories (car colors) are typically placed on the x-axis, and the length of the bars, determined by the y-axis scale, represents the frequency for each category.

A bar chart shows the number of employees in four departments: Sales (50), Marketing (25), HR (10), and IT (15). A manager claims, 'The Sales department is five times more important than the HR department.' Why is this claim NOT justified by the graph alone?

A) The bar for Sales is not exactly five times taller than the bar for HR.

B) The graph represents the frequency of employees, not a measure of departmental importance.

C) The IT department has more employees than the HR department.

D) The Marketing department has half the employees of the Sales department.

Correct Answer: B

According to the content, 'Graphical representations of a categorical variable can justify claims about the data in context.' The data in this context is the number of employees. The graph can justify a claim that Sales has five times more employees than HR. However, it cannot justify a claim about 'importance,' which is a subjective interpretation not measured by the data.

When comparing two datasets for the same categorical variable, such as the preferred streaming service for students in 2020 versus 2023, what is a key feature that the bar graphs for both years should share for an effective comparison?

A) The same total number of students must be surveyed each year.

B) The same categories and a consistent scale for the frequency or relative frequency axis.

C) The same number of categories must have the highest frequency in both years.

D) The graphs must be displayed as pie charts instead of bar charts.

Correct Answer: B

To effectively 'compare multiple sets of categorical data,' the graphical representations must be consistent. Using the same categories (the streaming services) and the same scale on the y-axis allows for a direct and accurate visual comparison of frequencies or proportions between the two years.

A bar chart displays the relative frequencies of different majors chosen by freshmen at a university. The bar for 'Undeclared' is 0.30. What does this indicate?

A) Exactly 30 students are undeclared.

B) 30% of the freshmen surveyed are undeclared.

C) Undeclared is the third most popular major.

D) The total number of freshmen at the university is 30.

Correct Answer: B

The content specifies that bar charts can display 'relative frequencies'. A relative frequency is a proportion or percentage. A bar length of 0.30 on a relative frequency chart means that 30% of the observations fall into that category.

Two politicians, Candidate A and Candidate B, present bar charts to show their support. Candidate A's chart shows they have 1,000 supporters. Candidate B's chart shows they have 800 supporters. Candidate A claims to be more popular. What crucial piece of information is needed to properly compare these sets of data and evaluate the claim?

A) The number of categories on each bar chart.

B) The total number of voters from which each sample of supporters was drawn.

C) The colors used for the bars in each chart.

D) The height of the tallest bar in each chart.

Correct Answer: B

To 'compare multiple sets of categorical data' meaningfully, context is essential. A count of 1,000 supporters from a sample of 2,000 voters (50%) is very different from 1,000 supporters from a sample of 10,000 voters (10%). Without knowing the total sample size (the denominator), comparing the frequencies (the numerators) can be misleading. Using relative frequencies would solve this issue.

A frequency table is a tabular representation of categorical data. According to the content, what is the most direct graphical representation of a frequency table?

A) A scatterplot

B) A boxplot

C) A bar graph

D) A time series plot

Correct Answer: C

The content directly links these two concepts: 'Frequency tables and bar graphs can be used to compare two or more data sets for the same categorical variable.' A bar graph is the standard graphical display for the information contained in a frequency table.

A city council presents a bar chart showing the number of noise complaints per district. District A has 200 complaints and District B has 100. A council member from District B claims this is misleading because District A has four times the population of District B. If the council member is correct, what would be the most appropriate way to regraph the data to make a fair comparison?

A) Use a pie chart to show the percentage of total complaints from each district.

B) Create a new bar chart using relative frequencies (complaints per capita) for each district.

C) Create a bar chart showing the population of each district instead of the complaints.

D) Keep the original bar chart but make the bar for District B wider to represent a smaller population.

Correct Answer: B

This question requires a deep understanding of how to 'compare multiple sets of categorical data' fairly. A simple frequency chart is misleading when the group sizes are different. To make a fair comparison, one should use relative frequencies (in this case, complaints per person or per 1,000 people). This standardizes the data and provides a more accurate picture. District A's rate would be 200/4P and District B's would be 100/P, making District B's per capita complaint rate twice as high as District A's.

Which of the following statements is a correct description that could be derived from a bar chart showing the distribution of a categorical variable?

A) The range of the data is 45.

B) The distribution is skewed to the right.

C) The category with the highest frequency is 'Blue'.

D) The mean of the data is approximately 25.3.

Correct Answer: C

Bar charts are used to 'describe categorical data represented graphically.' Descriptions for categorical data focus on comparing the frequencies or relative frequencies of the categories, such as identifying the mode (the category with the highest frequency). Terms like range, skewness, and mean are used to describe quantitative data, not categorical data.