PrepGo

Intelligence and Achievement - AP Psychology Study Guide

Written by AP Content Team, Verified for 2026 AP Exams, Last updated: May 2026

Learn with study guides reviewed by top AP teachers. This guide takes about 27 minutes to read.

Getting Started

What does it mean to be "smart"? For over a century, psychologists have tried to answer this question by defining and measuring intelligence. This pursuit has created powerful tools that can help identify needs and predict success, but it has also led to significant controversy, revealing how scientific measurement can be influenced by societal values and systemic biases.

What You Should Be Able to Do

  • Explain historical and modern theories about the nature of intelligence.

  • Describe the essential principles of psychological testing used to measure intelligence.

  • Analyze how systemic issues and biases relate to the use of intelligence assessments.

  • Compare the concepts of intelligence and academic achievement.

Key Developments & Analysis

The study of intelligence is fundamentally a story about measurement. To understand intelligence, we must first understand the methods, data, and ethical challenges involved in assessing it.

Design & Variables

The primary challenge in intelligence research is creating a valid operational definition—a clear, measurable description of a psychological construct. Is intelligence a single, general ability, or a collection of different skills?

  • General Intelligence (g): This theory proposes that a single underlying factor, g, accounts for performance on various cognitive tasks. Tests designed from this perspective aim to produce a single score, like an IQ score, to represent this general ability.

  • Multiple Abilities: This perspective argues that intelligence is not one thing but a collection of distinct abilities, such as verbal, mathematical, spatial, and musical intelligence.

Early tests created an Intelligence Quotient (IQ) by dividing a person’s mental age (determined by the test) by their chronological age and multiplying by 100. Modern tests have moved away from this formula. Instead, they use a standardized scoring system where an individual's score is compared to the performance of a large, representative sample of people of the same age.

Threats & Controls: The Principles of Good Testing

For any psychological assessment to be useful, it must adhere to strict psychometric principles. These principles act as controls against threats to the test's accuracy and fairness.

  • Standardization: This is the process of administering a test to a representative sample of future test-takers to establish a basis for meaningful comparison. It also requires that all test-takers receive the test under the same, consistent procedures. Without standardization, we cannot know if score differences are due to ability or to variations in the testing environment.

  • Reliability: A test is reliable if it yields dependably consistent scores.

    • Test-retest reliability is checked by having the same person take the test on two or more occasions.

    • Split-half reliability is checked by seeing if scores on two halves of the same test (e.g., odd vs. even questions) are consistent.

  • Validity: A test is valid if it measures or predicts what it is supposed to. A test can be reliable without being valid (e.g., a miscalibrated scale consistently gives the wrong weight).

    • Construct validity refers to how well a test measures the underlying theoretical concept it's designed to assess (e.g., does an IQ test truly measure "intelligence"?).

    • Predictive validity is the success with which a test predicts the behavior it is designed to predict (e.g., do intelligence test scores predict future academic success?).

A major threat to validity is systemic bias. Stereotype threat is a phenomenon where a person's performance is impaired by the fear that they will be evaluated based on a negative stereotype about their group. To counter this and other biases, researchers strive to develop socio-culturally responsive assessments that are fair and equitable for all test-takers.

Data Reading & Interpretation

Large-scale data from intelligence tests have revealed important societal trends and statistical truths.

  • The Flynn Effect: This refers to the finding that intelligence test scores have been rising steadily for decades across many parts of the world. This trend suggests that environmental and societal factors—such as better nutrition, more complex schooling, and exposure to abstract problem-solving—play a significant role in cognitive abilities, rather than just innate factors.

  • Group Differences: While average scores on intelligence tests sometimes vary between different demographic groups, research consistently shows that the variation within any given group is far greater than the average difference between groups. This finding cautions against overgeneralizing or stereotyping based on group averages.

Ethics & Application

The data from intelligence tests have not always been used ethically. Historically, IQ scores were used to justify discriminatory practices, limiting access to education, jobs, and even immigration for certain groups. This history underscores the critical importance of interpreting test scores with caution and awareness of potential biases and inequities. Today, IQ scores are more often used to identify needs and provide access to educational services.

Data & Organization Tools

Theory & Assessment Matrix

ConceptCore IdeaPrimary Use/Example
General Intelligence (g)A single, underlying factor accounts for overall cognitive ability.Tests that yield a single IQ score to summarize ability.
Multiple AbilitiesIntelligence is composed of several distinct, independent abilities.Inspires tests that measure different skills (e.g., verbal, spatial).
Aptitude TestMeasures potential for future learning or performance.The SAT, used to predict first-year college performance.
Achievement TestMeasures what a person has already learned.A final exam in a history course.

Evidence Bank

  • General Intelligence (g): The theoretical single factor that underlies all specific mental abilities and is therefore measured by every task on an intelligence test.

  • Intelligence Quotient (IQ): Originally defined as the ratio of mental age to chronological age multiplied by 100. On contemporary tests, it is the average performance for a given age, which is assigned a score of 100.

  • Standardization: The process of defining uniform testing procedures and meaningful scores by comparison with the performance of a pretested group.

  • Reliability: The extent to which a test yields consistent results, as assessed by the consistency of scores on two halves of the test, on alternate forms, or on retesting.

  • Validity: The extent to which a test measures or predicts what it is supposed to.

  • Flynn Effect: The observed, long-sustained increase in intelligence test scores measured in many parts of the world from the 1930s to the present day.

  • Stereotype Threat: A self-confirming concern that one will be evaluated based on a negative stereotype, which can disrupt cognitive performance.

  • Aptitude Test: A test designed to predict a person's future performance or capacity to learn a new skill.

  • Achievement Test: A test designed to assess what a person has learned in a specific area.

  • Growth Mindset: The belief that intelligence is not fixed but can be developed through dedication and hard work. This belief can positively affect academic achievement.

Skill Snapshots

Mechanism Pairs

  • Cause: Consistent test procedures (standardization) → Effect: More comparable and meaningful scores across individuals.

  • Cause: Activating a negative stereotype (stereotype threat) → Effect: Reduced performance on a cognitive task due to anxiety and mental load.

  • Cause: Believing intelligence is malleable (growth mindset) → Effect: Increased effort and persistence, leading to higher academic achievement.

Perspective Contrasts

  • General Intelligence vs. Multiple Abilities: The "g" perspective sees intelligence as a single, core cognitive engine, while the multiple abilities perspective sees it as a set of separate, independent engines (e.g., verbal, spatial, musical).

  • Aptitude Test vs. Achievement Test: An aptitude test aims to predict your future performance (e.g., a musical aptitude test), whereas an achievement test assesses what you already know (e.g., a piano recital of a learned piece).

  • Historical IQ vs. Modern IQ: The historical IQ formula (mental age / chronological age) worked for children but not adults. Modern IQ scores are standardized by comparing an individual's performance to others in their same age group.

Change Track

  • Baseline: Early 20th-century intelligence tests were developed and used to sort individuals, often leading to biased and inequitable outcomes.

  • Change 1: The discovery of the Flynn Effect in the late 20th century demonstrated that societal and environmental factors, not just fixed traits, significantly influence IQ scores over time.

  • Change 2: Growing awareness of stereotype threat and cultural bias has led to efforts to create more socio-culturally responsive and equitable intelligence assessments.

  • Persistence: The fundamental debate over whether intelligence is a single general ability (g) or a collection of multiple, distinct abilities continues to shape research and test design.

Common Misconceptions & Clarifications

  1. Misconception: An IQ score is a fixed, permanent measure of a person's worth.

    Clarification: IQ scores can change over time and are influenced by many factors, including education and environment. They measure a narrow set of cognitive abilities, not a person's total value.

  2. Misconception: A test that is reliable must also be valid.

    Clarification: A test can produce highly consistent (reliable) results but still not measure what it claims to measure (invalid). For example, a test that uses head circumference to measure intelligence would be reliable but not valid.

  3. Misconception: Group differences in average IQ scores are primarily due to genetics.

    Clarification: The vast majority of research indicates that group differences are more heavily influenced by environmental, social, and economic factors, including access to education, nutrition, and the presence of systemic bias.

  4. Misconception: Intelligence tests and achievement tests measure the same thing.

    Clarification: While they are often correlated, intelligence (aptitude) tests aim to predict future potential, while achievement tests measure current knowledge and skills.

One-Paragraph Summary

The study of intelligence involves a complex debate over whether it is a single general ability or a set of multiple abilities. Measuring this construct depends on sound psychometric principles, including standardization, reliability, and validity, to ensure tests are consistent and accurate. However, the interpretation of test scores is complicated by systemic issues like cultural bias, stereotype threat, and the Flynn Effect, which shows that scores can rise with societal changes. Historically, these tests have been misused to create inequity, highlighting the importance of ethical application. Ultimately, distinguishing between intelligence as aptitude and academic achievement—and fostering a growth mindset that intelligence can develop—is crucial for understanding human potential.