Operant Conditioning | AP Psych Unit 3 Study Guide

Getting Started

Why does a child learn to say "please" to get a cookie, or a student study to earn a good grade? The answers lie in how we learn from the consequences of our actions. Operant conditioning is a type of learning in which voluntary behavior is strengthened if followed by a reinforcer or diminished if followed by a punisher, explaining how we acquire a vast range of skills and habits.

What You Should Be Able to Do

Explain how consequences (reinforcement and punishment) alter the future likelihood of a behavior.
Compare and contrast positive reinforcement, negative reinforcement, positive punishment, and negative punishment.
Describe how complex new behaviors can be taught through the process of shaping.
Analyze how different schedules of reinforcement affect the rate of learning and the persistence of a behavior.
Apply conditioning principles to explain phenomena like superstitious behavior and learned helplessness.

Key Developments & Analysis

Baseline & Context

Before operant conditioning occurs, an organism exists in a state of performing various natural, voluntary behaviors. This is the behavioral baseline. The context is an environment where these actions can produce different outcomes or consequences. The fundamental principle at play is the Law of Effect, which states that behaviors followed by favorable consequences become more likely, and behaviors followed by unfavorable consequences become less likely. Learning occurs as the individual discovers the connection between its actions and the outcomes they produce.

Change Processes

Acquisition via Reinforcement: A behavior is acquired or strengthened through reinforcement, a consequence that increases the likelihood of that behavior occurring again. For example, if a rat presses a lever and receives a food pellet, it will learn to press the lever more often. The food pellet is the reinforcer that drives the change from random behavior to a deliberate, learned action.
Suppression via Punishment: A behavior is suppressed or weakened through punishment, a consequence that decreases the likelihood of that behavior occurring again. If the rat presses the lever and receives a mild electric shock, it will learn to avoid the lever. The shock is the punisher that causes the change, reducing the frequency of the lever-pressing behavior.
Shaping & Successive Approximations: Complex behaviors are rarely learned all at once. Shaping is the process of guiding an organism's behavior toward a desired goal by reinforcing successive approximations—small steps that get closer and closer to the final behavior. To teach a dog to roll over, you might first reward it for lying down, then for lying down and tilting to one side, and finally only for completing the full roll.

Stability vs. Change

Persistence through Partial Reinforcement: Once a behavior is learned, its stability depends on the pattern of reinforcement. Behaviors that are rewarded every single time (continuous reinforcement) are learned quickly but also extinguish quickly once the reward stops. In contrast, behaviors maintained on partial (or intermittent) reinforcement schedules, where rewards are given only sometimes, are much more resistant to extinction. This is why gambling can be so addictive; the unpredictable wins keep the behavior stable and persistent.
Learned Helplessness: Sometimes, an individual learns that their behavior has no effect on the consequences they experience. Learned helplessness is a state of passive resignation that occurs when an organism repeatedly faces uncontrollable, aversive events. After learning that nothing they do can stop the negative outcome, they may stop trying altogether, even when presented with a new situation where their actions could be effective. This creates a stable, maladaptive pattern of inaction.

Data & Organization Tools

Process Sequence: Shaping a New Behavior

This table illustrates how a complex behavior, like a child learning to clean their room, can be shaped over time.

Step	Target Behavior (Successive Approximation)	Consequence (Reinforcement)	Outcome
1	Puts one toy in the toy box.	Parent gives verbal praise ("Great start!").	Child is more likely to put toys away.
2	Puts several toys in the toy box.	Parent gives praise and a sticker.	Child learns a larger part of the task.
3	Puts all toys away.	Parent gives praise and reads an extra story.	Child masters one component of cleaning.
4	Puts all toys away and makes the bed.	Parent gives praise and allows extra screen time.	The full, complex behavior is established.

Evidence Bank

Law of Effect: The principle, developed by Edward Thorndike, that behaviors followed by satisfying consequences become more likely to recur, while behaviors followed by unsatisfying consequences become less likely.
Operant Conditioning: A type of learning, most closely associated with B.F. Skinner, in which behavior is controlled by its consequences.
Reinforcement: Any event or stimulus that, when following a response, increases the probability that the response will occur again.
Punishment: Any event or stimulus that, when following a response, decreases the probability that the response will occur again.
Shaping: An operant conditioning procedure in which reinforcers guide behavior toward closer and closer approximations of the desired behavior.
Primary Reinforcer: An innately reinforcing stimulus, such as one that satisfies a biological need like food, water, or warmth.
Secondary (Conditioned) Reinforcer: A stimulus that gains its reinforcing power through its association with a primary reinforcer; examples include money, grades, or praise.
Learned Helplessness: The hopelessness and passive resignation an animal or human learns when unable to avoid repeated aversive events.
Superstitious Behavior: Behavior that is accidentally or non-contingently reinforced, leading the organism to repeat it as if the action caused the desired outcome.
Reinforcement Schedule: A pattern that defines how often a desired response will be reinforced.

Skill Snapshots

Mechanism Pairs

Cause → Effect: A voluntary behavior is followed by a reinforcing consequence → The frequency of that behavior increases.
Cause → Effect: Reinforcement is delivered only for behaviors that get progressively closer to a goal → A new, complex behavior is shaped.
Cause → Effect: Reinforcement is delivered on an unpredictable (variable) schedule → The learned behavior becomes highly resistant to extinction.

Perspective Contrasts

Reinforcement vs. Punishment: Reinforcement strengthens or increases the frequency of a behavior, whereas punishment weakens or decreases its frequency.
Positive vs. Negative: The term "positive" means a stimulus is added or presented, while "negative" means a stimulus is removed or taken away.
Ratio vs. Interval Schedules: Ratio schedules deliver reinforcement based on the number of responses made, while interval schedules deliver reinforcement based on the first response after a certain amount of time has passed.

Change Track

Baseline: A student does not regularly complete their homework assignments.
Change 1 (Positive Reinforcement): The teacher praises the student and gives them a sticker for turning in a completed assignment. The student begins completing homework more often.
Change 2 (Shaping): The teacher now only provides praise for assignments that are both complete and on time, shaping more responsible behavior.
Persistence: The teacher switches to a variable-interval schedule, giving praise for on-time work unpredictably (e.g., on Monday, then Friday, then Tuesday). The student's habit of completing homework on time becomes stable and persists even on days without praise.

Common Misconceptions & Clarifications

"Negative" does not mean "bad." In conditioning, "negative" simply means the removal of a stimulus. Negative reinforcement (e.g., taking aspirin to remove a headache) is a pleasant outcome that strengthens the behavior that caused it.
Punishment is not the same as negative reinforcement. Punishment decreases a behavior. Negative reinforcement increases a behavior by removing an unpleasant stimulus.
Operant conditioning applies to voluntary actions, not reflexes. It explains how we learn to operate on our environment (e.g., pressing a button, asking a question), not involuntary responses like flinching or salivating.
Superstitious behaviors are a product of accidental reinforcement. If a baseball player hits a home run while wearing a certain pair of socks, they may associate the socks with success and continue wearing them, even though the reinforcement (the home run) was not actually caused by the socks.

One-Paragraph Summary

Operant conditioning is a fundamental learning process where voluntary behaviors are shaped by their consequences, as described by the Law of Effect. Actions followed by reinforcement (either adding a positive stimulus or removing a negative one) are strengthened, while those followed by punishment are weakened. Complex skills are built gradually through shaping, which involves rewarding successive approximations of a target behavior. The pattern of reinforcement, known as the schedule, is critical; partial reinforcement schedules create behaviors that are much more persistent and resistant to extinction than those rewarded continuously. This framework helps explain a wide range of human and animal behaviors, from simple habits to complex phenomena like learned helplessness and superstitious actions.

Operant Conditioning - AP Psychology Study Guide