Study smarter with Fiveable
Get study guides, practice questions, and cheatsheets for all your subjects. Join 500,000+ students with a 96% pass rate.
Language acquisition sits at the heart of cognitive psychology. It's where nature vs. nurture, biological constraints, social learning, and information processing all collide. When you're tested on this material, you're not just being asked to match theorists to their ideas. You're being assessed on whether you understand the fundamental debate: Is language learned, innate, or something in between? These theories represent different answers to that question, and each one emphasizes different cognitive mechanisms.
Exam questions frequently ask you to compare perspectives and apply them to scenarios. A free-response question might describe a child's language development and ask you to explain it through multiple theoretical lenses. Don't just memorize names and dates. Know what cognitive process each theory prioritizes and how they contradict or complement each other. That's where the points are.
These theories argue that language is fundamentally hardwired into the human brain. We're born ready to acquire it, and experience simply activates what's already there.
Chomsky's central claim is that all humans are born with universal grammar, an innate mental framework containing the structural principles common to all languages. The Language Acquisition Device (LAD) is his hypothetical brain mechanism that allows children to rapidly acquire complex grammar despite receiving messy, incomplete input from the adults around them.
His strongest argument is the poverty of the stimulus: children routinely produce grammatical sentences they've never heard before and avoid errors that would be logical if they were simply imitating. For example, a child who hears "He is happy" and "Is he happy?" never incorrectly forms "Is the boy who happy is running?" from "The boy who is happy is running." They somehow know which "is" to move, even though no one taught them the rule. This, Chomsky argues, proves language can't be purely learned from the environment.
Lenneberg proposed that there is a biological window for optimal language acquisition, roughly from birth to puberty. During this period, the brain has sufficient plasticity to fully acquire a language's grammar and phonology. After the window closes, acquisition becomes significantly harder.
The tragic case of Genie, a girl isolated from language input until age 13, provides supporting evidence. Despite years of therapy and language instruction, she never fully acquired grammar, though she did learn vocabulary. Research on second language acquisition tells a similar story: adults can build large vocabularies but consistently struggle to achieve native-like pronunciation and grammatical intuition.
Compare: Chomsky vs. Lenneberg: both emphasize biology, but Chomsky focuses on what we're born with (universal grammar) while Lenneberg focuses on when we must use it (critical period). Exam questions often ask you to use both together to explain why children outperform adults in language learning.
These theories emphasize that language is acquired through experience, whether through reinforcement, social interaction, or statistical patterns in input.
Skinner argued that language is learned the same way any other behavior is learned: through operant conditioning. A child babbles, and when a sound resembles a real word ("mama"), caregivers respond with attention and praise. That reinforcement makes the child more likely to repeat the sound. Over time, through imitation and shaping, children gradually approximate adult speech as closer and closer attempts are selectively reinforced.
Skinner laid this out in Verbal Behavior (1957), which applied behaviorist principles to all language use. Chomsky's 1959 review of this book is one of the most famous critiques in the field. Chomsky argued that reinforcement can't explain novel sentence production: children say things they've never heard and couldn't have been reinforced for. The "goed" example (overgeneralizing the past-tense rule) is a classic illustration, since no adult models that form and no one reinforces it.
Vygotsky placed social interaction at the center of language development. His key concept is the Zone of Proximal Development (ZPD): the gap between what a child can do alone and what they can do with help from a more capable speaker. Through scaffolding, caregivers provide structured support just beyond the child's current ability, gradually pulling back as the child gains competence.
For Vygotsky, language is fundamentally a cultural tool. Children don't just learn words; they learn how their community uses language to think, categorize, and solve problems. His concept of private speech (children talking to themselves while working through a task) illustrates this nicely. What starts as social communication gets internalized as a tool for guiding one's own thinking, showing that language actively shapes cognitive development.
Tomasello argues that children learn language by understanding what speakers mean, not just what they say. This capacity for intention-reading is what separates human language learning from anything other primates can do.
Children build grammar from the bottom up through pattern extraction. They hear high-frequency phrases ("I wanna ___," "Give me the ___") and gradually extract the abstract patterns underlying them. Rather than being born with grammatical rules, children generalize from thousands of specific examples.
Joint attention is critical to this process. Children learn new words most effectively when an adult and the child are both focused on the same object or event. This links language acquisition directly to social cognition: you need to understand what someone else is paying attention to before you can figure out what their words refer to.
Compare: Skinner vs. Vygotsky: both are "nurture" theorists, but Skinner sees the child as passive (shaped by reinforcement) while Vygotsky sees the child as active (constructing meaning through social participation). This distinction is heavily tested.
These theories position language acquisition as dependent on broader cognitive development. Children can only express what they can first understand.
Piaget's core claim is that cognition precedes language. A child must first develop a mental concept before they can learn a word for it. For instance, a child needs to grasp object permanence (the understanding that objects continue to exist when out of sight) before they can meaningfully use words like "gone" or "where."
Language is also constrained by Piaget's stages of development. A preoperational child (roughly ages 2-7) uses egocentric speech because they can't yet take another person's perspective. As a constructivist, Piaget believed children actively build linguistic knowledge through exploration and interaction with their environment. Language isn't passively absorbed; it reflects the child's current cognitive stage.
Compare: Piaget vs. Vygotsky: both are constructivists, but they disagree on the direction of the relationship between language and thought. Piaget says thought drives language; Vygotsky says language drives thought. This is a classic comparison question. Think of it this way: for Piaget, a child thinks first and then finds words; for Vygotsky, learning words actually transforms how the child thinks.
These theories model language acquisition as pattern recognition and neural network learning. The brain extracts statistical regularities from linguistic input without needing pre-built grammatical rules.
Elman showed that artificial neural networks can learn grammatical patterns purely from exposure to input, without any built-in rules. This directly challenges Chomsky's claim that grammar must be innate. In these models, statistical learning does the work: the network tracks which sounds follow which, which words tend to co-occur, and which sentence structures are most common.
Learning happens through gradual weight adjustment across connections in the network. This mirrors how children slowly refine their language through exposure, making errors along the way that reflect incomplete pattern learning rather than broken rules.
PDP models propose that processing language involves simultaneous activation of multiple representations. When you read a word, its spelling, sound, and meaning are all activated at the same time, not in a step-by-step sequence.
Knowledge is stored as distributed representations across networks of connections, not in single locations. This explains why brain damage typically causes partial rather than total language loss: if a concept is spread across many connections, losing some of them degrades performance without eliminating it entirely.
PDP models also neatly explain U-shaped learning curves. A child first says "went" (memorized from input), then shifts to "goed" (overgeneralizing the regular past-tense pattern as the network learns the rule), then returns to "went" (as the network adjusts to accommodate both regular and irregular forms).
Unlike single-mechanism connectionist models, Coltheart's dual-route model proposes two separate pathways for reading:
This model has strong clinical applications. Surface dyslexia results from damage to the lexical route (the person can sound words out but can't recognize irregular words like "yacht" as wholes). Phonological dyslexia results from damage to the phonological route (the person can read familiar words but struggles to sound out novel ones).
Compare: Connectionist vs. Dual-Route models: both are computational, but connectionist models use a single mechanism that learns everything, while dual-route models propose separate specialized pathways. This parallels the broader debate about modular vs. distributed processing in cognition.
These theories attempt to synthesize multiple factors, arguing that language emerges from the interaction of biology, cognition, and environment.
Bates rejected Chomsky's idea of a dedicated language module. Instead, she argued that language emerges from general cognitive abilities like memory, attention, and pattern recognition. There's no LAD; there's just a powerful brain doing what it does with any complex input.
Her dynamic systems perspective treats language development as nonlinear and context-dependent. Small changes in input or cognitive maturity can trigger large developmental shifts, much like how a small temperature change can cause water to suddenly boil.
The competition model is one of her most testable contributions. It proposes that learners weigh multiple cues (word order, morphology, animacy) when interpreting sentences, and the relative importance of each cue differs across languages. English speakers rely heavily on word order ("The dog bit the cat" vs. "The cat bit the dog"), while speakers of languages with richer morphology may rely more on case markings. This explains cross-linguistic variation in acquisition patterns.
Compare: Emergentism vs. Nativism: this is the fundamental theoretical divide in the field. Emergentists say language is domain-general (using the same cognitive systems as other learning); nativists say it's domain-specific (requiring dedicated language machinery). Know which theorists fall on each side.
| Concept | Best Examples |
|---|---|
| Innate/biological basis | Chomsky (universal grammar), Lenneberg (critical period) |
| Environmental learning | Skinner (reinforcement), Tomasello (usage-based) |
| Social-cultural factors | Vygotsky (ZPD, scaffolding), Tomasello (joint attention) |
| Cognition-language relationship | Piaget (cognition first), Vygotsky (language shapes thought) |
| Neural/computational processing | Elman (connectionist), McClelland & Rumelhart (PDP) |
| Reading-specific models | Coltheart (dual-route), PDP (distributed) |
| Integrative/emergent | Bates (emergentist), Tomasello (usage-based) |
Compare and contrast: How would Skinner and Chomsky each explain a child who says "I goed to the store"? What does this error reveal about the limits of behaviorist theory?
Which two theorists would most strongly disagree about whether language requires a specialized brain module? Explain the core of their disagreement.
A researcher finds that children in highly interactive households develop vocabulary faster than children who watch educational videos alone. Which two theories best explain this finding, and why?
FRQ-style: Describe how the Critical Period Hypothesis and Social Interactionist Theory could both be used to explain difficulties in second language acquisition for adult immigrants. Where do they agree and disagree?
If a patient has damage that impairs reading of familiar words but preserves the ability to sound out nonsense words, which model best explains this pattern? What route is damaged?