Every spoken word can be broken into syllables, the basic units of pronunciation. Each syllable is built around a vowel sound, with optional consonants attached before or after it. Languages differ in how they allow these pieces to combine, and those rules (called phonotactics) shape everything from word formation to how we perceive speech.

Components of Syllables

A syllable has up to three parts:

Onset – the consonant(s) at the beginning of a syllable. In "stop," the onset is /st/. The onset is optional; a word like "at" has no onset on its first syllable.
Nucleus – the core of the syllable, almost always a vowel (though syllabic consonants like the /l/ in "bottle" can fill this role). The nucleus is the only obligatory part.
Coda – the consonant(s) at the end of a syllable. In "best," the coda is /st/. Like the onset, the coda is optional.

The nucleus and coda together are often grouped as the rhyme (sometimes spelled rime). That's why "cat" and "bat" rhyme: they share the same nucleus + coda sequence.

A syllable with no coda (like "go" or "ba") is called an open syllable. A syllable with a coda (like "bat" or "best") is called a closed syllable.

Phonotactics and Sound Sequences

Phonotactics are the language-specific rules that determine which combinations of phonemes are allowed and which are not. These rules constrain syllable structure, word formation, and even how languages borrow words from other languages.

A few examples of phonotactic constraints:

In English, /ŋ/ (the "ng" sound) can appear in the coda ("sing") but never in the onset. You won't find an English word starting with that sound.
English allows onset clusters like /pl/ ("play") and /str/ ("street"), but not /pw/ or /tl/.
Japanese has very strict phonotactics: most syllables follow a CV pattern, and consonant clusters are generally not permitted.

Phonotactics also affect speech perception. Native speakers unconsciously know which sound sequences are legal in their language, and they use that knowledge to segment continuous speech into words.

Components of syllables, A perfect end: A study of syllable codas in South African Sign Language

Syllable Structure Across Languages

Languages vary widely in how complex their syllables can be:

CV (consonant + vowel) is the most universal syllable type. Nearly every language allows it, and many languages strongly prefer it.
Some languages permit only simple structures. Hawaiian, for instance, allows only V and CV syllables.
Others allow much more complexity. English permits structures as heavy as CCCVCCCC (as in "strengths," /strɛŋkθs/).

Syllabification is the process of dividing a word into its component syllables. Different languages apply different rules for where syllable boundaries fall. For example, a word like "extra" could be syllabified as /ɛk.strə/ or /ɛks.trə/ depending on the language's rules about how consonants are assigned to onsets versus codas. A common cross-linguistic tendency is the Maximal Onset Principle: consonants between vowels are assigned to the onset of the following syllable whenever the language's phonotactics allow it.

The Sonority Sequencing Principle

The Sonority Sequencing Principle (SSP) helps explain why certain consonant clusters are allowed and others aren't. Sonority is roughly how "loud" or "open" a sound is. Vowels are the most sonorous, then glides, then liquids, then nasals, then fricatives, then stops (least sonorous).

The SSP states that sonority should rise from the onset toward the nucleus and fall from the nucleus toward the coda. That's why English allows /pl/ in an onset (stop → liquid = rising sonority) but not /lp/ (liquid → stop = falling sonority).

Some clusters violate the SSP. English /st/ in "stop" is a common example (fricative → stop = falling sonority in the onset). These are treated as exceptions or analyzed as falling outside the core syllable structure.

Components of syllables, Frontiers | Syllable Structure Universals and Native Language Interference in Second Language ...

Syllable Structure in Language Use

Syllable knowledge connects to several broader patterns:

Allophonic variation is often tied to syllable position. In English, aspirated [pʰ] appears in syllable-initial position ("pin"), while unaspirated [p] appears after /s/ in the same syllable ("spin").
Vowel sequences are handled differently across languages. Some allow diphthongs (two vowel qualities in one nucleus, like /aɪ/ in "ride"). Others avoid vowel hiatus (two vowels in a row across syllable boundaries) by inserting a consonant or merging the vowels.
Language acquisition reflects syllable structure: children tend to produce CV syllables first, regardless of their native language. Second-language learners often struggle with syllable types that don't exist in their first language, sometimes inserting extra vowels to break up unfamiliar clusters (e.g., a Japanese speaker pronouncing "strike" as something closer to /sɯtoraɪkɯ/).

Comparing Syllable Types Across Languages

Feature	Simple syllable languages (e.g., Japanese, Hawaiian)	Complex syllable languages (e.g., English, Georgian)
Common patterns	CV, V	CVC, CCVC, CVCC, and beyond
Consonant clusters	Rare or absent	Common in onsets and/or codas
Open vs. closed syllables	Strongly favor open	Both open and closed are frequent

Universal tendencies emerge across languages: CV syllables are the most common type worldwide, complex codas are generally rarer (more "marked") than complex onsets, and open syllables are preferred over closed ones. These patterns suggest that syllable structure isn't just arbitrary but reflects something about how human speech production and perception work.

2,589 studying →