Distributed morphology


In generative linguistics, Distributed Morphology is a theoretical framework introduced in 1993 by Morris Halle and Alec Marantz. The central claim of Distributed Morphology is that there is no divide between the construction of words and sentences. The syntax is the single generative engine that forms sound-meaning correspondences, both complex phrases and complex words. This approach challenges the traditional notion of the Lexicon as the unit where derived words are formed and idiosyncratic word-meaning correspondences are stored. In Distributed Morphology there is no unified Lexicon as in earlier generative treatments of word-formation. Rather, the functions that other theories ascribe to the Lexicon are distributed among other components of the grammar.

Overview of Distributed Morphology

The basic principle of Distributed Morphology is that there is a single generative engine for the formation of both complex words and complex phrases; there is no division between syntax and morphology and there is no Lexicon in the sense it has in traditional generative grammar. In fact, Distributed Morphology completely rejects the notion of a lexicon in the way it had been used. Any operation that would occur in the 'lexicon' according to lexicalist approaches is considered too vague in Distributed Morphology, which instead distributes these operations over various steps and lists.
In contrast to lexicalist models of morphosyntax, Distributed Morphology posits three steps in building an utterance: first, the Formative List provides the input for syntax, followed by the syntactic operations themselves, then the Exponent List is consulted to provide the utterance with phonological content. The term Distributed Morphology is thus used because the morphology of an utterance is the product of operations distributed over more than one step, with content from more than one list.
There are three relevant lists in Distributed Morphology: the Formative List, the Exponent List, and the Encyclopedia. Items from these lists enter the derivation at different stages.

Formative list

The formative list, sometimes called the lexicon in Distributed Morphology includes all the bundles of semantic and sometimes syntactic features that can enter the syntactic computation. These are interpretable or uninterpretable features which are manipulated in syntax through the traditional syntactic operations. These bundles of features do not have any phonological content; phonological content is assigned to them only at spell-out, that is after all syntactic operations are over. The Formative List in Distributed Morphology differs, thus, from the Lexicon in traditional generative grammar, which includes the lexical items in a language.
As its name would suggest, the Formative List contains what are known as formatives, or roots. Roots have no grammatical categories in and of themselves, and merely represent the bundle of semantic features to be exponed. The notation for roots in Distributed Morphology generally uses a square root symbol, with an arbitrary number. For example, 'love,'without a grammatical category, could be expressed as √lv362.

Vocabulary items

Vocabulary items associate phonological content with arrays of underspecified syntactic and/or semantic features – the features listed in the Lexicon – and they are the closest notion to the traditional morpheme known from generative grammar. Postsyntactic Morphology posits that this operation takes place after the syntax itself has occurred.
Vocabulary items are also known as the Exponent List. In Distributed Morphology, after the syntax of a given utterance is complete, the Exponent List must be consulted to provide phonological content. This is known as 'exponing' an item. In other words, a vocabulary item is a relation between a phonological string and the context in which this string may be inserted. Vocabulary items compete for insertion to syntactic nodes at spell-out, i.e. after syntactic operations are complete. The following is an example of a vocabulary item in Distributed Morphology:
An affix in Russian can be exponed as follows:
/n/ <-->
The phonological string on the left side is available for insertion to a node with the features described on the right side.
Roots, i.e. formatives from the Formative List, are exponed based on their features. For example, the first-person singular pronominal paradigm in English is exponed as follows:
←→ /aj/
←→ /mi/
The use of /mi/ does not seem infelicitous in a nominative context at first glance. If /mi/ acquired nominative case in the syntax, it would seem appropriate to use it. However, /aj/ is specified for the feature , and therefore must block the use of /mi/ in a nominative context. This is known as the Maximal Subset Condition or the Elsewhere Principle: if two items have a similar set of features, the one that is more specific will win. Illustrated in logical notation:
f ⊂ f, f ⊂ f, and f ⊂ f → f wins.
In this case, both /mi/ and /aj/ have a subset of features f, but /aj/ has the maximal subset.

Encyclopedia

The Encyclopedia associates syntactic units with special, non-compositional aspects of meaning. This list specifies interpretive operations that realize in a semantic sense the terminal nodes of a complete syntactic derivation. For example, adjectives compárable and cómparable are thought to represent two different structures. First one, has a composition meaning of ‘being able to compare’ – root combines with a categorizer V- and the two combine with the suffix –able. The second one has an idiomatic meaning of ‘equal’ taken directly from the Encyclopedia – here root combines directly with the suffix –able.

Derivation

The Y-model of Minimalism, as well as the syntactic operations postulated in Minimalism, are preserved in Distributed Morphology. The derivation of a phrase/word proceeds as follows:
Distributed Morphology recognizes a number of morphology-specific operations that occur post-syntactically. There is no consensus about the order of application of these morphological operations with respect to vocabulary insertion, and it is generally believed that certain operations apply before vocabulary insertion, while others apply to the vocabulary items themselves. For example, Embick and Noyer argue that Lowering applies before Vocabulary insertion, while Local Dislocation applies afterwards.
Apart from the operations described above, some researchers have suggested that there are morphemes that represent purely formal features and are inserted post-syntactically but before spell-out: these morphemes are called "dissociated morphemes".

Morphological merger

Morphological Merger is generalized as follows in Marantz 1988: 261:
Morphological Merger:

At any level of syntactic analysis, a relation between X and Y may be replaced by the affixation of the lexical head of X to the lexical head of Y.
Two syntactic nodes can undergo Morphological Merger subject to morphophonological well-formedness conditions.

Fusion

Two nodes that have undergone Morphological Merger or that have been adjoined through syntactic head movement can undergo Fusion, yielding one single node for Vocabulary insertion. Many-to-one relation where two syntactic terminals are realized as a single exponent.
An example can be found in Swahili, which has separate exponents for subject agreement and negation :
tu- ta- pend-a kiswahili
we- will- love Swahili
ha- tu- ta- pend-a kiswahili
NEG- we- will- love Swahili
However, 1st person singular exponent ni- and negation ha- undergo fusion and realized as si-:
*ha- ni- ta pend-a kiswahili
NEG- I- will- love Swahili
si- ta- pend-a Kiswahili
NEG.I- will- love Swaihili
An alternative analysis of si- exponent says that there is no fusion but rather context sensitive allomorphy:
si- Ø ta- pend-a Kiswahili
NEG- I- will- love Swaihili

Fission

Fission refers to the splitting of one terminal node into two distinct terminal nodes prior to Vocabulary Insertion. Some of the most well-known cases of fission involve the imperfect conjugations of Semitic, in which agreement morphology is split into a prefixal and suffixal part, as investigated in the work of Noyer. Fission may also occur where insertion of a Vocabulary item discharges the intrinsic features of the Vocabulary item from the terminal node, leaving others features available for possible insertion; if fission applies, then other Vocabulary items can be inserted to discharge the remaining features. When Fission occurs, the order of morphemes is influenced by the featural complexity of Vocabulary items.

Impoverishment

Impoverishment refers to a change in the feature content on a terminal node prior to Vocabulary Insertion, resulting in a less marked feature content. This can be accomplished by deleting a feature or by changing it from a marked to an unmarked value. Impoverishment accounts for cases in which spell-out of a terminal node by a featurally specific Vocabulary Item is blocked by a less specific Vocabulary Item.
Impoverishment can also target an entire terminal node, in which case it is referred to as 'obliteration'. This results in the complete absence of the morpheme from the structure of the word.

Lowering

Lowering is sensitive to syntactic headedness and operates on abstract feature bundles, after syntactic movement but prior to vocabulary insertion. Lowering takes place when a head X lowers to the head of its complement, Y. For example, T in English lowers to be realized on the head of its complement V, as in "John TP tT [vP play-ed piano." An adjoined adverb will not block this syntactic movement, since it is sensitive to syntactic headedness rather than linear adjacency: "John skillfully play-ed piano." [On the other hand, a Merged Negation head will block this movement and trigger 'do insertion':" John did not play piano".

Local dislocation

String-adjacent Vocabulary items may undergo Local Dislocation, in which the two items form a unit, with reversed linear order. Embick and Noyer suggest that linearization takes place at Vocabulary Insertion. At this point it is possible to reorder linearly adjacent vocabulary items. This reordering must respect the relationship between the constituents, however. In a linearization *Y, since Z is still left-adjacent to Y though Z is now an internally complex head. The relationship between X and Z has been properly converted through Local Dislocation. Since the relationships between the constituents have been respected or properly converted, the derivation is well-formed. Local Dislocation applies after Vocabulary insertion to reorder two linearly adjacent elements, such as the comparative feature and an adjective in John is smarter than Mary., which contrasts with John is more intelligent than Mary.; in this case the movement makes reference to the phonological features of the moved items, moving -er after an adjective of one syllable, while leaving more in a position dominating the adjective .

Distributed morphology approach to core theoretical issues

Morpheme order

In Distributed Morphology, the linear order of morphemes is determined by their hierarchical position in the syntactic structure, as well as by certain post-syntactic operations. Head movement is the main syntactic operation determining morpheme order, while Morphological Merger is the main post-syntactic operation targeting affix order. Other post-syntactic operations that might affect morpheme order are Lowering and Local Dislocation.
The general principle behind morpheme order is the Mirror Principle, according to which the linear order of morphemes is the mirror image of the hierarchy of syntactic projections. For example, in a plural noun like cat-s, the plural morpheme is higher in the hierarchy than the noun: NumP -s[NP cat. [The Mirror Principle dictates that the linear order of the plural morpheme with respect to the noun should be the mirror image of their hierarchy, namely the attested cat-s. The syntactic mechanism responsible for the effects of the Mirror Principle is head movement: heads raise and left-adjoin to higher heads. Moreover, head movement is subject to the Head Movement Constraint, according to which when a head moves, it cannot skip an intervening head. Left adjunction and the Head Movement Constraint ensure that the Mirror Principle holds.
Research subsequent to Baker has shown that there are some apparent violations to the Mirror Principle and that there are more operations involved in the determination of the final linear order of morphemes. Firstly, the left adjunction requirement of head movement has been relaxed, as right adjunction has been shown to be possible. Therefore, different heads can have a specification for right vs. left adjunction. We could imagine, for example, that there is a language in which head movement of the noun to the Number head is specified for right adjunction, rather than left adjunction which is the case in English. In that language, the predicted order of the noun and plural morpheme would then be s-cat. Notice, however, that right vs. left adjunction determines whether an affix will be realized as a prefix or a suffix: its closeness to the root will still reflect hierarchical order. Let's look at a hypothetical example to make this clear. Assuming that Tense is merged higher than Aspect, there are four possible orders for the tense and aspect morphemes with respect to the verb stem, once we allow for variation between left and right adjunction in head movement.
  1. Verb – Aspect – Tense
  2. Tense – Verb – Aspect
  3. Aspect-Verb-Tense
  4. Tense-Aspect-Verb
Crucially, though, the following two orders are predicted to be impossible.
  1. Aspect-Tense- Verb
  2. Verb – Tense – Aspect
These orders are the orders in which the tense morpheme is closer to the root than the aspect morpheme. Since Aspect is merged before Tense and morpheme order still reflects hierarchical order, such a configuration is predicted to be impossible.
Finally, certain post-syntactic operations can affect morpheme order. The best studied one is Morphological Merger or Merger Under Adjacency. This operation merges two adjacent terminal nodes into one morphological word. In other words, it allows for two heads which are adjacent to merge into one word without syntactic head movement – the operation is post-syntactic. This operation is doing the work of, say, affix lowering of the past tense morpheme in English in early generative syntax. For the operation to apply, what is crucial is that the morphemes to be merged are linearly adjacent.

Allomorphy

A core idea in deriving allomorphy in Distributed Morphology is underspecification. Verbal agreement in the present tense in English takes the form /-s/ in the 3rd person singular, and /Ø/ in all other cases. The phonological exponents of the feature bundle terminal nodes in the syntactic tree are listed in the Exponent List. We can capture the fact that /-s/ has a much smaller distribution than /Ø/ using the following entries in the Exponent List:
  1. ↔ /-s/
  2. ↔ /Ø/
/-s/ will be inserted whenever its full featural specification is met. In all other cases in the present tense however, such as or , /Ø/ will be inserted. This is a use of underspecification, the idea that there is a ‘default’ morpheme that is inserted in the general case, and more specific morphemes that are inserted in more specific cases, when their featural specifications are met. In the above example, /Ø/ is underspecified in the sense that it is not specified for person. Underspecification relies on the ‘Maximal Subset Condition.’
The Maximal Subset Condition states firstly that, for a given exponent E to be inserted into some feature bundle T, the featural specification on E must be a subset of the features on T. In this way, /-s/ is not a possible exponent for a feature bundle . However, /Ø/ is a possible exponent for the feature bundle . To ensure that /-s/ is chosen over /Ø/ for the bundle , the Maximal Subset Condition states secondly that, between two exponents E and F which both contain a subset of the features in a feature bundle T, the exponent that contains the maximal subset of the features in T will be selected.
Featural specification derives allomorphy in featural paradigms. Allomorphy in which different phonological exponents of the same feature bundle are idiosyncratically realized depending on the morphological or phonological environment is captured through contextual specification. An example of such allomorphy is the English plural marker. The typical English plural marker is /-z/, as in bulls. However, the plural of child is children, and the plural of cactus is cacti. Since the choice of the plural morpheme exponent is not related to features, but rather simply to the root it attaches to, the roots must be listed in the contextual specification:
  1. ↔ /-z/
  2. ↔ /-ren/ / _
  3. ↔ /-i/ / _
If the contextual specification of some item is met, it is inserted. Otherwise, insert the item that has no contextual specification. This is an example of the ‘elsewhere condition’. Note that the Maximal Subset Condition stated above is a formal instantiation of the elsewhere condition.
Contextual specification is also used to account for phonologically conditioned suppletive allomorphy, using phonological contexts. Thus, the singular indefinite marker in English can be stated as follows :
  1. ↔ an / _V
  2. ↔ a / _C
A prediction about suppletive allomorphy in Distributed Allomorphy is that, assuming exponents are inserted in a bottom-up fashion of the syntactic tree, it should always be ‘inward-looking.’ This means that contextual allomorphy can only involve the selection of an allomorph based on something lower in the tree. That is, the contextual environments must always involve items lower in the tree.
Morphologically conditioned allomorphy may involve suppletion or readjustment rules that apply in the context of certain Vocabulary items. Suppletion and readjustment rules apply to a terminal node and its associated Vocabulary item – unlike affixation, which combines this terminal node with a separate terminal node that has its own distinct Vocabulary item. Suppletion arises from the competition of Vocabulary items for insertion into a terminal node. Competition involving root Vocabulary items is a topic of ongoing research, however. Early work in Distributed Morphology suggests that a single, abstract lexical root appears in the syntax; in this view, roots do not compete for insertion into root nodes, but exist in free variation, constrained only by semantic and pragmatic well-formedness. Subsequent research has suggested that the distribution of root Vocabulary items can be grammatically restricted ; this means that roots may be featurally restricted and thus subject to competition. The issue of whether root alternations such as buy-Ø/bough-t are better handled by suppletion or readjustment rules remains a topic of debate.
The term suppletion refers to allomorphy of an open-class lexical item. For a large-scale study of suppletion in the context of comparative and superlative adjectival morphology within the general framework of Distributed Morphology, see Bobaljik.

Containment hypothesis

The containment hypothesis is a theory under the framework of Distributed Morphology advanced by Bobaljik to account for the restrictions on the patterns of suppletion seen in language. It states:
"The representation of the superlative properly contains that of the comparative."
Bobaljik argues that if a language has a superlative form, it must build off the comparative form. In other words, the superlative form and the bare adjective cannot be built off the same root to the exclusion of the comparative, because the comparative is necessarily contained within the superlative form. Thus, languages allow:
  1. AAA: All three forms to be built off the same adjectival root
  2. ABB: The comparative and superlative are suppletive and thus built off a different root from that of the bare adjective
  3. ABC: The bare adjective, comparative, and superlative are all built off different roots
PatternLanguageAdjectiveComparativeSuperlative
AAAPersianXX-tærX-tær-in
AAAHungarianXX-bbleg-X-bb
AAAChukchiXX-əŋənan-X-əŋ
ABBEnglishgoodbetterbest
ABBCzechšpatn-ý
'bad'
hor-ší
'worse'
nej-hor-ši
'worst'
ABCLatinbonus
'good'
melior
'better'
optimus
'best'

The exclusion of the *ABA pattern means that there should theoretically exist no language with a pattern analogous to *good – better – goodest. AAB patterns are theoretically possible as well but happen to be rare in the world's languages. Under the model of Distributed Morphology, the structure of a superlative would be Comparative ] Superlative ]. Thus, the exponent of the comparative morpheme attaches to the bare adjective and the exponent of the superlative morpheme attaches to the comparative form. The following rules may be posited for Czech as an example:
  1. √BAD → špatn-
  2. √BAD → hor- / ___ CMPR
  3. CMPR → -ši
  4. SPRL → nej-
Rule 2 above states that the root BAD is suppleted in the environment of a comparative. By extension, the superlative form attaches the prefix nej- to horši and not to špatn- as the comparative is the only structure it can see. For Latin, in which both the comparative and the superlative are suppleted, the following rules may be posited:
  1. √GOOD → bon-
  2. √GOOD → mel- / ___ CMPR
  3. √GOOD → opt- / ___ SPRL
  4. CMPR → -ior
  5. SPRL → -imus
As an *ABA pattern would require the adjective to directly combine with the superlative node, it is theoretically impossible because of the intervening comparative node and is also unattested in the world's languages. Nevins supports the structure by arguing that the semantics of the superlative is dependent on that of the comparative.
  1. Comparative: Y is more ADJ than X
  2. Superlative: For all X, Y is more ADJ than X
The comparative definition is contained within the superlative one and thus, the superlative must obligatorily subsume it in its structure.

Morphological paradigms

In Distributed Morphology morphological paradigms are seen as epiphenomena. Irregular forms or gaps associated with paradigms are explained via competition for vocabulary insertion.

Meaning in Distributed Morphology

In Distributed Morphology there are two different types of meaning: the meaning associated with the bundles of features of the Lexicon and the idiosyncratic meaning listed in the Encyclopedia. It is believed that encyclopedic meaning is associated with lexical roots, rather than with complete phrases.

Allosemy

Allosemy – the phenomenon in which a single morpheme can have multiple semantic realizations – is handled the same way allomorphy is handled in DM: through contextual specification and the Elsewhere Condition. The Encyclopedic List contains the semantic meaning and context for each entry in the list. When a single morpheme is realized with multiple possible meanings, it has multiple entries in the Encyclopedic List. Thus, we can derive multiple possible meanings of ‘look’ with the following entries:
  1. √ look ←→ PHYSICAL APPEARANCE / V Adj
  2. √ look ←→ NOTABLE GLANCE / N
  3. √ look ←→ CONSULT A SOURCE / V
  4. √ look ←→ λ x, λ y, y looks at x / elsewhere
The contextual specifications for √ look will ensure that it has the appropriate interpretation given the context. The contextual specifications can only include various pieces of information, such as the word class of the item, the word class of a sister node, or even the specific morpheme of the sister node. However, it cannot contain any information about a non-sister node. And, just as in the Exponent List, an item in the Encyclopedic List can be specified as being inserted ‘elsewhere:’ in all contexts where the contextual specification of all other entries for that morpheme are not met. Besides expressing the notion of a ‘default’ semantic meaning of a given morpheme, having the elsewhere condition as an explicit contextual specification also allows for the expression of morphemes that do not have a default semantic meaning.
For many English speakers, ‘cahoot’ only has an interpretation when preceded by ‘in’ The entry for ‘cahoot’ might have the following entry for such a context:
  1. √ cahoot ←→ CONSPIRACY / ]

To express the fact that this is the only context where cahoot has a meaning, we simply posit that there is no entry for cahoot with the elsewhere condition for its contextual specification. Thus, by allowing for the omission of the elsewhere specification, we can express the fact that certain morphemes require a specific context for interpretation.
We can use contextual specification to model other aspects of idiomatic meaning, namely, the fact that idiomatic meaning often does not hold across different syntactic configurations. The expression the shit hit the fan loses its idiomatic meaning if passivized: #the fan was hit by the shit. We can express this fact by specifying voice on the contextual specification for the verb:
  1. √ hit ←→BECOME AN EXTREME SITUATION /
  2. √ eat ←→ FINISH THE FOOD /
The FINISH THE FOOD entry for √ eat will be inserted whenever eat up is followed by any food object. However, when eat up is followed by a non-food object, both entries’ contextual specifications will be met. However, the ENJOY entry is inserted, because more conditions in its contextual specification are met. Thus, the Maximal Subset Condition will choose the alloseme whose contextual specification is most completely satisfied, when there is competition among Encyclopedic List entries.

Crosslinguistic variation

Since the model of Distributed Morphology consists of three lists, we expect crosslinguistic variation to be located in all three of them. The feature bundles and their structure might be different from language to language, which could affect both syntactic and post-syntactic operations. Vocabulary Items can also be different crosslinguistically. Finally, the interpretation of roots is also expected to show variation.