Word-formation rules

Section 5.6 Word-formation rules

As we said in Section 5.10, morphology is the study of words and their internal structure — since simple words do not have internal structure, this means in particular complex words. We have seen a range of such words and word forms now, most importantly, words consisting of a base and one or more derivational affixes, like /ˈlɑː.ləs/ lawless or /ʌnˈlɑː.fəl/ unlawful, words consisting of a base and an inflectional morpheme, like /lɑːz/ laws, and words consisting of two (or more) free morphemes, like /ˈlɑː.mən/ lawman.

🔗

In order to account for the internal structure of these words, we need to describe the bound morphemes along three dimensions: their form, their meaning and their combinatorial properties, i.e., (a) where they attach to the base (whether they are prefixes, suffixes, circumfixes or infixes), (b) what wordclass the base must have, and (c) what wordclass the derivative word has.

🔗

In morphology, such a description typically takes the form of a word-formation rule (WFR) — essentially, a rule for attaching a bound morpheme to a base. There is no generally agreed format for such rules, but all formats you are likely to encounter will make reference to the three dimensions just mentioned. A relatively typical example of such a format would look like this:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{WORDCLASS}}} ~~ \text{AFF]}{\tiny{\text{WORDCLASS}}} \\ \text{Meaning:} \amp \text{‘paraphrase of meaning’} \\ \text{Conditions:} \amp \text{additional conditions} \\ \end{array}\tag{5.6.1} \end{equation}

🔗

The first line shows the form: The base is represented by an X with a subscript specifying the word class, the affix is shown in the position it has relative to the stem. The combination of base and affix is enclosed in square brackets with a subscript specifying the word class of the derivative. The second line shows the meaning in the same way we have used throughout the book so far: either as a paraphrase in single quotation marks, or in uppercase. This notation can be read as follows:

🔗

Take a word X with the specified word class and attach the affix in the position shown, and you will get a new word (or word form) with the specified word class and meaning.
🔗

🔗

The third line describes any additional conditions that must hold in order for the rule to apply (if there are such conditions).

🔗

The WFR for the affix {‑less} would look like this:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{NOUN}}} ~~ \text{/ləs/]}{\tiny{\text{ADJECTIVE}}} \\ \text{Meaning:} \amp \text{‘without X’} \\ \text{Conditions:} \amp \text{—} \\ \end{array}\tag{5.6.2} \end{equation}

🔗

Note that we do not need a hyphen in front of the affix, as the order of base and affix is directly represented in the rule, but we will see below that there are good reasons to mark the boundary between base and affix anyway.

🔗

The word-formation rules for derivational and inflectional affixes follow the same structure — any differences between the two types of affix can be included among the conditions, if necessary. For example, the WFR for the plural affix {-z} would look like this:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{NOUN.SINGULAR}}} ~~ \text{/z/]}{\tiny{\text{NOUN.PLURAL}}} \\ \text{Meaning:} \amp \text{PLURAL} \\ \text{Conditions:} \amp \text{obligatory in plural contexts} \\ \end{array}\tag{5.6.3} \end{equation}

🔗

Note that we need to specify that the base must be a singular noun — if we did not, the rule could apply recursively, adding plural affixes to words that already contain such an affix: *law-s-s-s-s-s-….

🔗

These WFRs may look simple, but they are very powerful tools for capturing the structure of polysyllabic words. Take the word unlawful. Recall that we pointed out that in order to derive this word, we must first derive the word lawful and then add the affix {un‑}. The other order would not work, because the word *unlaw does not exist. These facts follow automatically from the following WFRs:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{NOUN}}} ~~ \text{/fəl/]}{\tiny{\text{ADJECTIVE}}} \\ \text{Meaning:} \amp \text{‘having or causing X’} \\ \text{Conditions:} \amp \text{X must be an abstract quality} \\ \end{array}\tag{5.6.4} \end{equation}

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[/ʌn/ X}{\tiny{\text{ADJECTIVE}}}\text{]}{\tiny{\text{ADJECTIVE}}} \\ \text{Meaning:} \amp \text{‘not X’} \\ \text{Conditions:} \amp \text{X must be a default property} \\ \end{array}\tag{5.6.5} \end{equation}

🔗

The form *unlaw cannot be derived, because the WFR for {un‑} specifies that the base must be an adjective. Thus, the rule for {‑ful} (or another rule deriving adjectives from nouns) must apply first, giving us first lawful and then unlawful.

🔗

The two rules also illustrate the kind of conditions that we may have to specify for a WFR. The rule for {‑ful} specifies that the base must be an abstract noun — this is to avoid the formation of words like *homeful, *stainful, *breathful, *priceful, *windowful etc. Note that {‑less} does not have this restriction, so we get words like homeless, stainless, breathless, priceless, windowless.

🔗

The rule for {un‑} specifies that the base must refer to a “default property” — by this we mean properties that are assumed to be the “normal” case. We expect lawfulness to be normal state of affairs, so we get unlawful, but not *unlawless, we expect happiness to the the normal state of affairs, so we get unhappy, but not *unsad, we expect necessity to be the normal state of affairs so we get unnecessary but not *unsuperfluous, and so on. Admittedly, this concept is somewhat vague, but that is why linguistics is still an area of active research!

🔗

Question 5.6.1.

Note that we also do not get words like *unlong or *unshort, *unlight, *undark or *unheavy, *unfar or *unclose, *unnew or *unold, etc. What additional conditions might we have to posit in the WFR for {un‑} in order to avoid the derivation of these forms?

🔗

When you look at words created by a specific WFR, you will often note that they mean something more specific or even something slightly different than what the rule would lead you to expect. For example, the rule for {‑ful} states that the meaning of derivatives with this affix should be ‘having/causing X’, and for many cases, this is a decent definition: beautiful ‘having beauty’, successful ‘having success’, powerful ‘having power’, painful ‘causing pain’, stressful ‘causing stress’ etc. The first question would be how we know when the affix means ‘having X’ and when it means ‘causing X’. Why do we use beautiful for something that has beauty (a beautiful person), but not for something that causes beauty (*a beautiful surgery for a plastic surgery), and painful for something that causes pain (a painful surgery) but not something that has pain (a painful person for someone who is in pain)? The answer is that the WFR specifies the range of meanings that the derivate can have, but once the derivate is established, it will typically have a more specific meaning rather than the whole range of meanings. It may even develop a slightly different meaning altogether, like plentiful, which means neither ‘having plenty’ nor ‘causing plenty’, but ‘being available in great quantity’, or mindful, which means neither ‘having a mind’ nor ‘causing a mind’, but ‘paying careful attention to something.

🔗

Subsection Conversion

There is a specific type of derivation that would be difficult to describe in terms of affixation but that can easily be captured by a WFR. Consider the following examples:

🔗

(1) 🔗: I will send you an email → I will email you.

🔗

(2) 🔗: They will send our order by ship → They will ship our order.

🔗

(3) 🔗: Do not make a fool of me → Do not fool me.

🔗

It seems that derivation is taking place — nouns are being made into verbs. However, there is no affix accompanying the change. Some linguists have suggested that there may be an ‘invisible’ zero affix {-Ø), so that the verb email actually has the structure email-Ø. But invisible affixes are a weird idea, and we do not need them. We can simply posit a WFR like the following:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{NOUN}}}\text{]}{\tiny{\text{VERB}}} \\ \text{Meaning:} \amp \text{‘do something involving X’} \\ \text{Conditions:} \amp \text{—} \\ \end{array}\tag{5.6.6} \end{equation}

🔗

The only difference between this and the previously shown WFRs is that there is no affix — because the derivatives do not contain affixes! In other words, the rule says: “You can take a noun and use it as a verb, and the meaning will be ‘to do something involving that noun’”. Because there is no affix, most linguists refer to this type of derivation as conversion. Similar rules can be posited to account for the conversion of verbs to nouns (to walk → a walk), adjectives to nouns (rich → the rich), etc.

🔗

Subsection Clitics

We can also extend WFRs to clitics: recall the English possessive clitic {=’s}, which attaches to noun phrases instead of nouns. This can be captured very straightforwardly in the following WFR:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{NOUN.PHRASE}}} ~ \text{=/z/]}{\tiny{\text{POSS}}} \\ \text{Meaning:} \amp \text{‘belonging to X’} \\ \text{Conditions:} \amp \text{—} \\ \end{array}\tag{5.6.7} \end{equation}

🔗

Note, again, that we do not, strictly speaking, need to mark the boundary, because we can see that {=/z/} is a clitic based on the fact that it attaches to a phrase; we mark the boundary anyway, to draw extra attention to this fact. Note that we do not necessarily have to treat clitics in this way — we will discuss an alternative way in Section 7.4, but since attaching the possessive clitic to a noun phrase results in a unit that behaves like a word, it is a reasonable thing to do. What kind of word would this be? Well, a noun phrase with a possessive clitic behaves just like a possessive determiner (traditionally called “possessive pronoun”): my neighbor’s dog ~ his dog.

🔗

Subsection Backformation

One interesting piece of evidence for the existence of word-formation rules is that language users sometimes apply them backwards. Consider the following word pairs:

🔗

(4a) 🔗: teach ~ teacher

🔗

(4b) 🔗: speak ~ speaker

🔗

(4c) 🔗: read ~ reader

🔗

(4d) 🔗: work ~ worker

🔗

(4e) 🔗: curate ~ curator

🔗

(4f) 🔗: legislate ~ legislator

🔗

(4g) 🔗: gamble ~ gambler

🔗

(4h) 🔗: peddle ~ peddler

🔗

These word pairs suggest the existence of a word-formation rule like the following, which turns a verb into a noun meaning ‘someone who performs the action described by the verb’:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{VERB}}} ~~ \text{‑/əɹ/]}{\tiny{\text{NOUN}}} \\ \text{Meaning:} \amp \text{‘someone who X-es’} \\ \text{Conditions:} \amp \text{—} \\ \end{array}\tag{5.6.8} \end{equation}

🔗

And indeed, such a rule exists, as we can easily see by the fact that it is routinely applied to new verbs entering the language, for example to the verb scam (first documented in 1963) to form scammer (first documented in 1972), to the verb blog (first documented in August 1999) to form blogger (first documented in December 1999), or to podcast to form podcaster (both first documented in September 2004).

🔗

The words in (4a–d), and many others, were formed in exactly the same way in the history of English. However, the words in (4e–f) were not: in these cases, the noun existed before the verb: the noun curator is first documented in 1390 as a borrowing of French curatour, the verb curate is first documented more than 500 years later; the noun legislator is first documented in 1513 as a borrowing of Latin lēgislātor, the verb legislate is first documented 143 years later; the noun gambler is first documented in 1735, probably as an alteration of the word gamester, but the verb gamble is first documented 17 years later; and the noun peddler is first documented in 1307 with uncertain origin, but the verb peddle is first documented 247 years later.

🔗

In all of these cases, a noun existed that looked as though it contained the suffix shown in the WFR above, so at some point speakers assumed that it must have been created by this rule and used it to reconstruct a verbal base that never actually existed!

🔗

Subsection Boundaries in word formation

As mentioned above, it is a good idea to mark the boundary between bases and affixes in WFRs. The reason is that there are two different types of derivational affixes, something we have ignored so far. Consider the words in Table 5.6.2

🔗

Table 5.6.2. Derviatives with {‑ness} and {‑ity}

🔗

	{‑ness}	{‑ity}
/əˈfɛk.tɪv/ effective	/əˈfɛk.tɪv.nəs/	/ə.fəkˈtɪv.ə.ti/
/kəmˈpɛt.ə.t̬ɪv/ competitive	/kəmˈpɛt.ə.t̬ɪv.nəs/	/ˌkɑːm.pəˈtɪˈtɪv.ə.ti/
/ˈɹiː.əl/ real	/ˈɹiː.əl.nəs/	/ɹiˈæl.ə.ti/
/ˈpʌb.lɪk/ public	/ˈpʌb.lɪk.nəs/	/pʌbˈlɪs.ə.ti/
/ɡɹeɪv/ grave	/ɡɹeɪv.nəs/	/ˈɡɹæv.ə.ti/
/ɪnˈtɛns/ intense	/ɪnˈtɛns.nəs/	/ɪnˈtɛn.sə.ti/

The first column contains adjectives, the second column contains nouns derived from these adjectives using the suffix {‑ness} we already know from lawlessness, and the third column contains nouns derived from the same adjectives using the suffix {‑ity}. The word-formation rule for {‑ness} is straightforward, it looks just like the ones we have already seen:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{ADJECTIVE}}} ~~ \text{/nəs/]}{\tiny{\text{NOUN}}} \\ \text{Meaning:} \amp \text{‘the quality of being X’} \\ \text{Conditions:} \amp \text{—} \\ \end{array}\tag{5.6.9} \end{equation}

🔗

We can take any adjective from the first column, apply this rule and get the word in the second column.

🔗

This is not the case for the suffix {-ity}. We can try to posit an analogous rule:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{ADJECTIVE}}} ~~ \text{/ə.ti/]}{\tiny{\text{NOUN}}} \\ \text{Meaning:} \amp \text{‘the quality of being X’} \\ \text{Conditions:} \amp \text{—} \\ \end{array}\tag{5.6.10} \end{equation}

🔗

And in a few cases, this rule will seem to work — for example, if we apply it to /ɪnˈtɛns/ intense, we get /ɪnˈtɛn.sə.ti/ intensity. In most cases, the rule will not work:

🔗

if we apply it to /əˈfɛk.tɪv/ effective, we should get /əˈfɛk.tɪv.ə.ti/, but we get /ə.fəkˈtɪv.ə.ti/ — the stress shifts from the second to the third syllable of /əˈfɛk.tɪv/, and consequently, the vowel /ɛ/ is reduced to a schwa;

🔗
similarly, if we apply it to /kəmˈpɛt.ə.tɪv/, we should get /kəmˈpɛt.ə.tɪv.ə.ti/, but we get /ˌkɑːm.pəˈtɪˈtɪv.ə.ti/ — the stress shifts from the second to the fourth syllable of /kəmˈpɛt.ə.tɪv/, and again, the vowel /ɛ/ is reduced to a schwa;

🔗
if we apply it to /ˈɹiː.əl/, we should get /ˈɹiː.əl.ə.ti/, but we get /ɹiˈæl.ə.ti/ — the stress shifts from the first to the second syllable, the /i/ in the first syllable becomes a schwa and the schwa in the second syllable becomes an /æ/;

🔗
if we apply it to /ɡɹeɪv/, there is no stress shift (there is only one syllable), but the vowel changes from an /eɪ/ to an /æ/.

🔗

🔗

So, {‑ness} leaves its base unchanged, but {‑ity} changes it various ways. Affixes like ‑ness are sometimes referred to as word-boundary affixes, because they respect the word boundary of the base. They can be represented with a hash sign instead of a hyphen: {#ness} or {#/nəs/}. Affixes like {‑ity} are sometimes referred to as formative-boundary (or morpheme-boundary) affixes, because they treat the base as a sequence of morphemes which can be changed in the process of forming a new word. They can be represented with a plus sign instead of a hyphen: {+ity} or {+/.ə.ti/}.

🔗

Typically, formative boundary affixes do not change the base in random ways. If you look at the words in the list above, you might see that there is a system behind it. First the stress of the base always shifts to the final syllable of the base, changing the vowel in that syllable from a schwa to a full vowel if necessary; the vowel in the formerly stressed syllable becomes a schwa, which is predictable, because this is the norm for unstressed syllables. Second, the vowel in the final syllable of the base cannot be a diphthong — if it is, it is changed to the closest monophthong (e.g. from /eɪ/ to /æ/). Finally, the final consonant of the base cannot be /k/ — if it is, this is changed to a /s/.

🔗

This accounts for all changes observable in the list above. If a base already has the required properties — stress on the final syllable, no /k/ at the end —, it remains unchanged, as in /ɪnˈtɛn.sə.ti/. Thus, the word-formation rule must look as follows:

🔗

\begin{equation} \begin{array}{ll} \text{Form:} \amp \text{[X}{\tiny{\text{ADJECTIVE}}} ~~ \text{+/ə.ti/]}{\tiny{\text{NOUN}}} \\ \text{Meaning:} \amp \text{‘the quality of being X’} \\ \text{Conditions:} \amp \text{(a) X must be stressed on the final syllable} \\ \text{} \amp \text{(b) the nucleus of the final syllable of X} \\ \text{} \amp ~~~~~ \text{must be a monophthong} \\ \text{} \amp \text{(c) the final consonant of the base cannot be /k/} \\ \end{array}\tag{5.6.11} \end{equation}

🔗

The fact that the affix is marked as a formative-boundary affix means that it can change the base to meet those conditions that it does not meet.

🔗

Subsection

CC-BY-NC-SA 4.0. Written by Anatol Stefanowitsch

🔗

Prev Top Next