{"id":1151,"date":"2024-11-05T13:06:05","date_gmt":"2024-11-05T11:06:05","guid":{"rendered":"https:\/\/linguistica.info\/b\/lei\/?page_id=1151"},"modified":"2025-07-01T09:25:56","modified_gmt":"2025-07-01T07:25:56","slug":"5-7-compounds","status":"publish","type":"page","link":"https:\/\/linguistica.info\/b\/leiwp\/toc\/5-morphology\/5-7-compounds\/","title":{"rendered":"5.7 Compounds"},"content":{"rendered":"<p>Derivation is a widely-used way of creating new words in human languages, but it is not the only one. As mentioned at the beginning of Section 5.2, we can also combine two (or more) bases, a process called <strong>compounding<\/strong>.<\/p>\n<p>The bases involved can belong to every word-class so the first way in which we can classify basic types of compounds is in terms of the word-class of their constituent parts \u2014 for example, noun-noun compound (<em>lawman<\/em>), adjective-noun compound (<em>blackbird<\/em>), or verb-adjective compound (<em>failsafe<\/em>). Table 5.7.1 gives examples of compounds involving the three major word-classes, but keep in mind that there are also compounds involving adverbs (<em>wellbeing<\/em>, <em>broadcast<\/em>), prepositions (<em>for-profit<\/em>, <em>downstream<\/em>), and even pronouns (<em>me time<\/em>, <em>she-wolf<\/em>).<\/p>\n<table style=\"width: 100%;\">\n<caption>Table 5.7.1 Compounds involving the major word-classes<\/caption>\n<thead>\n<tr>\n<th style=\"width: 15.2255%;\">\u00a0<\/th>\n<th style=\"width: 17.4812%;\">\u00a0<\/th>\n<th style=\"width: 22.5564%;\">SECOND PART<\/th>\n<th style=\"width: 22.7444%;\">\u00a0<\/th>\n<th style=\"width: 21.4285%;\">\u00a0<\/th>\n<\/tr>\n<tr>\n<th style=\"width: 15.2255%;\">\u00a0<\/th>\n<th style=\"width: 17.4812%;\">\u00a0<\/th>\n<th style=\"width: 22.5564%;\">Noun<\/th>\n<th style=\"width: 22.7444%;\">Adjective<\/th>\n<th style=\"width: 21.4285%;\">Verb<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<th style=\"width: 15.2255%;\">FIRST PART<\/th>\n<th style=\"width: 
17.4812%;\">Noun<\/th>\n<td style=\"width: 22.5564%;\"><em>lawman<\/em><br \/>\n<em>ice <\/em><em>cream<\/em><br \/>\n<em>light-year<\/em><br \/>\n<em>sunflower<\/em><br \/>\n<em>quarter-final<\/em><\/td>\n<td style=\"width: 22.7444%;\"><em>lifelong<\/em><br \/>\n<em>trigger-happy<\/em><br \/>\n<em>watertight<\/em><br \/>\n<em>knee-deep<\/em><br \/>\n<em>camera-shy<\/em><\/td>\n<td style=\"width: 21.4285%;\"><em>crowdsource<\/em><br \/>\n<em>water-cool<\/em><br \/>\n<em>lip-read<\/em><br \/>\n<em>sidestep<\/em><br \/>\n<em>steam-clean<\/em><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2255%;\">\u00a0<\/td>\n<th style=\"width: 17.4812%;\">Adjective<\/th>\n<td style=\"width: 22.5564%;\"><em>greenhouse<\/em><br \/>\n<em>blackboard<\/em><br \/>\n<em>small <\/em><em>talk<\/em><br \/>\n<em>black <\/em><em>hole<\/em><br \/>\n<em>solar <\/em><em>system<\/em><\/td>\n<td style=\"width: 22.7444%;\"><em>purebred<\/em><br \/>\n<em>wide-eyed<\/em><br \/>\n<em>kindhearted<\/em><br \/>\n<em>ready-made<\/em><br \/>\n<em>widespread<\/em><\/td>\n<td style=\"width: 21.4285%;\"><em>right-click<\/em><br \/>\n<em>dry-clean<\/em><br \/>\n<em>cold-call<\/em><br \/>\n<em>parallel <\/em><em>park<\/em><br \/>\n<em>whitewash<\/em><\/td>\n<\/tr>\n<tr>\n<td style=\"width: 15.2255%;\">\u00a0<\/td>\n<th style=\"width: 17.4812%;\">Verb<\/th>\n<td style=\"width: 22.5564%;\"><em>think-piece<\/em><br \/>\n<em>kill <\/em><em>switch<\/em><br \/>\n<em>delete <\/em><em>button<\/em><br \/>\n<em>driveshaft<\/em><br \/>\n<em>growshop<\/em><\/td>\n<td style=\"width: 22.7444%;\"><em>fail-safe<\/em><br \/>\n<em>kill-crazy<\/em><br \/>\n<em>sue-happy<\/em><br \/>\n<em>think-aloud<\/em><\/td>\n<td style=\"width: 21.4285%;\"><em>blow-dry<\/em><br \/>\n<em>stir-fry<\/em><br \/>\n<em>tumble-dry<\/em><br \/>\n<em>jump-start<\/em><br \/>\n<em>write-protect<\/em><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Note that not all of these compound-types are equally common \u2014 there are only a handful of cases each of 
established verb-adjective compounds, verb-verb compounds, adjective-verb compounds or adjective-adjective compounds, but hundreds of cases of noun-noun compounds and adjective-noun compounds, so we will focus on these types in what follows (but that does not mean that the other types are not interesting \u2014 on the contrary, there is much less research on them, so in a sense they are <em>more<\/em> interesting!).<\/p>\n<p>Before we continue, however, here is an important reminder concerning spelling: remember that spelling is almost never a useful indication of linguistic structure. In particular, whether a sequence of words is spelled as a single orthographic word, with a hyphen, or with whitespace between the roots has little to do with whether the sequence is a compound. English is different, in this respect, from other Germanic languages, where compounds tend to be spelled as single words, which sometimes leads to the perception that these languages have much longer words than English. 
Take the longest documented actual word from German, shown in (1):<\/p>\n<div class=\"example\">\n<div class=\"number\">(1)<\/div>\n<div class=\"sentence\">Rindfleisch\u00adetikettierungs\u00ad\u00fcberwachungs\u00adaufgaben\u00ad\u00fcbertragungs\u00adgesetz<\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">\u00a0<\/div>\n<div class=\"sentence\">\/\u02c8\u0281\u026ant.fla\u026a\u0283.e.ti.k\u025b\u02ccti\u02d0.\u0281\u028a\u014bs.y\u02d0.b\u0250\u02ccvax.\u028a\u014bs\u02cca\u028af.\u0261a\u02d0b.n\u0329.y\u02d0.b\u0250\u02cct\u0281a\u02d0.\u0261\u028a\u014bs.\u0261\u0259\u02ccz\u025bt\u0361s\/<\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">\u00a0<\/div>\n<div class=\"sentence\">Rind-fleisch-\u00adetikettier-ung-s-\u00ad\u00fcberwach-ung-s-\u00adaufgabe-n-\u00ad\u00fcbertrag-ung-s-\u00adgesetz<\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">\u00a0<\/div>\n<div class=\"sentence\">cattle-meat-label-NOM-FORM-supervise-NOM-FORM-duty-FORM-delegate-NOM-FORM-law<\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">\u00a0<\/div>\n<div class=\"sentence\">\u2018beef labeling supervision duties delegation law\u2019<\/div>\n<\/div>\n<p>It consists of six bases, one of which, <em>Rindfleisch<\/em> \u2018beef\u2019, is itself a compound, and three of which are nouns derived from verbs using the suffix {-<em>ung<\/em>}; in addition, four of the bases contain a special affix referred to as <em>Fugenmorphem<\/em> \u2018gap morpheme\u2019, which sometimes marks words that are part of a compound. Totalling 63 letters (corresponding to 52 phonemes organized into 20 syllables), it seems much longer than any English word could ever be. 
But consider the sentence in (2) from an academic paper, which contains the compound noun shown in (3):<\/p>\n<div class=\"example\">\n<div class=\"number\">(2)<\/div>\n<div class=\"sentence\"><em>The dependent variable was an additive composite rating of applicant reactions to a community college business faculty recruitment advertisement<\/em>.<\/div>\n<\/div>\n<div class=\"example\">\n<div class=\"number\">(3)<\/div>\n<div class=\"sentence\"><em>community college business faculty recruitment advertisement<\/em><\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">\u00a0<\/div>\n<div class=\"sentence\">\/k\u0259\u02c8mju\u02d0.n\u0259.ti\u02cck\u0251\u02d0.l\u026ad\u0292\u02c8b\u026az.n\u026as\u02ccf\u00e6k.\u0259l.ti.r\u026a\u02c8kru\u02d0t.m\u0259nt\u02cc\u00e6d.v\u025a.ta\u026az.m\u0259nt\/<\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">\u00a0<\/div>\n<div class=\"sentence\">commun-ity college busi-ness faculty recruit-ment advert-ise-ment<\/div>\n<\/div>\n<p>The noun in (3) is very similar in length to the German noun in (1): it consists of six bases, three of which are derivatives with one affix (<em>commun-ity<\/em>, <em>busi-ness<\/em>, <em>recruit-ment<\/em>), and one with two affixes (<em>advert-ise-ment<\/em>), totalling 60 characters (counting the spaces), corresponding to 49 phonemes. 
The only difference is that, in English, compounds are often spelled with whitespace between the bases \u2014 but that is an orthographic convention that has nothing to do with morphology (or phonology).<\/p>\n<p>Compounding is a <strong>recursive<\/strong> process \u2014 you can create one or more compounds and then use them in the creation of further compounds, and repeat this process until you get the word you want:<\/p>\n<ol>\n<li>community + college<\/li>\n<li>business + faculty<\/li>\n<li>[community college] + [business faculty]<\/li>\n<li>[ [community college] [business faculty] ] + recruitment<\/li>\n<li>[ [ [community college] [business faculty] ] recruitment] + advertisement<\/li>\n<\/ol>\n<p>So, while English, German and many other languages allow the creation of very long compounds, these compounds are created in a step-wise fashion, with every step combining just two bases.<\/p>\n<p>The most important property of compounds is the relation between these two bases. Normally, this relation can be described in terms of a <strong>head<\/strong>, which determines what general type of entity, process or property the compound refers to and what word class it belongs to, and a <strong>modifier<\/strong>, which narrows this down to a specific type of entity, process or property but does not have an influence on the compound\u2019s word class. Such compounds are sometimes referred to as <strong>endocentric<\/strong> compounds, because their center \u2014 the element that determines their form and meaning \u2014 is contained within the compound (<em>endo-<\/em> means \u201cin(side)\u201d).<\/p>\n<p>In English, the head in endocentric compounds is almost always the last element of the compound. 
For example,<\/p>\n<ul>\n<li><em>greenhouse<\/em> refers to a type of house, not a type of green, and it is a noun (like <em>house<\/em>), not an adjective (like <em>green<\/em>);<\/li>\n<li><em>camera-shy<\/em> refers to a particular type of being shy, not a particular type of camera, and it is an adjective (like <em>shy<\/em>), not a noun (like <em>camera<\/em>);<\/li>\n<li>to <em>lip-read<\/em> refers to a particular type of reading, not to a particular type of lip, and it is a verb (like <em>read<\/em>), not a noun (like <em>lip<\/em>).<\/li>\n<\/ul>\n<div class=\"box\">Describe some of the other compounds in Table 5.7.1 in terms of head and modifier.<\/div>\n<p>There are a few exceptions where the modifier follows the head. Most of these are French loanwords (in French, adjectives mostly follow the noun), such as <em>attorney general<\/em> (the government\u2019s own attorney), <em>court martial<\/em> (a military court), or <em>force majeure<\/em> (a higher force), but some were formed on this model within English, such as <em>president elect<\/em> or <em>sum total<\/em>.<\/p>\n<p>There are also compounds which do not seem to have a head-modifier structure at all:<\/p>\n<div class=\"example\">\n<div class=\"number\">(4a)<\/div>\n<div class=\"sentence\"><em>owner-occupier<\/em>, <em>singer-songwriter<\/em>, <em>spacetime<\/em>, <em>toaster-oven<\/em>, <em>tractor-trailer<\/em><\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">(4b)<\/div>\n<div class=\"sentence\"><em>redhead<\/em>, <em>red-eye<\/em>, <em>scatterbrain<\/em>, <em>paperback<\/em>, <em>skinhead<\/em><\/div>\n<div class=\"break\">\u00a0<\/div>\n<div class=\"number\">(4c)<\/div>\n<div class=\"sentence\"><em>cutthroat<\/em>, <em>killjoy<\/em>, <em>pick-pocket<\/em>, <em>scarecrow<\/em>, <em>turncoat<\/em><\/div>\n<\/div>\n<p>Looking at the compounds in (4a), note that an <em>owner-occupier<\/em> is both the owner and the occupier of their house or apartment, a <em>singer-songwriter<\/em> 
writes <em>and<\/em> sings their songs, <em>spacetime<\/em> is both space and time, and so on. It seems that both parts of the compound are head and modifier at the same time. Such compounds (which are rather rare) are sometimes referred to as <strong>dvandva<\/strong> compounds (<em>dvandva<\/em> means \u2018pair\u2019 in Sanskrit, where such compounds are more common).<\/p>\n<p>Turning to the compounds in (4b), a <em>redhead<\/em> is not a type of <em>head<\/em> (or a shade of red) but a person (typically a woman) with red hair; a <em>red-eye<\/em> is not a type of eye but a flight taken very early in the morning; a <em>scatterbrain<\/em> is not a type of brain but a person who behaves as though their brain was scattered all over the place; and so on. In (4c), likewise, a <em>cutthroat<\/em> is not a type of cut or a type of throat, but a person who cuts other people\u2019s throats; a <em>killjoy<\/em> is a person who kills other people\u2019s joy; a <em>pick-pocket<\/em> is a person who picks other people\u2019s pockets; and so on.<\/p>\n<p>Both of these types of compounds are sometimes called <strong>exocentric compounds<\/strong>, because they do not seem to contain a head that determines the type of entity they refer to and their part of speech. The idea is that their head is somewhere outside of the actual compound \u2014 that, for example, <em>redhead<\/em> has an invisible structure like [[<em>red headed<\/em>] <em>person<\/em>] or <em>cutthroat<\/em> has an invisible structure like [[<em>throat-cutting<\/em>] <em>person<\/em>].<\/p>\n<p>With respect to the compounds in (4b), this analysis is not really useful. Notice that in all cases, the second part of the compound <em>does<\/em> determine the word class, and that in all cases, the first part does seem to modify the meaning of the second part to make it more specific. 
The difference from the more typical compounds shown in Table 5.7.1 is simply that the word as a whole does not literally refer to whatever the second part would refer to, but to a related entity. A <em>redhead<\/em> is a \u2018person with a red head\u2019, a <em>red-eye<\/em> is a \u2018flight that causes red eyes (because you don&#8217;t get enough sleep)\u2019, etc. But this shift in meaning is not specific to compounds; it also occurs with simple words \u2014 we can talk about wanting to hire a few good <em>heads<\/em> or <em>brains<\/em> for a team, meaning, of course, <em>people<\/em> with good heads\/brains, we can talk about a vintage car being the <em>pride and joy<\/em> of its owner, meaning, of course, that it is an \u2018entity causing pride and joy\u2019, and so on. This process is referred to as <strong>metonymy<\/strong>, and the compounds in (4b) are plainly and simply endocentric compounds that are used metonymically.<\/p>\n<p>The compounds in (4c) could be called exocentric, if we really wanted to. In these cases, the second part of the compound really does not seem to be a head: it does not specify what type of entity the compound refers to, the first part of the compound does not seem to modify the second part, and the second part does not determine the word-class of the compound. However, the idea that these compounds have a head somewhere outside of the word itself is a bit weird, and we would have to be able to say what that head is. 
It could be a phonologically empty noun with the meaning \u2018person\u2019, so that the structure of <em>cutthroat<\/em> would be [cutthroat \u00d8<sub>NOUN<\/sub>], but we already talked about the dangers of positing phonologically empty elements, and this structure would still not explain how the element\u00a0<em>cutthroat<\/em> itself is formed \u2014 normally, in English, compounds consisting of a verb and a noun functioning semantically as its object have the form [N V], as in\u00a0<em>baby-sit<\/em>,\u00a0<em>trouble-shoot<\/em>,\u00a0<em>back-stab<\/em>, etc.<\/p>\n<p>Instead, we could simply treat the different types of compounds as resulting from different word-formation rules. Unlike the WFRs for derivational and inflectional affixes, such rules would not include any morphemes, but this is not a problem: we have already seen that it is possible to have WFRs without any morphemes in the case of conversion.<\/p>\n<p>For \u201cendocentric\u201d noun-noun compounds, this rule could look like this (similar rules can be posited for the other types of \u201cendocentric\u201d compounds):<\/p>\n<table>\n<tbody>\n<tr>\n<td>Form:<\/td>\n<td>[X<sub>NOUN<\/sub> Y<sub>NOUN<\/sub>]<sub>NOUN<\/sub><\/td>\n<\/tr>\n<tr>\n<td>Meaning:<\/td>\n<td>\u2018Y characterized by X\u2019<\/td>\n<\/tr>\n<tr>\n<td>Conditions:<\/td>\n<td>\u2014<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>This WFR only assigns a very general meaning to such compounds: the modifier somehow characterizes the head. The specific relationship is left open \u2014 if speakers create a novel compound, the relationship is determined by context; in conventionalized compounds, it is determined by the past usage of the word. 
For example, <em>lawman<\/em> refers to a sheriff or marshal, but theoretically, it could refer to any man with a relationship to the law, such as a judge or lawyer or a politician who passes laws; <em>sunflower<\/em> refers to a flower that looks a bit like a child\u2019s drawing of a sun and that turns its head to the sun at all times of the day, but it could also refer to a flower growing on the surface of the sun (in a world where that were possible), or a flower that needs sunlight (i.e., any flower). Thus, like the WFRs for derivational affixes we saw in the preceding section, the WFRs for compounds only determine the range of possible meanings, but once a word is established, its meaning will not cover the whole range but be much more specific.<\/p>\n<p>For dvandva compounds, the WFR would look like this:<\/p>\n<table>\n<tbody>\n<tr>\n<td>Form:<\/td>\n<td>[X<sub>NOUN<\/sub> Y<sub>NOUN<\/sub>]<sub>NOUN<\/sub><\/td>\n<\/tr>\n<tr>\n<td>Meaning:<\/td>\n<td>\u2018an entity that is both X and Y\u2019<\/td>\n<\/tr>\n<tr>\n<td>Conditions:<\/td>\n<td>\u2014<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>And for the <em>cutthroat<\/em> type of \u201cexocentric\u201d compound, the WFR would look like this:<\/p>\n<table>\n<tbody>\n<tr>\n<td>Form:<\/td>\n<td>[X<sub>VERB<\/sub> Y<sub>NOUN<\/sub>]<sub>NOUN<\/sub><\/td>\n<\/tr>\n<tr>\n<td>Meaning:<\/td>\n<td>\u2018an entity that does X to Y\u2019<\/td>\n<\/tr>\n<tr>\n<td>Conditions:<\/td>\n<td>\u2014<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Unlike the first two WFRs, however, this last rule does not seem to exist in current English: all compounds of this type are fairly old and it does not seem possible to form new words based on it. 
For example, a bus driver is not a *<em>drive-bus<\/em>, a math teacher is not a\u00a0*<em>teach-math<\/em>, a lawn mower is not a\u00a0*<em>mow-lawn<\/em>, a drug dealer is not a *<em>deal-drug<\/em>, etc.<\/p>\n<p class=\"authshp\">CC-BY-NC-SA 4.0, Written by Anatol Stefanowitsch<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Derivation is a widely-used way of creating new words in human languages, but it is not the only one. As mentioned at the beginning of Section 5.2, we can also combine two (or more) bases, a process called compounding. The bases involved can belong to every word-class so the first way in which we can 
[&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":1080,"menu_order":7,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-1151","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/1151","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/comments?post=1151"}],"version-history":[{"count":26,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/1151\/revisions"}],"predecessor-version":[{"id":2188,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/1151\/revisions\/2188"}],"up":[{"embeddable":true,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/1080"}],"wp:attachment":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/media?parent=1151"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}