{"id":862,"date":"2024-10-20T18:29:17","date_gmt":"2024-10-20T16:29:17","guid":{"rendered":"https:\/\/linguistica.info\/b\/lei\/?page_id=862"},"modified":"2025-07-01T07:38:02","modified_gmt":"2025-07-01T05:38:02","slug":"4-3-a-closer-look-at-allophones","status":"publish","type":"page","link":"https:\/\/linguistica.info\/b\/leiwp\/toc\/4-phonology-2\/4-3-a-closer-look-at-allophones\/","title":{"rendered":"4.3 A closer look at allophones"},"content":{"rendered":"<p>To recapitulate: if two phones are in contrastive distribution \u2014 i.e., if they can occur in the same phonetic environment \u2014 our brains will categorize them as belonging to different categories, called <strong>phonemes<\/strong>. This allows speakers to use them in creating distinctive forms that can be associated with different meanings. Speakers typically <em>do<\/em> use them in this way, which is why finding a pair of words that differ by the phones in a single position (a <strong>minimal pair<\/strong>) is clear evidence that the two belong to different phonemes. We saw this, for example, with the phones [p] and [b], which can both occur in the environment [ _\u025b\u0279], resulting in the words <em>bear<\/em> and <em>pear<\/em>. We discussed the fact that cases where there is only a single minimal pair may be a bit problematic, but in the case of [p] and [b], there are many such pairs, so there is no question that they belong to different phonemes: \/p\/ and \/b\/.<\/p>\n<div class=\"box\">Find at least ten minimal pairs for [p] and [b].<\/div>\n<p>We also saw that there are phones that differ in their articulatory and acoustic properties, but that our brain categorizes as belonging to the same category, for example, the phone [p\u02b0] in <em>pear<\/em> and the phone [p] in <em>spare<\/em>. 
Let us take a closer look at these.<\/p>\n<p>The reason why our brain places them in the same category \u2014 why they sound \u201cthe same\u201d to us, unless we focus our attention on them \u2014 is that they cannot occur in the same phonetic environments, so that they cannot be used to distinguish different words. Look at the following set of representative words containing the two phones:<\/p>\n<ol>\n<li>[p\u02b0\u026an] <em>pin<\/em><\/li>\n<li>[p\u02b0\u0254\u0279t] <em>port<\/em><\/li>\n<li>[l\u026ap] <em>lip<\/em><\/li>\n<li>[t\u0283\u025d\u02d0p] <em>chirp<\/em><\/li>\n<li>[p\u02b0le\u026as] <em>place<\/em><\/li>\n<li>[spl\u00e6\u0283] <em>splash<\/em><\/li>\n<li>[\u0261l\u026amps] <em>glimpse<\/em><\/li>\n<li>[k\u0279\u026apt] <em>crypt<\/em><\/li>\n<li>[\u02c8\u0279\u00e6.p\u026ad] <em>rapid<\/em><\/li>\n<li>[\u0259\u02c8p\u02b0i\u02d0l] <em>appeal<\/em><\/li>\n<\/ol>\n<p>Looking at the first four words, we see a pattern: [p\u02b0] occurs at the beginning of words before a vowel, [p] occurs at the end of words after a vowel. The next two words suggest that [p\u02b0] also occurs at the beginning of words before a consonant, but [p] occurs before a consonant if there is another consonant preceding it. The seventh word (<em>glimpse<\/em>) confirms that [p] occurs before a consonant if it is preceded by a consonant; the eighth word (<em>crypt<\/em>) shows that this is also the case if it is preceded by a vowel. Now we notice that all words so far are monosyllabic, so we might try to summarize our observations in terms of the following hypothesis: [p\u02b0] occurs as the first element of a syllable onset, [p] occurs anywhere else (i.e., as the second element of the onset or as any element in the coda). The ninth word (<em>rapid<\/em>) seems to contradict this hypothesis: [p] occurs as the first element of a syllable onset. In contrast, the tenth word (<em>appeal<\/em>) follows our prediction, with [p\u02b0] occurring as the first element of a syllable onset. 
We now notice that in the ninth word, which contradicted our hypothesis, the syllable containing the plosive is unstressed, while in the tenth word, it is stressed. In the first, second and fifth words, where [p\u02b0] also occurs, the syllable is stressed as well: these words are monosyllabic, so there is no other syllable that could carry the stress. We can reformulate our hypothesis: [p\u02b0] occurs as the first element of the onset of a stressed syllable, [p] occurs anywhere else.<\/p>\n<p>Thus, not only do the two phones never occur in the same phonetic environments, but we can even predict, using a very simple generalization, where each of them will occur. It is this predictability that allows our brains to ignore the difference and place the two phones in the same category, treating them as different manifestations of the same entity. We call such manifestations <strong>allophones<\/strong> of the same phoneme.<\/p>\n<p>Situations where seemingly different entities are actually different manifestations of the same entity are not restricted to language \u2014 an example from the physical world would be the substances we call <em>water<\/em>, <em>steam<\/em> and\u00a0<em>ice<\/em><!-- Example first used by James Paul Gee\nin An Introduction to Human Language (1993) -->. Although they have different properties \u2014 water is liquid, steam is gaseous, ice is solid \u2014 they are just different manifestations of the inorganic compound described by the chemical formula H<sub>2<\/sub>O. 
As in the case of allophones, we can predict which form it will take in which environment (at normal atmospheric pressure): if the temperature is at or below zero degrees Celsius (32 degrees Fahrenheit, 273.15 kelvins), it will appear as ice; if the temperature is at or above 100 degrees Celsius (212 degrees Fahrenheit, 373.15 kelvins), it will appear as steam; and in all other cases it will appear as water!<\/p>\n<p>One very widespread notation for capturing the relationship of allophones to a phoneme is the <strong>phonological rule<\/strong>. Such rules are typically represented in the following format, where P stands for <em>phoneme<\/em>, A stands for <em>allophone<\/em> and E stands for <em>(phonetic) environment<\/em>, with the underscore showing the position of the allophone:<\/p>\n<table style=\"border: none!important; width: fit-content!important; font-size: 1em; color: black;\">\n<tbody>\n<tr>\n<td>\/P\/<\/td>\n<td>\u2192<\/td>\n<td>A \/ E _ E<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Before we apply it to the distribution of [p] and [p\u02b0], let us apply it to our example of H<sub>2<\/sub>O. 
Using the format of the phonological rule, the predictions about the form of this substance can be represented as follows:<\/p>\n<table style=\"border: none!important; width: fit-content!important; font-size: 1em; color: black;\">\n<tbody>\n<tr>\n<td>\/H<sub>2<\/sub>O\/<\/td>\n<td>\u2192<\/td>\n<td>[ice] \/ __ \u2264 0\u00b0C<\/td>\n<\/tr>\n<tr>\n<td>&nbsp;<\/td>\n<td>\u2192<\/td>\n<td>[steam] \/ __ \u2265 100\u00b0C<\/td>\n<\/tr>\n<tr>\n<td>&nbsp;<\/td>\n<td>\u2192<\/td>\n<td>[water] \/ elsewhere<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>Note that we are using the word\u00a0<em>elsewhere<\/em> for one of the conditions: since the first two conditions only leave the range between 0\u00b0C and 100\u00b0C, we do not have to describe this range more precisely.<\/p>\n<p>To write a phonological rule corresponding to our generalization about the distribution of [p] and [p\u02b0], we have to solve a problem that does not arise when talking about H<sub>2<\/sub>O \u2014 we need to determine a way of representing the abstract phoneme. In the case of chemical compounds, we can specify the molecular structure, but in the case of language, there is no equivalent for this.<\/p>\n<p>Theoretically, we could represent the phoneme using an arbitrary label, for example, PHON230216. But it is customary in phonology to use one of the allophones between slashes, as we did in the <a href=\"https:\/\/linguistica.info\/b\/lei\/toc\/4-phonology-2\/4-2-a-closer-look-at-phonemes\/\">previous section<\/a>. In order to do so, we need to decide which of the two allophones to choose to represent the phoneme. Again: neither of them <em>is<\/em> the phoneme, as the phoneme is an abstract category. There are two principles that can help us decide: a) we should choose the allophone with the least restricted distribution, b) we should choose the \u201csimplest\u201d allophone, i.e., the one characterizable with the fewest phonetic features. 
In the case of the aspirated and non-aspirated voiceless bilabial plosive in English, both criteria point to [p]: it occurs in all phonetic environments except one (at the beginning of the onset of a stressed syllable), and it has one feature fewer than [p\u02b0] (which is aspirated in addition to being voiceless, bilabial and plosive).<\/p>\n<p>Thus, the phonological rule corresponding to our generalization would look something like this (the dollar sign $ is a common way of representing syllable boundaries, and we are using the word <em>elsewhere<\/em> as we did above, as a shorthand for \u201call other environments\u201d):<\/p>\n<table style=\"border: none!important; width: fit-content!important; font-size: 1em; color: black;\">\n<tbody>\n<tr>\n<td>\/p\/<\/td>\n<td>\u2192<\/td>\n<td>[p\u02b0] \/ &#8216;$ _ (C)V\u2026<\/td>\n<\/tr>\n<tr>\n<td>&nbsp;<\/td>\n<td>\u2192<\/td>\n<td>[p] \/ elsewhere<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>The apostrophe before the dollar sign indicates that the syllable must be stressed, the underscore following the dollar sign represents the first segment of the onset, the C in parentheses indicates that there can be an optional second consonant, and the V stands for the nucleus that follows. The notations we use to capture the different aspects of the rule are not standardized in linguistics; they depend on the particular model of linguistics a researcher uses and\/or on the level of detail they want to capture.<\/p>\n<p>Things are not always this straightforward, however. Let us look at another example, the phenomenon of \u201cCanadian raising\u201d discussed in <a href=\"https:\/\/linguistica.info\/b\/lei\/toc\/3-phonetics\/3-6-the-international-phonetic-alphabet\/\">Section 3.6<\/a>. Recall that in Canadian English, the vowel in <em>pride<\/em> is [a\u026a], but in <em>price<\/em>, it is [\u028c\u026a], and likewise, the vowel in <em>cloud<\/em> is [a\u028a] but in <em>mouth<\/em>, it is [\u028c\u028a]. 
This, too, is predictable: note that in the words <em>pride<\/em> and <em>cloud<\/em>, the consonant following the diphthong is voiced, while in <em>price<\/em> and <em>mouth<\/em>, it is voiceless. This is not an accident. The realization of the diphthongs in question depends precisely on this distinction: [a\u026a] and [a\u028a] occur before voiced consonants, [\u028c\u026a] and [\u028c\u028a] before voiceless consonants.<\/p>\n<p>In principle, this generalization can easily be captured in a phonological rule. But which of the two allophones do we choose to represent the phoneme? Neither of them is more widely distributed than the other, and they both have the same number of features. Thus, both of the following representations are equally plausible (V theoretically stands for any vowel, but [\u026a] and [\u028a] are the only vowels occurring as the second part of a diphthong following [a]\/[\u028c] in Canadian English, so we can use V as a shorthand for these two vowels here):<\/p>\n<table style=\"width: 100%; height: 48px;\" border=\"0\">\n<tbody>\n<tr style=\"height: 24px;\">\n<td style=\"height: 24px; width: 8.96%;\">\/a\u0361V\/<\/td>\n<td style=\"height: 24px; width: 7.84%;\">\u2192<\/td>\n<td style=\"height: 24px; width: 83.04%;\">[\u028c\u0361V] \/ _ [voiceless]<\/td>\n<\/tr>\n<tr style=\"height: 24px;\">\n<td style=\"height: 24px; width: 8.96%;\">\u00a0<\/td>\n<td style=\"height: 24px; width: 7.84%;\">\u2192<\/td>\n<td style=\"height: 24px; width: 83.04%;\">[a\u0361V] \/ _ [voiced]<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<p>or<\/p>\n<table style=\"width: 100%;\">\n<tbody>\n<tr>\n<td style=\"width: 8.96%;\">\/\u028c\u0361V\/<\/td>\n<td style=\"width: 7.84%;\">\u2192<\/td>\n<td style=\"width: 83.04%;\">[a\u0361V] \/ _ [voiced]<\/td>\n<\/tr>\n<tr>\n<td style=\"width: 8.96%;\">\u00a0<\/td>\n<td style=\"width: 7.84%;\">\u2192<\/td>\n<td style=\"width: 83.04%;\">[\u028c\u0361V] \/ _ [voiceless]<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<!-- newly written by AS -->\n<p>Recall that it 
doesn&#8217;t <em>really<\/em> matter, as phonemes are abstract categories. But it would be nice to make a principled decision. The term \u201cCanadian raising\u201d suggests that [a\u0361V] is considered the more basic variant, which is \u201craised\u201d to [\u028c\u0361V] under certain conditions. But that is because [a] is the phone occurring in all environments in other North American varieties \u2014 [\u028c] is raised <em>in comparison to other varieties<\/em>. There may be one reason to prefer the first of the two rules: outside of the diphthongs, [a] is the more frequent vowel, occurring in many more words than [\u028c].<\/p>\n<div class=\"box\">Pronounce the words <em>keen<\/em>, <em>kill<\/em>, <em>case<\/em> and <em>cat<\/em>, and then the words <em>cool<\/em>, <em>con<\/em>, <em>coast<\/em> and <em>call<\/em>. Pay attention to where exactly the back of your tongue touches the upper part of your mouth when articulating the initial \/k\/ (check Figure 3.2.1 in <a href=\"https:\/\/linguistica.info\/b\/lei\/toc\/3-phonetics\/3-2-speech-articulators\/\">Section 3.2<\/a> to remind yourself of the labels for the different regions). You should feel that in the first set of words, the place of contact is the hard palate, while in the second set of words, it is the velum. 
Write a phonological rule to capture this distribution of allophones.<\/div>\n<div>\u00a0<\/div>\n<p><span class=\"nav-previous\"><a href=\"https:\/\/linguistica.info\/b\/lei\/toc\/4-phonology-2\/4-2-a-closer-look-at-phonemes\/\" rel=\"prev\"><span class=\"meta-nav\">\u2190<\/span> Previous section<\/a><\/span> <span class=\"nav-next\"><a href=\"https:\/\/linguistica.info\/b\/lei\/toc\/4-phonology-2\/4-4-motivations-and-limits-of-allophony\/\" rel=\"next\">Next section <span class=\"meta-nav\">\u2192<\/span><\/a><\/span><\/p>\n<p class=\"authshp\">CC-BY-NC-SA 4.0, Written by Anatol Stefanowitsch<\/p>\n","protected":false},"excerpt":{"rendered":"<p>To recapitulate: if two phones are in contrastive distribution \u2014 i.e., if they can occur in the same phonetic environment \u2014 our brains will categorize them as belonging to different categories, called phonemes. This allows speakers to use them in creating distinctive forms that can be associated with different meanings. Speakers typically do use them 
[&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"parent":825,"menu_order":3,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-862","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/862","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/comments?post=862"}],"version-history":[{"count":35,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/862\/revisions"}],"predecessor-version":[{"id":2183,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/862\/revisions\/2183"}],"up":[{"embeddable":true,"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/pages\/825"}],"wp:attachment":[{"href":"https:\/\/linguistica.info\/b\/leiwp\/wp-json\/wp\/v2\/media?parent=862"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}