Language Evolution: The Little Lambs Who Lost Their Way: Lexical Exceptions

29 January 2013

Consider the following Old English words: gān ‘gone’, clāþ ‘cloth’, brād ‘broad’. They belonged to the same lexical set as OE gāt ‘goat’, and we would expect them to have evolved like the rest of the GOAT set, since they do not share any characteristic subregularities with any recognised “minority flock”. Even the spelling of gone and broad (similar to that used in stone and goat, respectively) suggests that they were still members of the GOAT set at the time when the modern orthographic conventions were becoming fixed. And yet they have parted company with other words containing OE ā. Broad has joined the CAUGHT set (with Modern English /ɔː/, as in cause), while the other two vary between CAUGHT and LOT (Modern English /ɒ/ or its unrounded counterpart /ɑ/ as in American dialects). Note also that while OE sc(e)ān ‘shone’ yields the expected outcome /ʃoʊn/ in America, the normative British pronunciation is /ʃɒn/, with a shortened vowel.

Such cases are truly irregular and call for individual explanation. We know that the shortening of the vowel of clāþ cannot date back to Old English (OE “claþ” would have become Modern English “clath”). OE ā produced a mid-low rounded vowel /ɔː/ (conventionally spelt ǭ to distinguish it from other O spellings) after the Norman Conquest, during the Middle English period. Indeed, the word was very often spelt clothe, clooth or cloothe in Middle English, apparently indicating a long-vowel pronunciation. Note that the OE plural clāþas has normally developed into Mod.E clothes, with /oʊ/ (the th may be mute, but that is another story). Today, however, clothes is no longer regarded as the plural of cloth, but rather as an independent collective noun (a case of word duplication!). The distribution of the modern pronunciations of cloth points to an early shortening of Middle English /ɔː/, as a result of which the word joined the LOT set. Then, in some (but not all) mainstream accents of Modern English, the short vowel was affected by the lengthening heard in moss, cost, lost, frost, moth, often, off, cough, etc., induced by the following voiceless fricative.

The development of broad must have been different, since the word does not show a short vowel in any major accent, and the final consonant is not a voiceless fricative. When the Great Vowel Shift of the 15th century transformed ME /ɔː/ into Early Modern English /oː/ (diphthongised to /oʊ/ in most contemporary varieties of English), one stray sheep left the flock as its vowel underwent an irregular lowering (for reasons that elude us). That lowered pronunciation merged with the new /ɔː/ that resulted from the smoothing of the diphthong /aʊ/ after the Great Vowel Shift (in such words as daughter, caught, law, cause, and drawn).

Gonna be gone

Perhaps there was another sheep of the same contrary disposition, since the long vowel of gone in the accents that rhyme it with drawn is best explained in the same way. Why do we find /gɒn/ ~ /gɑn/ as well? It’s hard to say at which historical stage the shortened variant originated. It could have appeared before the Great Vowel Shift, immediately after it, or still later, with the same result. It is quite possible that it has arisen many times. It is worth observing that high-frequency verbs often display irregular phonetic simplification, possibly because sloppy pronunciations are easier to tolerate in words more or less predictable from the context. Note the similarly unexpected short vowel of says and said, does and done, as well as been (pronounced like bin in American English). Been, said, does, done, says, and gone (in that order) are all among the 500 most frequently occurring English word-forms.

I will return to this interesting correlation between frequency of use and erratic behaviour (which usually consists in some kind of phonological erosion – the shortening, reduction or loss of speech segments).

35 comments:

Podpora społeczeństwa, 10 February 2013 at 22:37
Re frequency: a great champion thereof has been (still is?) the Polish linguist Witold Mańczak, whose 'cela est dû à la fréquence' (he wrote mostly in French) was a 'caeterum censeo'. He was also an admirer of Zipf. Are you familiar with his work?
Piotr Gąsiorowski, 10 February 2013 at 23:10
I am. I also have the honour to have met Prof. Mańczak on several occasions. I disagree with him on many things. As regards sound change, he places an almost exclusive emphasis on frequency effects, and makes his position unnecessarily dogmatic. But there is a sound core in his views, and I'm convinced the significance of the Zipf distribution is often insufficiently appreciated.
Podpora społeczeństwa, 11 February 2013 at 11:36
'It is worth observing that high-frequency verbs often display irregular phonetic simplification, possibly because sloppy pronunciations are easier to tolerate in words more or less predictable from the context'.

I know at least one verb for which the above does not work (for me): can/can't as pronounced by Americans: I tend to hear both as something like 'cayn' or 'cairn'. I often have to ask: pardon, you are saying you can or you can't (cahn't)? Maybe the contexts are not always sufficiently predictable?

Frequency often issues in simplification or 'erosion' but sometimes in complication, too, such as with the initial w- in 'one' (wun), I suppose. Often, too, complex forms are remembered and used precisely "dû à la fréquence", as Mańczak would have said, for instance we say 'am, are, is', not 'be, bees'.
Piotr Gąsiorowski, 11 February 2013 at 12:16
Being a frequently occurring form is a mixed blessing. On the one hand, it protects a word from lexical replacement. On the other, it produces more opportunity for "mutations", especially those giving rise to weak forms. In a nutshell: live longer, change faster.

By the way, the conjugation of "to be" is so complex because it is suppletive. No fewer than four originally different Indo-European verb roots have contributed to the creation of this Frankenstein monster. In Old English, there were still two competing infinitives with two competing present-tense paradigms:

eom, eart, is; sind(on)

versus

bēo, bist, biþ; bēoþ
Podpora społeczeństwa, 11 February 2013 at 13:01
Conjugation of 'be' suppletive.

Yessir, it very much is. So in Polish, by-, es- s-, in Latin fu- (the same as by- in Pol.), es-, s-.

Frequency helps suppletive verbs to survive in speakers' memory, e.g. 'am' etc., but it did not help in Afrikaans (just 'is' for all persons, as compared with ben, zijt, is, zijn, zijt, zijn of older Dutch) or in Modern Polish, jeśm (am) and jeś (art) got replaced with analogical formations from jest, (jestem, jesteś), not unlike in Lithuanian where 'regular' forms esu, esi replaced the old ones esmi, essi. Something similar in Modern Persian, hastam, hasti, hast.
Piotr Gąsiorowski, 11 February 2013 at 14:40
The Modern Polish present-tense pattern is not so much suppletive as partly "diploid": except in the 3sg./pl., the finite forms of 'to be' consist of jest- (treated as if it were a root) plus personal endings which are, historically, enclitic forms of 'to be'!

jest + jeśm' 'I am' → jest-em 'I am'

The model for this must have been the periphrastic past tense (= the "Slavic perfect"), which consisted of the participle był plus the enclitic auxiliary 'to be':

był + jeśm' → był-em 'I was'

Since in the third person the auxiliary came to be dropped, the bare participle był/byli was reinterpreted as a finite verb. The proportional relation:

był : był-em :: jest : X

produced a new form of the present tense: X = jest-em. The only forms retained from pre-Polish times are those of the third person: sg. jest, pl. są. Of course they occur far more frequently than others, which may explain their survival.
Podpora społeczeństwa, 11 February 2013 at 19:42
I was saying: the Modern Polish 'be' _is no longer_ suppletive _despite_ frequency.

But how come the pattern---you describe above---applicable till then and till now only to past-tense forms has been used with, or rather for, a present-tense form? Also, not perfectly, not consistently, for we do not say *jestam or *jestom when we are feminine or gender-neutral beings, like we say byłam or byłom for 'I was'. It's a conundrum to me.

In Greek, a rather archaic language, they say, it's 'eisi', an analogical formation from es-, for the usual s- in the 3rd pers. plural (są, sont, sind(on), sunt, what not...). This does not put me off, coz es- is on the whole far more frequent than s-. But why did in so many other languages the s- stem survive, in the 3rd person plural?
Even if this is all 'dû à la fréquence', we must ask ourselves: fréquence de quoi? Which forms, being more frequent (than which?), have prevailed here?
Piotr Gąsiorowski, 11 February 2013 at 20:25
The -a/o- in the past tense is the gender suffix of the participle (był/była/było). Likewise in the plural: byli/były. The -e- of the masculine, however, was originally the vowel of the auxiliary (retained when the auxiliary was added to a consonant-final word-form):

'you were'
był-eś (masculine)
była-ś (feminine)

In the present tense, the neo-root jest- was genderless, so the prop vowel of the suffix is invariably -e-:

'you are'
jest-eś (not marked for gender)

What is a little surprising is that jest was used in the plural despite the fact that the 3pl. form was są. But we have ample textual evidence of forms like sąście ~ -ście są in older Polish. It's clear that jest- and są- competed in the 1/2pl., and jest- eventually won.
Piotr Gąsiorowski, 11 February 2013 at 20:37
P.S. Greek eisi is not analogical. It's the normal development of PIE *h1s-énti > *ehensi (cf. Myc. e-e-si).

The PIE verb *h1es- had a root present which, like other typical root presents, had a shift of accent from the root in the singular to the ending in the plural:

3.sg. *h1és-ti, 3pl. *h1s-énti

In most groups the initial *h1 was simply dropped, but Greek regularly vocalised it as a "prothetic vowel".
Podpora społeczeństwa, 12 February 2013 at 21:26
'Note also that while OE sc(e)ān ‘shone’ yields the expected outcome /ʃoʊn/ in America, the normative British pronunciation is /ʃɒn/, with a shortened vowel.'

I have always thought that 'shown' for 'shone' in the US is a spelling pronunciation, like so many American peculiarities... The word looks like it ought to be read 'shown'... Is there any evidence to the contrary? Frequency must have something to do with it, for how often do we have an opportunity to say that something or other shone? I personally -- not very often, alas...
Podpora społeczeństwa, 12 February 2013 at 21:47
'The American pronunciation is in fact the expected regular outcome of OE sċān.'

I know. But has it in fact come out of it, or is this a case of a happy coincidence of how the word 'must have sounded' by its origin and how it 'ought to be pronounced' by its spelling? For 'gone' I'd say: dû à la fréquence...
John Cowan, 16 March 2013 at 02:26
In Australian English, gone has unique phonetics: it is usually /gɒːn/, with a lengthened /ɒ/ vowel not found in any other word.