Language Evolution: Yesterday’s Words – Today’s Morphemes

22 February 2013

Yesterday’s Words – Today’s Morphemes – Tomorrow’s Segments

The final /θ/ of filth no longer plays any useful morphological function. It has become fused with its derivational base into an indivisible whole. This is quite often the terminal stage in the life-cycles of linguistic replicators. Old English -þ- was still a morpheme, but it had already lost most of its phonological substance. A few hundred years earlier, in Proto-Germanic, its ancestral form had been *-iþō, continuing still earlier (pre-Germanic) *-étā. A linguistic entity that used to be a suffix of some length has ended up as phonological raw material. It means nothing by itself and has degenerated into a speech sound which, together with three others, encodes a meaning (or rather a cluster of meanings) but is no different, as far as its status is concerned, from the final /m/ of film.

Whole words may become reduced to the role of ‘bound’ (non-independent) morphological elements. Many derivational affixes used to be words which, through being frequently used in composition, survived in that function while their free-standing variant went extinct. Old English hād meant ‘person, social status’. When added to a noun it meant ‘the state or condition of being an X’. Hence, for example, OE ċild-hād ‘infancy, childhood’. The word hād > hǭd lingered on in Middle English, but seems to have become rare by the thirteenth century and eventually died out as an independent word. Curiously, in modern ‘gangsta’ slang hood (no connection with hood = ‘head covering’) is used as an abbreviation of neighbourhood. It has become a word again, though with a brand new meaning.

Words have fractal-like properties: the more closely you look at them, the more structure they reveal

When such reduction and fusion processes have operated for millennia, they may compact a whole string of morphemes into a short word without any visible internal structure. If you look at young /jʌŋ/ today, it’s short even for an English word. In the reconstructed remote ancestor of English, the Proto-Indo-European language, it looked roughly like this: *h₂ju-h₃n̥-ḱó-s. The first element, *h₂ju-, was the compositional variant of the noun *h₂óju ‘vitality, youthful vigour’; the second was a suffix (possibly derived from an independent word) meaning ‘having, loaded with’. Together they formed the noun *h₂jú-h₃on- meaning ‘energetic young man’ (literally: ‘having the strength of young age’, cf. Skt. yúvan-). The addition of the suffix *-ḱó- produced an adjective with the meaning ‘like a young man, juvenile’. We find its reflexes for example in Sanskrit (yuvaśá-), Latin (iuvencus), Welsh (ieuanc ~ ifanc), and of course in the Germanic languages (PGmc. *jungaz > OE ġeong ~ iung [juŋg] > young). In other words, the /jʌ/ part of young is what has remained of a once independent noun, and the /ŋ/ represents two concatenated morphemes compressed into a single segment. Incidentally, *h₂óju is a very interesting item in the Proto-Indo-European lexicon, and I hope to return to it soon.

26 comments:

Mikołaj, 23 February 2013 at 21:38
Has *-étā survived in Polish?
Mikołaj, 23 February 2013 at 22:16
I started to look for the answer and I have found:
Pl. jesieć (dial.) ‘grain sieve’; osieć (E. dial.) ‘granary’; jesiótka (dial.) ‘grain sieve’; osiótka (W dial.) ‘granary’
OHG egida f. ‘harrow’; OE eg(e)þe f. ‘harrow’;
Piotr Gąsiorowski, 23 February 2013 at 22:34
Slavic has a very similar formation in *-ota: *vysota 'height', *širota 'width', *dьlgota 'length', *glupota 'stupidity', etc. It must be the same thing. I'm not sure how to explain the o-grade in Slavic. It seems to me that the suffix originated as the combination of the thematic vowel *-e/o- with the actual abstract suffix *-tah₂. Since *o was the generalised colour of the thematic vowel in adjectives before case endings, it may have affected their derivatives too: *glupo-ta, etc. Compare the adjective *vyso-kъ 'high', in which *-kъ is a secondary extension (absent from the comparative *vyše).
Hans, 24 February 2013 at 11:53
Can we be sure that PGmc. *-iþō continues pre-Germanic *-étā and not a suffix *-ítā that had been abstracted from i-stems?
Piotr Gąsiorowski, 24 February 2013 at 16:17
*-tah₂(t)- abstracts were normally derived from adjectival stems, so we get e.g. Gk. barú-tēs 'heaviness', Skt. vasú-tā 'wealthiness' (from u-stems). It's hard to say what happened to the thematic vowel in such derivatives, since we have conflicting or ambiguous evidence. For example, Germanic *-iþō points to *-i- or *-e-, forms like Gk. neó-tēt- 'youth' and Slavic *-o-ta point to *-o-, Sanskrit -á-tā(t)- is compatible with either *-e- or *-o-, and Latin -i-tāt- (-ie-tāt- from *-io- stems) is compatible with just about any short vowel. I don't think a transfer from i-stems is likely, because i-stem adjectives were vanishingly rare in PIE. But *i is also a frequent alternant of the thematic vowel e.g. in the complex suffix *-i-ko- and several other formations. The allomorphy of stem-final *-e/o/i- is still poorly understood. I prefer *-e- in the ancestor of Germanic for two reasons: (1) there is no positive evidence for a high vowel in other branches (Latin is ambiguous); (2) it seems to me that *-i- was more likely to replace thematic *-e/o- if it wasn't accented. I may be wrong, of course.
Octavià Alexandre, 25 February 2013 at 17:26
The meaning of IE *h₂oju- can be more accurately described as 'vital force' > 'lifetime'. Notice also the combinatory form of this word is actually *h₂ju-h₃-, which gives an ablauting pattern *jeu- ~ *jou- in Baltic and Celtic, hence the traditional reconstruction *(h₂)jeu-. Thus your own reconstruction is rather innovative.

I agree this is a very interesting item, and I'd link it (at a macro-comparative or supra-dialectal level) to *gʷje-h₃- 'to live'.
Piotr Gąsiorowski, 25 February 2013 at 17:59
I agree that *h₂ was most likely a voiceless fricative in the uvular/pharyngeal range (velar or epiglottal values are also thinkable), but as the phonetic reconstruction is based on indirect evidence and hard to pinpoint, I prefer to err on the cautious side. When I write *h₂, every Indo-Europeanist knows at once what I mean without asking what my personal preferences are.

I don't reconstruct the stem with a final *h₃. In *h₂jú-h₃on- we have the "Hoffmann suffix" *-h₃on- added to the zero-grade of the noun. *h₂jeu- 'young' does not exist as an adjective. It's a secondary full grade which isn't likely to be of PIE date (except possibly in a variant of the loc.sg.). The Sanskrit pattern ā́yu, gen. yóṣ, as if from *h₂ój-u/*h₂j-éu-s, looks impressively archaic but is in fact analogical (modelled on proterokinetic stems). The earliest reconstructible pattern was acrostatic, with *o/*e in the root (the latter coloured to *a by the laryngeal) -- something like nom.sg. *h₂óju, gen.sg. *h₂áju-s (→ *h₂áiw-os).

And yes, it also developed meanings like 'lifetime, longevity, long time' etc. already at an early date.
Octavià Alexandre, 26 February 2013 at 13:13
I'm not sure about what you mean by "indirect", as Anatolian (Hittite) does provide direct, although partial, evidence. Also, data from other families (which most IE-ists seem reluctant to use) help with the reconstruction. So in my opinion there's no excuse for not using a real phonemic value (albeit approximate) such as χ instead of the algebraic symbol H₂ (I prefer to use capitals for emphasis), which doesn't even represent a single phoneme but at least two different phonemes (not just allophones), depending on whether or not it is part of the syllable nucleus.

Also, for *χáju- and *χjú-H₃en- I'd like to see the actual evidence for o as representative of the ablaut pattern in your reconstruction. And in the case of the "Hoffmann suffix", also for H₃.

And although this doesn't seem to be the case, it's wrong to assume (as many IE-ists do) that every case of non-ablauting *a is due to vowel-coloring. This is patent in Paleo-European substrate lexicon such as 'apple' and several 'water' words, where a vowel *ɑ could be reconstructed.

Notice also I carefully avoid using the term "PIE", because in my opinion it doesn't represent an actual and well-defined entity but rather a convenient fiction or comparative tool. I regard "PIE" as a screen where those features found in IE languages are projected, but which at the same time hides the complexity (both diachronic and diatopic) behind it.
Octavià Alexandre, 26 February 2013 at 13:55
The Spanish IE-ist Francisco Villar reconstructs a 4-vowel system i, e, ɑ, u for earlier stages of IE. Then a coming from *χe would have merged with *ɑ in some IE languages, while in others *ɑ would have been backed to *o, giving rise to a 5-vowel system. I find this a better explanation than the traditional hypothesis of the merger of o and a.

This means *ɑ (later shifted either to a or o) can either be apophonic or the product of vowel-coloring from a labialized "laryngeal". In fact, the case of the 'apple' word would be the latter, with *ɑ- < *ʕa-. In fact, Uralic *omena 'apple' (from another variant of the same "Nostratic" root) has *o- instead.

http://vasco-caucasian.blogspot.com.es/2012/04/ie-apple.html
Piotr Gąsiorowski, 26 February 2013 at 14:55
I have my own ideas about PIE ablaut, but it would be premature to reveal them here before I have presented them to my colleagues (to be criticised and perhaps demolished). As for symbols, an algebraic one is just as good as an IPA character as long as specialists agree on its meaning.

As for *h₃ in the Hoffmann suffix, there are cases where it causes voicing when added to a voiceless stop. Since *h₃ seems to have been distinctively voiced (*pi-ph₃-e-ti > *pibeti) as opposed to the other two "laryngeals", I prefer the reconstruction *-h₃on- despite Hoffmann's own preference for the first laryngeal here. The full vocalism of the suffix would be *o posttonically even if the initial laryngeal had no colouring effect.
Piotr Gąsiorowski, 26 February 2013 at 21:11
A syllabic consonant is not necessarily a separate phoneme. Also, the jury is still out on whether interconsonantal laryngeals in words like *ph₂tēr were really vocalised in PIE as opposed to being "repaired" in various ways (including cluster simplification in some environments and prop-vowel insertion in others, sometimes already in the protolanguage, but more often in the daughter languages).

Voicing before the Hoffmann suffix is quite well attested. Not only in *h₂ap-h₃on- > *abon- 'river', where it was first identified by Eric Hamp, but also e.g. in numerous Latin nouns in -g-on- derived from stems ending in /k/ (vertex : vertīgō, etc.).
Octavià Alexandre, 26 February 2013 at 22:19
Celtic *abon- is a derivative of Paleo-European *ɑb- 'water', a lexeme also found in Latin amnis. So I'm afraid there's no **H₂ap- here.
Piotr Gąsiorowski, 26 February 2013 at 23:49
Lat. amnis is of course cognate, but how does it demonstrate an underlying *b? Any labial stop followed by *n gives Latin /mn/, cf. *swepno- > somnus. So amnis may simply reflect *h₂ap-ni-, related to *h₂ap-no- (cf. Palaic hāpna- 'river'). I see no reason to label the Italic and Celtic 'river' words "Paleo-European" and separate them artificially from the Indo-Iranian and Anatolian word-family based on the acrostatic root noun *h₂ōp-s/*h₂ap- 'flowing water'.
Octavià Alexandre, 27 February 2013 at 09:37
Remember my former comment about substrate languages? I'm sure you're acquainted with Krahe's "Alteuropäische" aka Old European Hydronymy (OEH). Besides Celtic and Latin, *ɑb- can be found in German river names in -apa, -affa, as pointed out by Krahe himself.

The word you mentioned is part of a family of 'water' words such as *ɑb-, *ɑkʷ-ā, *up-/*ub-, found in the OEH. For more information, I'd recommend Villar et al. (2001): Lenguas, genes y culturas en la prehistoria de Europa y Asia Suroccidental.
Piotr Gąsiorowski, 27 February 2013 at 09:49
I'm familiar with that branch of research, but it is rather far from my idea of historical linguistics as a discipline based on sound and rigorous methodology. A "family" which includes *ab- ~ *akʷ-ā ~ *up-/*ub-, practically in free variation with each other, could include just about anything else. Especially if a root is so short, making accidental similarity hard to rule out.

See here, slides 9-10, for a cautionary example.
Octavià Alexandre, 27 February 2013 at 10:43
I'm familiar with that branch of research, but it is rather far from my idea of historical linguistics as a discipline based on sound and rigorous methodology.
Your methodology "sees" languages as complete systems with lexicon, morphology, syntax and so on. This is applicable to well-documented languages with large sets of data, but not to fragmentary systems such as substrates and long-range relationships. However, this doesn't mean research on the latter couldn't be as rigorous as on the former, only that it's much more difficult to achieve satisfactory results.

A "family" which includes *ab- ~ *akʷ-ā ~ *up-/*ub-, practically in free variation with each other, could include just about anything else.
Sorry, but I disagree. I'm afraid you threw the baby out with the bathwater.

As regards "obscurum per obscurius", there's a funny joke. One night, a drunk man was searching in vain for his key under a solitary street lamp in a dark street. A passer-by saw this and asked him: -What are you doing? -I'm searching for my key. -Are you sure you lost it here? -No, but this is the only place where I could see it.

This illustrates what many historical linguists do: trying to explain the unknown exclusively from what is known, as in Coates's etymology of the toponym London.
Octavià Alexandre, 27 February 2013 at 12:24
Villar regards these 'water' words, as well as *ip-/*ib-, as descending from a common Paleo-IE ancestor language spoken in the Upper Paleolithic (Gravettian period), although I won't go that far. The ablaut pattern of these hydronyms points to a 3-vowel system *i, *ɑ, *u similar to the Semitic one, so a Neolithic chronology can be posited, on account of other lexical correspondences dating from that period.

In particular, *ɑkʷ-ā would be cognate to the Hittite verb eku-/aku- 'to drink', which shows the standard IE ablaut. This reminds me of Iranian *dānu- 'river' and IE *dhen- 'to flow', which can be linked with Sino-Tibetan *dhɨ̄n/*dhɨ̄ŋ 'to drink, to swallow' and Basque e-dan 'to drink'.
Piotr Gąsiorowski, 27 February 2013 at 12:33
Sorry, Octavià, but I think that in these matters we can only agree to politely disagree. "Fragmentary systems" are not distinguishable from chance agreements and random noise, as far as I'm concerned. It's only wishful thinking that makes people see Palaeolithic substrates and long-range agreements behind them. In my opinion it's better to admit ignorance than to work with insufficient data. But of course it's my approach and I don't question your freedom to experiment with looser methodology.
Unknown, 10 February 2017 at 02:56
I'm no longer convinced by Kuryłowicz’ explanation of Ved. píbati etc. from *pí-ph₃e-ti. No laryngeal coloration occurs in Sicel πιβε ‘drink!’, Gaul. ibe ‘id.’, Old Ir. ibid ‘drinks’ < Celt. *fibeti, etc. If *ph₃ yielded *b, the thematic stem *pi-ph₃-e/o- should have been colored *pib-ø/o-. If there was pressure to remodel the colored thematic vowel after the usual uncolored *e, there would equally well have been pressure to remodel the consonantism after the usual reduplicated present type (as later in Lat. bibō). More plausibly *pib-e/o- is a thematization of earlier *pib-. This stem could have been extracted from a 2pl. middle imperative like those of the Vedic 3rd pres. class. Then corresponding to Ved. ju-hu-dhvá-m would have been *pi-ph₃-dʱwé ‘drink ye to one another!’, with indirect reciprocal force of the middle, referring to the drinkers’ tradition of toasting each other’s health. Hackstein’s Law *CH.CC > *C.CC (HS 115:1-22, 2002) would delete *h₃, whereupon *pdʱ would be assimilated to *bdʱ, and the new 2pl. impv. *pib-dʱwé could serve as the basis for a new athematic stem *pib-, later thematized as *pib-e/o- (which yielded the attested Ved. 2pl. mid. impv. piba-dhva-m, 3x RV, 1x AVP). Regeneration of a whole verbal paradigm from an imperative form is documented in Middle Indic. From dehi ‘give!’ (Ved. dehí, Av. dazdi, evidently corresponding to the Grk. aor. impv. δός + -θί, i.e. *dh₃es-dʱí), a new pres. stem de- was extracted, yielding deti ‘gives’ in Pāli and Prākrit. Thus *píbeti provides no basis for presuming that *h₃ or any laryngeal could spread its voicing or unvoicing to an adjacent stop. And so I see no advantage to positing *h₃ rather than *h₁ in the Hoffmann suffix. Hitt. ḫāpa- can have underlying *b, with Celt. *abon- continuing *h₂eb-h₁on-. Lat. vertīgō shouldn’t be regarded as a Hoffmann extension of vertex, -ĭcis, but as a replacement for a fem. *vortī-. 
For some reason vṛkī́ḥ-feminines became unacceptable in Italic and were repaired by conversion to i-stems (neptis) or jā-stems (avia), or by addition of *-k- (jūnīx, mātrīx, nūtrīx, etc.), *-nā- (gallīna), or *-gōn- (also in virāgō from *virā-, cf. Festus: feminas antiqui ... viras appellabant).