Proto-Uralic

Proto-Uralic is the reconstructed language ancestral to the Uralic language family. The language was originally spoken in a small area in about 7000–2000 BCE, and expanded to give differentiated protolanguages. The location of the area or Urheimat is not known, and various strongly differing proposals have been advocated, but likewise the vicinity of the Ural Mountains is generally assumed.

Definition

According to the traditional binary tree model, Proto-Uralic diverged into Proto-Samoyedic and Proto-Finno-Ugric. However, reconstructed Proto-Finno-Ugric differs little from Proto-Uralic, and many apparent differences follow from the methods used. Thus Proto-Finno-Ugric may not be separate from Proto-Uralic. Another reconstruction of the split of Proto-Uralic has three branches from the start.

"Comb" model

In the early 21st century, these tree-like models have been challenged by the hypothesis of larger number of proto-languages giving an image of a linguistic "comb" rather than a tree. Thus, the second-order groups of the Uralic phylum would then be: Sami, Finnic, Mordvinic, Mari, Permic, Hungarian, Mansi, Khanty and Samoyedic, all on equal footing. This order is both the order of geographical positions as well as linguistic similarity, with neighboring languages being more similar than distant ones.

Allo-genetic theory

A minority view considers that many features ascribed to descent from Proto-Uralic may have rather come about by convergence among originally different languages. Most scholars have rejected this model.
Another way to describe this phenomenon in terminology would be called an "Allo-genetic" language group. This was first coined and identified by Georgian-German linguist G. W. Tsereteli in his 1970 book "Zur Frage der Beziehung zwischen den semitischen und hamitischen Sprachen" while studying the Afro-Asiatic languages; to which Tsereteli suggested that the Afro-Asiatic phylum may be more of a comb and/or series of proto-languages rather than a genetic tree. In other words, according to Tsereteli's theory - just like Afro-Asiatic - one could probably surmise that Uralic may exhibit the same phenomenon and may not be a genetic language group at all but is actually an Allo-genetic language group. If G. W. Tsereteli is correct with this linguistic hypothesis; it may be possible that there was never any such thing as a Proto-Uralic language at all, for maybe Proto-Uralic actually formed out of a smaller onomastic pool rather than a larger one, and there was actually no "real" phylum.

Phonology

Similarly to the situation for Proto-Indo-European, reconstructions of Proto-Uralic are traditionally not written in IPA but in UPA.

Vowels

Proto-Uralic had vowel harmony and a rather large inventory of vowels in initial syllables, much like the modern Finnish or Estonian system:
Sometimes a mid vowel *ë is reconstructed in place of *ï, or a low back rounded *å in place of *a.
There were no monophonemic long vowels nor diphthongs, though sequences of vowel and semivowel within a single syllable could exist.

Unstressed vowels

Vowel inventory in non-initial syllables was restricted: only a two-way contrast of open and non-open vowels is incontestably reconstructible. The actual realization of this contrast is a question of debate: one view considers this two archiphonemic vowels and, realized as four allophones, as per vowel harmony.
For the non-open vowel, most branches reflect a reduced vowel ; only two branches give evidence for a specific value:

The Finnic languages show or depending on harmony, word-finally.
The Samic languages show a variety of reflexes, but these reflexes can be traced back to a Proto-Samic phoneme *ë, which is also the reflex of Proto-Uralic *i and *ü in stressed syllables.

While vowel reduction is a common sound change, Finnic is known to have adstrate influence from language groups that would not have known reduced vowels, so a value of already in Proto-Uralic remains a possibility.
Although these three or four stem types were certainly the most prominent ones in Proto-Uralic, it is possible that other, rarer types may have existed as well. These include for example kinship terms such as "sister-in-law", found as *kälü in both Proto-Finnic and Proto-Samoyedic. Janhunen and Sammallahti reconstruct here instead a word-final labial glide: *käliw.
A general difficulty in reconstructing unstressed vowels for Proto-Uralic lies in their heavy reduction and loss in many of the Uralic languages. Especially in the Ugric and Permic languages, almost no trace of unstressed vowels appears in basic word roots. The original bisyllabic root structure has been well preserved in only the more peripheral groups: Samic and Finnic in the northwest, Samoyedic in the east. The main correspondences of unstressed vowels between these are as follows:

Proto-Uralic	Proto-Samic	Proto-Finnic	Proto-Samoyedic	Notes
*-a	*-ē	*-a	*-å
*-ä	*-ē	*-ä	*-ä
*-ə	*-ë	*-e	∅	after original open syllables
*-ə	*-ë	*-e	*-ə	after original closed syllables

Developments in Mordvinic and Mari are rather more complicated. In the former, Proto-Uralic *-a and *-ä are usually reduced to *-ə; *-a is however regularly retained whenever the first syllable of the word contained *u. Proto-Uralic *-ə is regularly lost after open syllables, as well as in some other positions.

Conditional vowel shifts

A number of roots appear to diverge from the main picture of unstressed syllables in a different way: while Finnic, Samic and Samoyedic languages all have one of the "typical" stem shapes, they may not quite match. Words in these classes often feature discrepancies in the vowels of the first syllable as well, e.g. Finnic *a or *oo against Samic *ā or *oa.
A number of such cases may result simply from conditional vowel shifts in unstressed syllables. In fact, multiple vowel shifts are reconstructed in branches of Uralic sensitive to a particular combination of stem vowel and following reduced vowel, in which both change at once. A shift *a-ə > *o-a can be posited for Samic as well as the Mordvinic languages. E.g.:

Proto-Samic	Mordvinic	Proto-Finnic	Proto-Samoyedic	Hungarian	other reflexes	meaning
čoarvē < ćorwa	Erzya сюро Moksha сюра < *śorwa-	*sarvi	-	szarv		'horn'
čoalē < ćola	Erzya сюло Moksha сюра < *śola-	sooli < sali	-			'intestine'
koalō- < kola-	Erzya куло- Moksha куло- < *kola-	koole- < kali-	*kåə-	hal		'to die'
koamtē < komta	Erzya and Moksha кунда < *komta	kanci < kanti	-	-	Mari комдыш	'lid'

The change is, however, masked by the shift of *ë to *a in words such as:

Proto-Samic	Mordvinic	Proto-Finnic	Proto-Samoyedic	Hungarian	other reflexes	meaning
ńuolë < ńalə	Erzya, Moksha нал	nooli < nali	*ńël	nyíl		'arrow'
suonë < sanə	Erzya, Moksha сан	sooni < sani	*cën	ín		'vein, sinew'
θuomë < δamə	Erzya лём Moksha лайме	toomi < tami	*jëm	-		'bird cherry'
vuoptë < aptə	-	apci < apti	*ëptə	-		'hair'

In a second group, a change *ä-ä > *a-e appears to have taken place in Finnic in words such as:

Proto-Finnic	Proto-Samic	Proto-Samoyedic	Hungarian	other reflexes	meaning
loomi < lami	-	-	-	Erzya леме	'scab'
pooli < pali	*pealē	*pälä	fél	Erzya пеле	'half'
*sappi	*sāppē	-	epe	Erzya сэпе	'gall'
*talvi	*tālvē	-	tél	Erzya теле	'winter'
*vaski	*veaškē	*wäsa	vas	Mari -вож 'ore'	'copper, bronze' ~ 'iron'

Consonants

In the consonant system, palatalization, or palatal-laminal instead of apical articulation, was a phonemic feature, as it is in many modern Uralic languages. Only one series of stops existed:
The phonetic nature of the segment symbolized by *x is uncertain, though it is usually considered a back consonant;,,, and have been suggested among others. Janhunen takes no explicit stance, leaving open the option for even a vocalic value. The segment has some similarity to the Indo-European laryngeals : it is reconstructed by certain scholars in syllable-final position in word-stems where a contrastive long vowel later developed, best preserved in the Finnic languages, and where Samoyedic features a vowel sequence such as *åə. The correlation between these two stem classes is however not perfect, and alternate possibilities exist for explaining both vowel length in Finnic and vowel sequences in Samoyedic. *x is also reconstructed word-medially, and in this position it also develops to a Finnic long vowel, but has clear consonantal reflexes elsewhere: *k in Samic, *j in Mordvinic and *ɣ in Ugric. If a consonant, it probably derives from lenition of *k at a pre-Uralic stage; it is only found in words ending in a non-open vowel, while *k is infrequent or nonexistent in similar positions.
The phonetical identity of the consonant is also subject to some doubt. It is traditionally analyzed as the palatalized counterpart of the voiced dental fricative, that is, as ; however, this a typologically rare sound value for which no direct evidence is found in any Uralic language, and a pure palatal fricative is another option; a third option is a palatal liquid like, e. g., Czech ř. Some others propose to adjust the sound values of both this consonant and its plain counterpart. Ugricist László Honti has advanced a reconstruction with lateral fricatives:, for, while Frederik Kortlandt reconstructs palatalized and, alleging that they pattern like resonants.

Dubious segments

The phonemes in parentheses—*ć, *š, *ĺ—are supported by only limited evidence, and are not assumed by all scholars. Sammallahti notes that while instances of *ć are found in all three of Permic, Hungarian and Ob-Ugric, there are "very few satisfactory etymologies" showing any correlation between the branches in whether *ć or *ś appears. In the other languages, no consistent distinction between these consonants is found. The evidence for the postalveolar sibilant *š however is "scarce but probably conclusive" : it is treated distinctly from *s only in the more western languages, but certain loans from as far back as the Proto-Indo-European language have reflexes traceable to a postalveolar fricative. The possibility of *ĺ is not considered by him at all. In contrast, Janhunen, who considers Samoyedic evidence necessary for conclusions about Proto-Uralic, doubts that *š can be reconstructed, preferring to consider it a secondary, post-Proto-Uralic innovation. He agrees with Sammallahti in omitting *ĺ and in only considering a single palatal obstruent as necessary to reconstruct; for the latter he suggests the sound value of a palatal stop, .

Phonotactics

No initial or final consonant clusters were allowed, so words could begin and end with a maximum of one consonant only. The single consonants also could not occur word-initially, though at least for the first of these, this may be an coincidental omission in the data. A reconstruction "spleen" exists but is not found in Samoyedic and the most stringent criteria for a Proto-Uralic root thus exclude it. A similar case is "fox", a loanword from Indo-Iranian.
Inside word roots, only clusters of two consonants were permitted. Since *j and *w were consonants even between a vowel and another consonant, there were no sequences of a "diphthong" followed by two consonants, like in e.g. Finnish veitsi. While voicing was not a phonemic feature, double stops probably existed. The singleton–geminate contrast in most descendant languages developed into a voiced–voiceless distinction, although Finnic is a notable exception, e.g. Finnish appi, lykkää.
When, due to suffixation, consonant clusters arose that were not permitted, the non-low vowel was inserted as a prop vowel. This process was obscured in the Finnic languages by an opposing process which syncopated unstressed *e in many cases.

Prosody

Proto-Uralic did not have tones, which contrasts with Yeniseian and some Siberian languages. Neither was there contrastive stress as in Indo-European; in Proto-Uralic the first syllable was invariably stressed.

Phonological processes

may have occurred already in Proto-Uralic: if it did, it was probably a phonetical alternation involving allophonic voicing of the stop consonants: ~ , ~ , ~ .

Grammar

Grammatically Proto-Uralic was an agglutinative nominative–accusative language.

Nouns

Proto-Uralic nouns are reconstructed with at least six noun cases and three numbers, singular, dual and plural. Grammatical gender was not recognized and no Uralic language does so even today. Noun articles were unknown.
The plural marker of nouns was *-t in final position and *-j- in non-final position, as seen in Finnish talot and talojen. The dual marker has been reconstructed as *-k-, but the dual number has been lost in many of the contemporary Uralic languages.
The cases were:

nominative
accusative *-m
dative/genitive *-n
locative *-na / *-nä
ablative/instrumentative *-ta / *-tä
lative *-ŋ

The cases had only one three-way locative contrast of entering, residing and exiting. This is the origin of the three-way systems as the three different ones in Karelian Finnish. The partitive case, developed from the ablative, was a later innovation in the Finnic and Samic languages. Further cases are occasionally mentioned, e.g. Robert Austerlitz's reconstruction of Proto-Finno-Ugric includes a seventh, adverbial.
A further noun case likely already found in Proto-Uralic is the translative *-ksi. The abessive *-ktak / *-ktäk is not completely certain as it could also have been a derivational category rather than a noun case. So as many as seven or eight noun cases can be reconstructed for Proto-Uralic with high plausibility.
The nouns also had possessive suffixes, one for each combination of number and person. These took the place of possessive pronouns, which did not exist.

Verbs

Verbs were conjugated at least according to number, person and tense. The reconstructions of mood markers are controversial. Some scholars argue that there were separate subjective and objective conjugations, but this is disputed; clear reflexes of the objective conjugation are found in only the easternmost branches, and hence it may also represent an areal innovation. Negation was expressed with the means of a negative verb *e-, found as such in e.g. Finnish e+mme "we don't".

Ergativity hypothesis

Merlijn De Smit of Stockholm University has argued for ergativity in Proto-Uralic, reinterpreting the accusative case as a lative one and arguing for a marked subject via the genitive case and a verbal ending, *mV-. Support for this theory comes from the Finnish agent participle constructions, e.g. miehen ajama auto — car driven by the man, Naisen leipoma kakku — the cake that woman baked. In these constructions the subject, which is usually unmarked, is in the genitive case, while the direct object, usually marked with -n is unmarked.
This resembles a passive construction such as pater amatur a filio, filio being declined in the ablative case, except that the word order in Finnish is reversed.
This construction also occurs in Udmurt, Mari, Mordvinic, and Karelian. However, unlike Finnish, the construction is also used with intransitive sentences, characterized by the same -mV suffix on the verb, e.g. Udmurt gyrem busy, "a ploughed field, a field that has been ploughed", lyktem kišnomurt, "the arrived lady, the lady who has arrived". The -mV participle ending in Mari denotes a preterite passive meaning, e.g. in Eastern Mari omtam počmo, "the door opened", təj kaləkən mondəmo ulat, "you are forgotten by the people", and memnan tolmo korno, "the road that we have come".
This is problematic for the ergative theory because the -mV participle, labelled the ergative marker, is a passive marker in most of the languages that use it, and the Finnish agent participle constructions may in fact derive from similar constructions in Baltic languages, e.g. Lithuanian tėvo perkamas automobilis or automobilis tėvo perkamas. Notable is the unmistakable resemblance between the Baltic and Finnic verbal suffixes, and the fact that -mV is missing in both Estonian and Mordvinic, despite being two very close relatives of Finnish. However, the Baltic participle in -ma does not represent the most common Indo-European ending of a passive participle, even though it does have parallels in other Indo-European languages. Even if the ending derives from Proto-Uralic and not the Baltic languages, the transition from a passive to ergative construction is very common and has been observed in Indo-Aryan, Salish, and Polynesian. The transition begins when the unmarked subject of the passive sentence, usually marked in active sentences, is re-analyzed as an unmarked absolutive, and the marked agent as ergative.

Vocabulary

Only some 200 word roots can be reconstructed for Proto-Uralic, if it is required that every word reconstructed for the proto-language should be present in Samoyedic languages. With a laxer criterion of reconstructing words which are attested in most branches of the language family, a number in the range of 300–400 roots can be reached.
The following examples of reconstructed items are considered to fulfill the strictest criteria and are thus accepted as Proto-Uralic words by practically all scholars in the field:

Body parts and bodily functions

ïpti hair on the head
ojwa head
śilmä eye
poski cheek
käli tongue/language
mïksa liver
elä- to live
kali- to die
wajŋi breath
kosi cough
kunśi urine
küńili tear
seji pus
Kinship terms
emä mother
čečä uncle
koska aunt
mińä daughter-in-law
wäŋiw son-in-law
Verbs for universally known actions
meni- to go
toli- to come
aśkili- to step
imi- to suck
soski- to chew
pala- to eat up
uji- to swim
sala- to steal
kupsa- to extinguish
tumti- to know
Basic objects and concepts of the natural world
juka river
toxi lake
weti water
päjwä sun/warmth
kala fish
suŋi summer
śala- lightning
wanča root
koji birch
kasi spruce
sïksi Siberian pine
δ´ïmi bird cherry
muna egg
Elementary technology
tuli fire
śüδi coal
äjmä needle
pura drill/to bore
jïŋsi bow
jänti bow string
ńïli arrow
δ´ümä glue
lïpśi cradle
piksi rope
suksi ski
woča fence
Basic spatial concepts
ïla below
üli above
wasa left
pälä "half"
peli side
[Pronouns]
mun I
tun you
ke- who
mi- what

A reconstruction of a word *wäśkä, meaning 'reddish metal', has also been proposed. However, this word shows irregularities in sound correspondence, and some scholars believe it to be a Wanderwort instead.
The reconstructed vocabulary is compatible with a Mesolithic culture, a north Eurasian landscape, and contains interesting hints on kinship structure.
Examples of vocabulary correspondences between the modern Uralic languages are provided in the :fi:Uralilainen sanasto|list of comparisons at the Finnish Wikipedia.

In popular culture

The film Unna ja Nuuk has extensive dialogue in reconstructed Proto-Finno-Samic, the proto-language of the Finno-Samic languages.

Popular movies

The Hunger Games (film) - 2012 American dystopian action thriller science fiction-adventure film directed by Gary Ross and based on Suzanne Collins’s 2008 novel of the same name. It is the first insta...
untitled Captain Marvel sequel - part of Marvel Cinematic Universe....
Killers of the Flower Moon (film project) - Killers of the Flower Moon - film project in United States of America. It was presented as drama, detective fiction, thriller. The film project starred Leonardo Dicaprio, Robert De Niro. Director of...
Five Nights at Freddy's (film) - Five Nights at Freddy's - film published in 2017 in United States of America. Scenarist of the film - Scott Cawthon....

Popular books

Book of Revelation - The Book of Revelation is the final book of the New Testament, and consequently is also the final book of the Christian Bible. Its title is derived from the first word of the Koine Greek text: apok...
Book of Genesis - account of the creation of the world, the early history of humanity, Israel's ancestors and the origins...
Gospel of Matthew - The Gospel According to Matthew is the first book of the New Testament and one of the three synoptic gospels. It tells how Israel's Messiah, rejected and executed in Israel, pronounces judgement on ...
Michelin Guide - Michelin Guides are a series of guide books published by the French tyre company Michelin for more than a century. The term normally refers to the annually published Michelin Red Guide , the oldest...
Psalms - The Book of Psalms , commonly referred to simply as Psalms , the Psalter or "the Psalms", is the first book of the Ketuvim , the third section of the Hebrew Bible, and thus a book of th...
Ecclesiastes - Ecclesiastes is one of 24 books of the Tanakh , where it is classified as one of the Ketuvim . Originally written c. 450–200 BCE, it is also among the canonical Wisdom literature of the Old Tes...
The 48 Laws of Power - non-fiction book by American author Robert Greene. The book...

Popular television series

The Crown (TV series) - historical drama web television series about the reign of Queen Elizabeth II, created and principally written by Peter Morgan, and produced by Left Bank Pictures and Sony Pictures Tel...
Friends - American sitcom television series, created by David Crane and Marta Kauffman, which aired on NBC from September 22, 1994, to May 6, 2004, lasting ten seasons. With an ensemble cast sta...
Young Sheldon - spin-off prequel to The Big Bang Theory and begins with the character Sheldon...
Modern Family - American television mockumentary family sitcom created by Christopher Lloyd and Steven Levitan for the American Broadcasting Company. It ran for eleven seasons, from September 23...
Loki (TV series) - upcoming American web television miniseries created for Disney+ by Michael Waldron, based on the Marvel Comics character of the same name. It is set in the Marvel Cinematic Universe, shar...
Game of Thrones - American fantasy drama television series created by David Benioff and D. B. Weiss for HBO. It...
Shameless (American TV series) - American comedy-drama television series developed by John Wells which debuted on Showtime on January 9, 2011. It...