Tajik alphabet


The Tajik language has been written in three alphabets over the course of its history: an adaptation of the Perso-Arabic script, an adaptation of the Latin script and an adaptation of the Cyrillic script. Any script used specifically for Tajik may be referred to as the Tajik alphabet, which is written as алифбои тоҷикӣ in Cyrillic characters, الفبای تاجیکی with Perso-Arabic script and alifʙoji toçikī in Latin script.
The use of a specific alphabet generally corresponds with stages in history, with Arabic being used first, followed by Latin for a short period and then Cyrillic, which remains the most widely used alphabet in Tajikistan. The Bukhori dialect spoken by Bukharan Jews traditionally used the Hebrew alphabet but more often today is written using the Cyrillic variant.

Political context

As with many post-Soviet states, the change in writing system and the debates surrounding it is closely intertwined with political themes. Although not having been used since the adoption of Cyrillic, the Latin script is supported by those who wish to bring the country closer to Uzbekistan, which has adopted the Latin-based Uzbek alphabet. The Persian alphabet is supported by the devoutly religious, Islamists, and by those who wish to bring the country closer to Iran and their Persian heritage. As the de facto standard, the Cyrillic alphabet is generally supported by those who wish to maintain the status quo, and not distance the country from Russia.

History

As a result of the influence of Islam in the region, Tajik was written in the Persian alphabet up to the 1920s. Until this time, the language was not thought of as separate and simply considered a dialect of the Persian language. The Soviets began by simplifying the Persian alphabet in 1923, before moving to a Latin-based system in 1927. The Latin script was introduced by the Soviet Union as part of an effort to increase literacy and distance the, at that time, largely illiterate population, from the Islamic Central Asia. There were also practical considerations. The regular Persian alphabet, being an abjad, does not provide sufficient letters for representing the vowel system of Tajik. In addition, the abjad is more difficult to learn, each letter having different forms depending on the position in the word.
The Decree on Romanisation made this law in April, 1928. The Latin variant for Tajik was based on the work by Turcophone scholars who aimed to produce a unified Turkic alphabet, despite Tajik not being a Turkic language. The literacy campaign was successful, with near-universal literacy being achieved by the 1950s.
As part of the "russification" of Central Asia, the Cyrillic script was introduced in the late 1930s. The alphabet remained Cyrillic until the end of the 1980s with the disintegration of the Soviet Union. In 1989, with the growth in Tajik nationalism, a law was enacted declaring Tajik the state language. In addition, the law officially equated Tajik with Persian, placing the word Farsi after Tajik. The law also called for a gradual reintroduction of the Perso-Arabic alphabet.
The Persian alphabet was introduced into education and public life, although the banning of the Islamic Renaissance Party in 1993 slowed down the adoption. In 1999, the word Farsi was removed from the state-language law. the de facto standard in use is a Cyrillic alphabet, and only a very small part of the population can read the Persian alphabet.

Variants

The letters of the major versions of the Tajik alphabet are presented below, along with their phonetic values. There is also a [|comparative table] below.

Persian alphabet

A variant of the Persian alphabet is used to write Tajik. In the Tajik version, as with all other versions of the Arabic script, with the exception of ا, vowels are not given unique letters, but rather optionally indicated with diacritic marks.

Latin

The Latin script was introduced after the Russian Revolution of 1917 in order to facilitate an increase in literacy and distance the language from Islamic influence. Only lowercase letters were found in the first versions of the Latin variant, between 1926 and 1929. A slightly different version used by Jews speaking the Bukhori dialect included three extra characters for phonemes not found in the other dialects:,, and.
The unusual character Gha is called Gha and represents the phoneme. The character is found in the Common Turkic Alphabet in which most non-Slavic languages of the Soviet Union were written until the late 1930s. The Latin alphabet is not used today, although its adoption is advocated by certain groups.

Cyrillic

The Cyrillic script was introduced in Tajik Soviet Socialist Republic in the late 1930s, replacing the Latin script that had been used since the October Revolution. After 1939, materials published in Persian in the Persian alphabet were banned from the country. The alphabet below was supplemented by the letters Щ and Ы in 1952.
In addition to these thirty-five letters, the letters ц, щ and ы can be found in loanwords, although they were officially dropped in the 1998 reform, along with the letter ь. Along with the deprecation of these letters, the 1998 reform also changed the order of the alphabet, which now has the characters with diacritics following their unaltered partners, e.g. г, ғ and к, қ, etc. leading to the present order: а б в г ғ д е ё ж з и ӣ й к қ л м н о п р с т у ӯ ф х ҳ ч ҷ ш ъ э ю я. In 2010, it was suggested that the letters е ё ю я might be dropped as well. The letters е and э have the same function, except that э is used at the beginning of a word.
The alphabet includes a number of letters not found in the Russian alphabet:
During the period when the Cyrillicization took place, Ӷ ӷ also appeared a few times in the table of the Tajik Cyrillic alphabet.

Transliteration standards

The transliteration standards for the Tajik alphabet in Cyrillic into the Latin alphabet are as follows:
CyrillicIPAISO 9 1KNAB 2WWS 3ALA-LC 4Allworth 5BGN/PCGN 6Google Translate
А аaaaaaaA a
Б бbbbbbbB b
В вvvvvvvV v
Г гggggggG g
Ғ ғġghghghghƢ ƣ
Д дddddddD d
Е еee, yeeeye‐, ‐e‐eE e
Ё ёëyoëëyoyoJo jo
Ж жžzhzhžzhzhƵ ƶ
З зzzzzzzZ z
И иiiiiiiI i
Ӣ ӣīīīīīíĪ ī
Й йjyĭjyyJ j
К кkkkkkkK k
Қ қķqqķqqQ q
Л лllllllL l
М мmmmmmmM m
Н нnnnnnnN n
О оooooooO o
П пppppppP p
Р рrrrrrrR r
С сssssssS s
Т тttttttT t
У уuuuuuuU u
Ӯ ӯūūūūūŭŪ ū
Ф фffffffF f
Х хhkhkhxkhkhX x
Ҳ ҳhxhhH h
Ч чčchchčchchC c
Ҷ ҷçjjč̦jjÇ ç
Ш шšshshšshshŞ ş
Ъ ъ''''"'ʺ̱ '
Э эèè, eėèeėE e
Ю юûyui͡ujuyuyuJu ju
Я яâyai͡ajayayaJa ja

Notes to the table above:
  1. ISO 9 — The International Organization for Standardization ISO 9 specification.
  2. KNAB — From the placenames database of the Institute of the Estonian Language.
  3. WWS — From World’s Writing Systems, Bernard Comrie
  4. ALA-LC — The standard of the Library of Congress and the American Library Association.
  5. Edward Allworth, ed. Nationalities of the Soviet East. Publications and Writing Systems
  6. BGN/PCGN — The standard of the United States Board on Geographic Names and the Permanent Committee on Geographical Names for British Official Use.

    Hebrew

The Hebrew alphabet is, like the Persian alphabet, an abjad. It is used for the Jewish Bukhori dialect primarily in Samarkand and Bukhara. Additionally, since 1940, when Jewish schools were closed in Central Asia, the use of the Hebrew Alphabet outside Hebrew liturgy fell into disuse and Bukharian Jewish publications such as books and newspapers began to appear using the Tajik Cyrillic Alphabet. Today, many older Bukharian Jews who speak Bukharian and went to Tajik or Russian schools in Central Asia only know the Tajik Cyrillic Alphabet when reading and writing Bukharian and Tajik.
Sample text: דר מוקאבילי זולם איתיפאק נמאייד. מראם נאמה פרוגרמי פירקהי יאש בוכארייאן. – Дар муқобили зулм иттифоқ намоед. Муромнома – пруграми фирқаи ёш бухориён.

Samples

Tajik Cyrillic, Tajik Latin and Persian alphabet

For reference, the Persian script variant transliterated letter-for-letter into the Latin script appears as follows:

tmạm ậdmạn ậzạd bh dnyạ my̱ ậynd w ạz lḥạẓ mnzlt w ḥqwq bạ hm brạbrnd. hmh ṣḥb ʿql w wjdạnnd, bạyd nsbt bh ykdygr brạdrwạr mnạsbt nmạynd.

And the BGN/PCGN transliteration of the Cyrillic text:

Tamomi odamon ozod ba dunyo meoyand va az lihozi manzilatu huquq bo ham barobarand. Hama sohibi aqlu vijdonand, boyad nisbat ba yakdigar barodarvor munosabat namoyand.

Tajik Cyrillic and Persian alphabet

Vowel-pointed Persian includes the vowels that are not usually written.

Comparative table

A table comparing the different writing systems used for the Tajik alphabet. The Latin here is based on the 1929 standard, the Cyrillic on the revised 1998 standard, and Persian letters are given in their stand-alone forms.
CyrillicLatinPersianPhonetic
value
Examples
А аA aاَ، ـَ، ـَهсанг = سنگ
Б бB ʙбарг = برگ
В вV vوномвар = نامور
Г гG gگсанг = سنگ
Ғ ғƢ ƣғор = غار, Бағдод = بغداد
Д дD dмодар = مادر, Бағдод = بغداد
Е еE eای، ـیшер = شیر, меравам = می‌روم
Ё ёJo joیاдарё = دریا, осиёб = آسیاب
Ж жƵ ƶژжола = ژاله, каждум = کژدم
З зZ zﺯ، ﺫ، ﺽ، ﻅбаъз = بعض, назар = نظر, заҳоб = ذهاب, замин = زمین
И иI iاِ، ـِ، ـِه؛ اِیـ، ـِیـихтиёр = اختیار
Ӣ ӣĪ īـِیзебоӣ = زیبائی
Й йJ jیмай = می
К кK kکкадом = کَدام
Қ қQ qқадам = قدم
Л лL lлола = لاله
М мM mмурдагӣ = مردگی
Н нN nнон = نان
О оO oآ، ـاорзу = آرزو
П пP pپпанҷ = پنج
Р рR rранг = رنگ
С сS sﺱ، ﺙ، ﺹсар = سر, субҳ = صبح, сурайё = ثریا
Т тT tﺕ، ﻁтоҷик = تاجیک, талаб = طلب
У уU uاُ، ـُ؛ اُو، ـُوдуд = دُود
Ӯ ӯŪ ūاو، ـوхӯрдан = خوردن, ӯ = او
Ф фF fфурӯғ = فروغ
Х хX xхондан = خواندَن
Ҳ ҳH hҳофиз = حافظ
Ч чC cچчӣ = چی
Ҷ ҷÇ çҷанг = جنگ
Ш шŞ şшаб = شب
ъ'ء; ﻉтаъриф = تعریف
Э эE eای، ـیЭрон = ایران
Ю юJu juیُ, یُوиюн = ایون
Я яJa jaیَ, یَهягонагӣ = یگانگی