Devanagari transliteration

Devanāgarī is an Indian script used for languages including Hindi, Marathi, Nepali and Sanskrit. There are several somewhat similar methods of transliteration from Devanāgarī to the Roman script, including the influential and lossless IAST notation.

IAST

The International Alphabet of Sanskrit Transliteration is a subset of the ISO 15919 standard, used for the transliteration of Sanskrit, Prakrit and Pāḷi into Roman script with diacritics. IAST is a widely used standard. It uses diacritics to disambiguate phonetically similar but not identical Sanskrit glyphs. For example, dental and retroflex consonants are disambiguated with an underdot: dental द=d and retroflex ड=ḍ. An important feature of IAST is that it is losslessly reversible, i.e., IAST transliteration may be converted back to correct Devanāgarī or to other South Asian scripts without ambiguity. Many Unicode fonts fully support IAST display and printing.

Hunterian system

The Hunterian system is the "national system of romanization in India" and the one officially adopted by the Government of India.
The Hunterian system was developed in the nineteenth century by William Wilson Hunter, then Surveyor General of India. When it was proposed, it immediately met with opposition from supporters of the previously practiced, non-systematic and often distorting "Sir Roger Dowler method" of phonetic transcription. The dispute climaxed in a dramatic showdown at an India Council meeting on 28 May 1872, where the new Hunterian method carried the day. The Hunterian method was inherently simpler and extensible to several Indic scripts because it systematized grapheme transliteration, and it came to prevail and gain government and academic acceptance. Opponents of the grapheme transliteration model continued to mount unsuccessful attempts at reversing government policy until the turn of the century, with one critic appealing to "the Indian Government to give up the whole attempt at scientific transliteration, and decide once and for all in favour of a return to the old phonetic spelling."
Over time, the Hunterian method extended in reach to cover several Indic scripts, including Burmese and Tibetan. Provisions for schwa deletion in Indo-Aryan languages were also made where applicable, e.g. the Hindi कानपुर is transliterated as kānpur but the Sanskrit क्रम is transliterated as krama. The system has undergone some evolution over time. For instance, long vowels were marked with an accent diacritic in the original version, but this was later replaced in the 1954 Government of India update with a macron. Thus, जान was previously romanized as ján but began to be romanized as jān. The Hunterian system has faced criticism over the years for not producing phonetically accurate results and being "unashamedly geared towards an English-language receiver audience." Specifically, the lack of differentiation between retroflex and dental consonants has come in for repeated criticism and inspired several proposed modifications of Hunterian, including using a diacritic below retroflexes or capitalizing them.
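The change from an accent to a macron is mechanical enough to sketch in code. The following Python snippet is only an illustration: the function name acute_to_macron is hypothetical, and it assumes that the older spelling marks long vowels with an acute accent and uses that accent for nothing else.

```python
import unicodedata

ACUTE = "\u0301"   # combining acute accent (older long-vowel mark)
MACRON = "\u0304"  # combining macron (mark used after the 1954 update)

def acute_to_macron(text: str) -> str:
    """Rewrite acute-accented vowels with a macron (hypothetical helper)."""
    # Decompose so that each accent becomes a separate combining character.
    decomposed = unicodedata.normalize("NFD", text)
    swapped = decomposed.replace(ACUTE, MACRON)
    # Recompose into precomposed letters where they exist.
    return unicodedata.normalize("NFC", swapped)

print(acute_to_macron("j\u00e1n"))  # the input is "ján"; prints "jān"
```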

Alternative transliteration methods

Schemes with diacritics

National Library at Kolkata romanization

The National Library at Kolkata romanization, intended for the romanization of all Indic scripts, is an extension of IAST. It differs from IAST in the use of the symbols ē and ō for ए and ओ, the use of 'ḷ' for the consonant ಳ, and the absence of symbols for ॠ, ऌ and ॡ.

ISO 15919

A standard transliteration convention not just for Devanagari, but for all South-Asian languages was codified in the ISO 15919 standard of 2001, providing the basis for modern digital libraries that conform to International Organisation for Standardisation norms. ISO 15919 defines the common Unicode basis for Roman transliteration of South-Asian texts in a wide variety of languages/scripts.
ISO 15919 transliterations are platform-independent text, so they can be used identically on all modern operating systems and software packages that comply with ISO norms. Because this portability is a prerequisite for modern platforms, ISO 15919 has become the standard for digital libraries and archives that transliterate South Asian texts.
ISO 15919 uses diacritics to map the much larger set of Brahmic graphemes to the Latin script. The Devanagari-specific portion is nearly identical to the academic standard, IAST (International Alphabet of Sanskrit Transliteration), and to ALA-LC, the United States Library of Congress standard.
Another standard, United Nations Romanization Systems for Geographical Names, was developed by the United Nations Group of Experts on Geographical Names and covers many Brahmic scripts. There are some differences between ISO 15919 and UNRSGN.

ASCII schemes

Harvard-Kyoto

Compared to IAST, Harvard-Kyoto looks much simpler.
It does not contain any of the diacritic marks that IAST contains.
Instead of diacritics, Harvard-Kyoto uses capital letters.
The use of capital letters makes typing in Harvard-Kyoto much easier than in IAST but produces words with capital letters inside them.
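As a rough illustration of how capital letters stand in for diacritics, the sketch below converts IAST text to Harvard-Kyoto with a lookup table. The letter pairs are taken from the comparison tables in this article; the function name iast_to_hk is hypothetical. The input is lowercased first, on the assumption that capitalization in the IAST source carries no phonetic meaning.

```python
import unicodedata

# IAST letter -> Harvard-Kyoto, following the comparison tables in this article.
_RAW = {
    "ā": "A", "ī": "I", "ū": "U",
    "ṛ": "R", "ṝ": "RR", "ḷ": "lR", "ḹ": "lRR",
    "ṃ": "M", "ḥ": "H",
    "ṅ": "G", "ñ": "J",
    "ṭ": "T", "ḍ": "D", "ṇ": "N",
    "ś": "z", "ṣ": "S",
}
# Normalize the keys so that the lookup matches NFC-normalized input.
IAST_TO_HK = {unicodedata.normalize("NFC", k): v for k, v in _RAW.items()}

def iast_to_hk(text: str) -> str:
    """Convert IAST to Harvard-Kyoto (hypothetical helper)."""
    # Harvard-Kyoto is case-sensitive, so any capitalization is dropped first.
    text = unicodedata.normalize("NFC", text.lower())
    # Letters without diacritics are the same in both schemes.
    return "".join(IAST_TO_HK.get(ch, ch) for ch in text)

print(iast_to_hk("sa\u1e43sk\u1e5bta"))  # the input is "saṃskṛta"; prints "saMskRta"
```

The output shows the side effect noted above: capital letters appear in the middle of words.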

ITRANS scheme

ITRANS is an extension of Harvard-Kyoto. Many webpages and forums are written in ITRANS.
The ITRANS transliteration scheme was developed for the ITRANS software package, a pre-processor for Indic scripts. The user inputs in Roman letters and the ITRANS preprocessor converts the Roman letters into Devanāgarī. The latest version of ITRANS is version 5.30 released in July 2001.

Velthuis

The disadvantage of the above ASCII schemes is case-sensitivity, implying that transliterated names may not be capitalized. This difficulty is avoided with the system developed in 1996 by Frans Velthuis for TeX, loosely based on IAST, in which case is irrelevant.

SLP1

SLP1 is a case-sensitive scheme developed by Peter Scharf and Malcolm Hyman, who first described it in appendix B of their book Linguistic Issues in Encoding Sanskrit.
The advantage of SLP1 over other encodings is that a single ASCII character is used for each Devanagari letter, a peculiarity that eases reverse transliteration.
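The practical effect of the one-character-per-letter design can be sketched as follows. Converting from SLP1 is a plain per-character lookup, while converting from IAST needs longest-match logic because IAST writes some sounds as two letters (kh, ai and so on). The table follows the comparison tables in this article; the function names are hypothetical, and the IAST side assumes that a two-letter sequence always stands for a single sound.

```python
import unicodedata

# SLP1 letter followed by its IAST equivalent, from this article's tables.
_PAIRS = """
a a  A ā  i i  I ī  u u  U ū  e e  E ai  o o  O au
f ṛ  F ṝ  x ḷ  X ḹ  M ṃ  H ḥ
k k  K kh  g g  G gh  N ṅ
c c  C ch  j j  J jh  Y ñ
w ṭ  W ṭh  q ḍ  Q ḍh  R ṇ
t t  T th  d d  D dh  n n
p p  P ph  b b  B bh  m m
y y  r r  l l  v v  S ś  z ṣ  s s  h h
""".split()
SLP1_TO_IAST = {
    s: unicodedata.normalize("NFC", i)
    for s, i in zip(_PAIRS[0::2], _PAIRS[1::2])
}
IAST_TO_SLP1 = {i: s for s, i in SLP1_TO_IAST.items()}

def slp1_to_iast(text: str) -> str:
    # One SLP1 character is one sound, so a per-character lookup suffices.
    return "".join(SLP1_TO_IAST.get(ch, ch) for ch in text)

def iast_to_slp1(text: str) -> str:
    # IAST has digraphs, so try a two-letter match before a one-letter match.
    text = unicodedata.normalize("NFC", text)
    out, i = [], 0
    while i < len(text):
        for size in (2, 1):
            chunk = text[i:i + size]
            if chunk in IAST_TO_SLP1:
                out.append(IAST_TO_SLP1[chunk])
                i += len(chunk)
                break
        else:
            out.append(text[i])  # pass unknown characters through unchanged
            i += 1
    return "".join(out)

print(slp1_to_iast("kfzRa"))  # prints "kṛṣṇa"
```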

Others

Other less popular ASCII schemes include WX notation, Vedatype and the 7-bit ISO 15919. WX notation is a transliteration scheme for representing Indian languages in ASCII. It originated at IIT Kanpur for computational processing of Indian languages and is widely used among the natural language processing community in India. It is similar to SLP1, but not as versatile as far as the coverage of Vedic Sanskrit is concerned. Vedatype is another scheme, used for encoding Vedic texts at Maharishi University of Management. ISO 15919 includes a so-called "limited character set" option that replaces the diacritics with prefixes, so that it is ASCII-compatible.

Transliteration comparison

The following is a comparison of the major transliteration methods used for Devanāgarī.

Vowels

Devanāgarī	IAST	ISO 15919	Harvard-Kyoto	ITRANS	Velthuis	SLP1	WX
अ	a	a	a	a	a	a	a
आ	ā	ā	A	A/aa	aa	A	A
इ	i	i	i	i	i	i	i
ई	ī	ī	I	I/ii	ii	I	I
उ	u	u	u	u	u	u	u
ऊ	ū	ū	U	U/uu	uu	U	U
ए	e	ē	e	e	e	e	e
ऐ	ai	ai	ai	ai	ai	E	E
ओ	o	ō	o	o	o	o	o
औ	au	au	au	au	au	O	O
				-	-	-	-
ऋ	ṛ	r̥	R	RRi/R^i	.r	f	q
ॠ	ṝ	r̥̄	RR	RRI/R^I	.rr	F	Q
ऌ	ḷ	l̥	lR	LLi/L^i	.l	x	L
ॡ	ḹ	l̥̄	lRR	LLI/L^I	.ll	X	LY
				-	-	-	-
अं	ṃ	ṁ	M	M/.n/.m	.m	M	M
अः	ḥ	ḥ	H	H	.h	H	H
अँ		m̐		.N		~	az
				-	-	-	-
ऽ	'	’	'	.a	.a	'	Z

Consonants

The Devanāgarī consonant letters include an implicit 'a' sound. In all of the transliteration systems, that 'a' sound must be represented explicitly.

Devanāgarī	IAST	ISO 15919	Harvard-Kyoto	ITRANS	Velthuis	SLP1	WX
क	ka	ka	ka	ka	ka	ka	ka
ख	kha	kha	kha	kha	kha	Ka	Ka
ग	ga	ga	ga	ga	ga	ga	ga
घ	gha	gha	gha	gha	gha	Ga	Ga
ङ	ṅa	ṅa	Ga	~Na	"na	Na	fa
च	ca	ca	ca	cha	ca	ca	ca
छ	cha	cha	cha	Cha	cha	Ca	Ca
ज	ja	ja	ja	ja	ja	ja	ja
झ	jha	jha	jha	jha	jha	Ja	Ja
ञ	ña	ña	Ja	~na	~na	Ya	Fa
ट	ṭa	ṭa	Ta	Ta	.ta	wa	ta
ठ	ṭha	ṭha	Tha	Tha	.tha	Wa	Ta
ड	ḍa	ḍa	Da	Da	.da	qa	da
ढ	ḍha	ḍha	Dha	Dha	.dha	Qa	Da
ण	ṇa	ṇa	Na	Na	.na	Ra	Na
त	ta	ta	ta	ta	ta	ta	wa
थ	tha	tha	tha	tha	tha	Ta	Wa
द	da	da	da	da	da	da	xa
ध	dha	dha	dha	dha	dha	Da	Xa
न	na	na	na	na	na	na	na
प	pa	pa	pa	pa	pa	pa	pa
फ	pha	pha	pha	pha	pha	Pa	Pa
ब	ba	ba	ba	ba	ba	ba	ba
भ	bha	bha	bha	bha	bha	Ba	Ba
म	ma	ma	ma	ma	ma	ma	ma
य	ya	ya	ya	ya	ya	ya	ya
र	ra	ra	ra	ra	ra	ra	ra
ल	la	la	la	la	la	la	la
व	va	va	va	va/wa	va	va	va
श	śa	śa	za	sha	"sa	Sa	Sa
ष	ṣa	ṣa	Sa	Sha	.sa	za	Ra
स	sa	sa	sa	sa	sa	sa	sa
ह	ha	ha	ha	ha	ha	ha	ha
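The handling of the implicit 'a' can be sketched in a few lines of Python. This is a minimal illustration, not a complete transliterator: the function name to_iast is hypothetical, only the letters listed in the tables above are covered, and schwa deletion is deliberately not applied. Each consonant receives an 'a' unless it is followed by a vowel sign, which replaces the 'a', or by the virama, which removes it.

```python
import unicodedata

VIRAMA = "\u094d"  # suppresses the inherent 'a'
CONSONANTS = dict(zip(
    "कखगघङचछजझञटठडढणतथदधनपफबभमयरलवशषसह",
    "k kh g gh ṅ c ch j jh ñ ṭ ṭh ḍ ḍh ṇ t th d dh n "
    "p ph b bh m y r l v ś ṣ s h".split(),
))
MATRAS = {  # dependent vowel signs replace the inherent 'a'
    "\u093e": "ā", "\u093f": "i", "\u0940": "ī", "\u0941": "u",
    "\u0942": "ū", "\u0943": "ṛ", "\u0947": "e", "\u0948": "ai",
    "\u094b": "o", "\u094c": "au",
}
OTHER = {  # independent vowels, anusvāra and visarga
    "अ": "a", "आ": "ā", "इ": "i", "ई": "ī", "उ": "u", "ऊ": "ū",
    "ऋ": "ṛ", "ए": "e", "ऐ": "ai", "ओ": "o", "औ": "au",
    "\u0902": "ṃ", "\u0903": "ḥ",
}

def to_iast(text: str) -> str:
    out, i = [], 0
    while i < len(text):
        ch = text[i]
        if ch in CONSONANTS:
            out.append(CONSONANTS[ch])
            nxt = text[i + 1] if i + 1 < len(text) else ""
            if nxt == VIRAMA:
                i += 1                   # virama: no vowel follows
            elif nxt in MATRAS:
                out.append(MATRAS[nxt])  # vowel sign replaces the 'a'
                i += 1
            else:
                out.append("a")          # write the inherent 'a' explicitly
        else:
            out.append(OTHER.get(ch, ch))
        i += 1
    return unicodedata.normalize("NFC", "".join(out))

print(to_iast("\u0915\u094d\u0930\u092e"))  # the input is क्रम; prints "krama"
```

Because no schwa deletion is applied, कानपुर comes out as kānapura rather than the Hindi form kānpur.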

Irregular consonant clusters

Devanāgarī	ISO 15919	Harvard-Kyoto	ITRANS	Velthuis	SLP1	WX
क्ष	kṣa	kSa	kSa/kSha/xa	k.sa	kza	kRa
त्र	tra	tra	tra	tra	tra	wra
ज्ञ	jña	jJa	GYa/j~na	j~na	jYa	jFa
श्र	śra	zra	shra	"sra	Sra	Sra

Other consonants

Devanāgarī	ISO 15919	ITRANS	WX
क़	qa	qa	kZa
ख़	k͟ha	Ka	KZa
ग़	ġa	Ga	gZa
ज़	za	za	jZa
फ़	fa	fa	PZa
ड़	ṛa	.Da	dZa
ढ़	ṛha	.Dha/Rha	DZa

Comparison of IAST with ISO 15919

The table below shows just the differences between ISO 15919 and IAST for Devanagari transliteration.

Devanagari	ISO 15919	IAST	Comment
ए / े	ē	e	To distinguish between long and short 'e' in Dravidian languages, 'e' now represents ऎ / ॆ. Note that the use of ē is considered optional in ISO 15919, and using e for ए is acceptable for languages that do not distinguish long and short e.
ओ / ो	ō	o	To distinguish between long and short 'o' in Dravidian languages, 'o' now represents ऒ / ॊ. Note that the use of ō is considered optional in ISO 15919, and using o for ओ is acceptable for languages that do not distinguish long and short o.
ऋ / ृ	r̥	ṛ	In ISO 15919, ṛ is used to represent ड़.
ॠ / ॄ	r̥̄	ṝ	For consistency with r̥
ऌ / ॢ	l̥	ḷ	In ISO 15919, ḷ is used to represent ळ.
ॡ / ॣ	l̥̄	ḹ	For consistency with l̥
◌ं	ṁ	ṃ	ISO 15919 has two options for anusvāra. In the simplified nasalization option, an anusvāra is always transliterated as ṁ. In the strict nasalization option, anusvāra before a class consonant is transliterated as the class nasal: ṅ before k, kh, g, gh, ṅ; ñ before c, ch, j, jh, ñ; ṇ before ṭ, ṭh, ḍ, ḍh, ṇ; n before t, th, d, dh, n; m before p, ph, b, bh, m. ṃ is sometimes used to specifically represent Gurmukhi Tippi ੰ.
◌ं	ṅ ñ ṇ n m	ṃ	The class nasals used in the strict nasalization option described in the previous row.
◌ँ	m̐		Vowel nasalization is transliterated as a tilde above the transliterated vowel, except in Sanskrit.
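The strict nasalization option in the table can be sketched as a small post-processing step. The snippet below is an illustration under stated assumptions: the function name strict_nasalization is hypothetical, and the input is taken to be lowercase ISO 15919 text produced with the simplified option, so every anusvāra appears as ṁ. Since an aspirate begins with the same letter as its unaspirated counterpart, checking only the next letter is enough.

```python
import unicodedata

ANUSVARA = "\u1e41"  # ṁ, the ISO 15919 anusvāra

# First letter of each class consonant -> class nasal, as listed in the table.
CLASS_NASAL = {}
for initials, nasal in [
    ("kg\u1e45", "\u1e45"),            # velars      -> ṅ
    ("cj\u00f1", "\u00f1"),            # palatals    -> ñ
    ("\u1e6d\u1e0d\u1e47", "\u1e47"),  # retroflexes -> ṇ
    ("tdn", "n"),                      # dentals     -> n
    ("pbm", "m"),                      # labials     -> m
]:
    for ch in initials:
        CLASS_NASAL[ch] = nasal

def strict_nasalization(text: str) -> str:
    text = unicodedata.normalize("NFC", text)
    out = []
    for i, ch in enumerate(text):
        if ch == ANUSVARA:
            nxt = text[i + 1] if i + 1 < len(text) else ""
            # Keep ṁ when the next letter is not a class consonant.
            ch = CLASS_NASAL.get(nxt, ANUSVARA)
        out.append(ch)
    return "".join(out)

print(strict_nasalization("ga\u1e41g\u0101"))  # the input is "gaṁgā"; prints "gaṅgā"
```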

Details

Treatment of inherent schwa

Devanāgarī consonants include an "inherent a" sound, called the schwa, that must be explicitly represented with an "a" character in the transliteration. Many words and names transliterated from Devanāgarī end with "a", to indicate the pronunciation in the original Sanskrit. This schwa is obligatorily deleted in several modern Indo-Aryan languages, like Hindi, Punjabi, Marathi and others. This results in differing transliterations for Sanskrit and schwa-deleting languages that retain or eliminate the schwa as appropriate:

Sanskrit: Mahābhārata, Rāmāyaṇa, Śiva, Sāmaveda
Hindi: Mahābhārat, Rāmāyaṇ, Śiv, Sāmved

Some words may keep the final a, generally because they would be difficult to say without it:

Krishna, Vajra, Maurya
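The pattern in these examples can be approximated with a simple heuristic: drop a word-final 'a' unless it follows a consonant cluster. This rule is an assumption made for illustration and is not a linguistic description of schwa deletion. The function name drop_final_schwa is hypothetical, and deletion inside a word (as in Sāmaveda becoming Sāmved) is not handled.

```python
import unicodedata

VOWELS = set(unicodedata.normalize("NFC", "aāiīuūeoṛ"))
PLOSIVES = set(unicodedata.normalize("NFC", "kgcjṭḍtdpb"))

def drop_final_schwa(word: str) -> str:
    """Heuristic only: keep the final 'a' after a consonant cluster."""
    word = unicodedata.normalize("NFC", word)
    if len(word) < 3 or not word.endswith("a"):
        return word
    stem = word[:-1]
    # Collect the run of consonant letters just before the final 'a'.
    run = []
    for ch in reversed(stem.lower()):
        if ch in VOWELS:
            break
        run.append(ch)
    run.reverse()
    # An 'h' after a plosive belongs to an aspirate, not a new consonant.
    units = sum(
        1 for i, ch in enumerate(run)
        if not (ch == "h" and i > 0 and run[i - 1] in PLOSIVES)
    )
    return stem if units == 1 else word

for w in ["Mah\u0101bh\u0101rata", "\u015aiva", "Vajra", "Maurya"]:
    print(w, "->", drop_final_schwa(w))
```

With this rule Mahābhārata and Śiva lose their final 'a', while Vajra and Maurya keep it.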

Retroflex consonants

Most Indian languages distinguish retroflex consonants from their dental counterparts. In formal transliteration schemes, the standard Roman letters are used to indicate the dental form, and the retroflex form is indicated by special marks or by the use of other letters. For example, in IAST transliteration, the retroflex forms are ṇ, ṭ, ḍ and ṣ.
In most informal transcriptions the distinction between retroflex and dental consonants is not indicated.
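One rough way to model this loss of information is to strip the diacritics from an IAST string, after which dental and retroflex letters become indistinguishable. The snippet below is a sketch of that idea; the function name strip_diacritics is hypothetical, and real informal spellings vary far more than this.

```python
import unicodedata

def strip_diacritics(text: str) -> str:
    """Remove all combining marks, leaving only the base letters."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))

dental, retroflex = "da", "\u1e0da"  # "da" and "ḍa"
# Both syllables collapse to the same spelling once the underdot is gone.
print(strip_diacritics(dental) == strip_diacritics(retroflex))  # prints True
```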

Aspirated consonants

Where the letter "h" appears after a plosive consonant in Devanāgarī transliteration, it always indicates aspiration. Thus "ph" is pronounced as the p in "pit", never as the ph in "photo". Similarly "th" is an aspirated "t", neither the th of "this" nor the th of "thin".
The aspiration is generally indicated in both formal and informal transliteration systems.

Computer use as a drive for romanization

As English is widely used as a professional and higher-education language in India, the availability of Devanagari keyboards is dwarfed by that of English keyboards. Similarly, software and user interfaces released and promoted in India are in English, as is much of the computer education available there. Due to low awareness of Devanagari keyboard layouts, many Indian users type Hindi in the Roman script.
Before Devanagari was added to Unicode, many workarounds were used to display Devanagari on the Internet, and many sites and services have continued using them despite the widespread availability of Unicode fonts supporting Devanagari. Although there are several conventions for transliterating Hindi into the Roman script, most of them rely on diacritics. As most Indians are familiar with the Roman script through the English language, these transliteration systems are much less widely known. Most such "Romanagari" is transliterated arbitrarily to imitate English spelling, and thus results in numerous inconsistencies.
It is also detrimental to search engines, which do not classify Hindi text in the Roman script as Hindi. The same text may also not be classified as English.
Regardless of the physical keyboard's layout, it is possible to install a Devanagari keyboard layout on most modern operating systems. There are also many online services, such as Google transliteration, that convert text typed in Roman letters to Devanagari, using Hindi dictionaries for reference. This solution is similar to input method editors, which are traditionally used to input text in languages that use complex characters, such as Chinese, Japanese or Korean.

History of Sanskrit transliteration

Early Sanskrit texts were originally transmitted by memorization and repetition. Post-Harappan India had no system for writing Indic languages until the creation of the Kharoshti and Brahmi scripts. These writing systems, though adequate for Middle Indic languages, were not well-adapted to writing Sanskrit. However, later descendants of Brahmi were modified so that they could record Sanskrit in exacting phonetic detail. The earliest physical text in Sanskrit is a rock inscription by the Western Kshatrapa ruler Rudradaman, written c. 150 CE in Junagadh, Gujarat. Due to the remarkable proliferation of different varieties of Brahmi in the Middle Ages, there is today no single script used for writing Sanskrit; rather, Sanskrit scholars can write the language in a form of whatever script is used to write their local language. However, since the late Middle Ages, there has been a tendency to use Devanagari for writing Sanskrit texts for a widespread readership.
Western scholars in the 19th century adopted Devanagari for printed editions of Sanskrit texts. The editio princeps of the Rigveda by Max Müller was in Devanagari. Müller's London typesetters competed with their Petersburg peers working on Böhtlingk's and Roth's dictionary in cutting all the required ligature types.
From its beginnings, Western Sanskrit philology also felt the need for a romanized spelling of the language. Franz Bopp in 1816 used a romanization scheme, alongside Devanagari, differing from IAST in expressing vowel length by a circumflex, and aspiration by a spiritus asper. The sibilants IAST ṣ and ś he expressed with spiritus asper and lenis, respectively. Monier-Williams in his 1899 dictionary used ć, ṡ and sh for IAST c, ś and ṣ, respectively.
From the late 19th century, Western interest in typesetting Devanagari decreased. Theodor Aufrecht published his 1877 edition of the Rigveda in romanized Sanskrit, and Arthur Macdonell's 1910 Vedic grammar likewise does without Devanagari. Contemporary Western editions of Sanskrit texts appear mostly in IAST.
