Macro-haplogroup L (mtDNA)


In human mitochondrial genetics, L is the mitochondrial DNA macro-haplogroup that is at the root of the anatomically modern human mtDNA phylogenetic tree. As such, it represents the most ancestral mitochondrial lineage of all currently living modern humans, also dubbed "Mitochondrial Eve".
Its two sub-clades are L1-6 and L0.
The split occurred during the Penultimate Glacial Period; L1-6 is estimated to have formed ca. 170 kya, and L0 ca. 150 kya. The formation of L0 is associated with the peopling of Southern Africa by populations ancestral to the Khoisan, ca. 140 kya, at the onset of the Eemian interglacial.
L1-6 is further subdivided into L2-6 and L1, dated ca. 150 kya and 130 kya, respectively.
Haplogroups L5, L2 and L6, L4 and L3 derive from L2-6.

Origin

The outgroup for mtDNA phylogeny of modern humans is the mtDNA of archaic humans, specifically Neanderthals and Denisovans.
The split of the modern human lineage from the Neanderthal and Denisovan lineage is dated to between ca. 760-550 kya based on full genome analysis. This is consistent with the estimate based on Y-chromosomal DNA, which places the split between ca. 806-447 kya.
In terms of mtDNA, however, it appears that modern humans and Neanderthals form a sister clade, with Denisovans as basal outgroup. The split of Neanderthal and modern human mtDNA is dated to about 498-295 kya, i.e. significantly younger than the date estimated based on nuclear DNA. This has been explained as reflecting early gene flow from Africa into the Neanderthal genome, around 270 kya or earlier, i.e. around the time of the first emergence of anatomically modern humans. Posth et al. suggest the possibility that early Homo sapiens mtDNA from Africa may have replaced the original Neanderthal mtDNA entirely even when assuming minimal admixture. The Neanderthal and Denisovan lineages diverged before about 430 kya, and Denisovan mtDNA was not affected by the introgression.
The most recent common ancestor of modern human mtDNA is dated to ca. 230-150 kya. The emergence of haplogroup L1-6 by definition dates a later time, at an estimated 200-130 kya, possibly in a population in eastern Africa. Haplogroup L0 emerges from the basal haplogroup L1-6* somewhat later, at an estimated 190-110 kya.
The deep time depth of these lineages entails that substructure of this haplogroup within Africa is complex and poorly understood.
Date estimates are necessarily imprecise. The intervals cited above represent high and low estimates of the 95% confidence interval following Soares et al., the most likely ages are to be taken near the center of these intervals.

Phylogeny

L1-6

Haplogroup L1-6 split off undifferentiated haplogroup L roughly 20,000 years after Mitochondrial Eve, or at roughly 170,000 years ago
. It diverged, in its turn, into L1, L5, and L2 before the recent out-of Africa event of ca. 70 kya. L3 emerges around 70 kya and is closely associated with the out-of-Africa event; it may have arisen either in East Africa or in Asia. L6 and L4 are a sister clades of L3, but they are limited to East Africa and did not participate in the out-of-Africa migration.
Undifferentiated L1'2'3'4'5'6 has been found in Neanderthal fossils from the Caucasus and the Altai, dated to before 50 kya. This suggests that an earlier wave of expansion of Homo sapiens left Africa between about 200-130 kya and left genetic traces by interbreeding with Neanderthals before disappearing.
Haplogroup L1 diverged from L1-6 at about 140,000 years ago. Its emergence is associated with the early peopling of Africa by anatomically modern humans during the Eemian, and it is now mostly found in African pygmies.
Haplogroup L5 was formerly classified as L1e, but is now recognized as having diverged from L2-6 at about 120 kya.
It is also mostly associated with pygmies, with highest frequency in Mbuti pygmies from Eastern Central Africa at 15%.
Haplogroup L2 diverged from L'2 at about 90 kya, associated with the peopling of West Africa.
As a result of the Bantu migration it is now widespread throughout Sub-Saharan Africa, at the expense of the previously more widespread L0, L1 and L5.
Haplogroup L6 diverged from L3'4'6 at about the same time, ca. 90 kya. It is now a minor haplogroup with distribution mostly limited to the Horn of Africa and southern Arabia.
Haplogroup L3 diverged from L3'4 at about 70 kya, likely shortly before the Southern Dispersal event, possibly in East Africa.
The mtDNA of all non-Africans is derived from L3, divided into two main lineages, M and N.
Haplogroup L4 is a minor haplogroup of East Africa that arose around 70 kya but did not participate in the out-of-Africa migration. The haplogroup formerly named L7 has been re-classified as a subclade of L4, named L4a.

L0

Haplogroup L0 arose between about 200 and 130 kya,
that is, at about the same time as L1, before the beginning of the Eemian. It is associated with the peopling of Southern Africa after about 140,000 years ago.
Its subclades are L0d and L0k. Both are almost exclusively restricted to the Khoisan of southern Africa, but L0d has also been detected among the Sandawe people of Tanzania, which suggests an ancient connection between the Khoisan and East African speakers of click languages.
Haplogroup L0f is present in relatively small frequencies in Tanzania among the Sandawe people who are known to be older then the Khoisan. L0a is most prevalent in South-East African populations, and L0b is found in Ethiopia.

Distribution

Putting aside its sub-branches, haplogroups M and N, the L haplogroups are predominant all over sub-Saharan Africa; L is at 96–100%, apart. It is found in North Africa, Arabian Peninsula, Middle East, Americas, Europe, ranging from low to high frequencies depending on the country.

Africa

With the exception of a number of lineages that returned to Africa from Eurasia after the out of Africa migration, all African lineages belong to haplogroup L. The "back-to-Africa" haplogroups including U6, X1 and possibly M1 have returned to Africa possibly as far back as 45,000 years ago. Haplogroup H, which is common among Berbers, is also believed to have entered Africa from Europe during the post-glacial expansion.
The mutations that are used to identify the basal lineages of haplogroup L, are ancient and may be 150,000 years old. The deep time depth of these lineages entails that substructure of this haplogroup within Africa is complex and, at present, poorly understood. The first split within haplogroup L occurred 140–200kya, with the mutations that define macrohaplogroups L0 and L1-6. These two haplogroups are found throughout Africa at varying frequencies and thus exhibit an entangled pattern of mtDNA variation. However the distribution of some subclades of haplogroup L is structured around geographic or ethnic units. For example, the deepest clades of haplogroup L0, L0d and L0k are almost exclusively restricted to the Khoisan of southern Africa. L0d has also been detected among the Sandawe of Tanzania, which suggests an ancient connection between the Khoisan and East African populations.

North Africa

Haplogroup L is also found at moderate frequencies in North Africa. For example, the various Berber populations have frequencies of haplogroup L lineages that range from 3% to 45%.
Haplogroup L has also been found at a small frequency of 2.2% in North African Jews from Morocco, Tunisia and Libya. Frequency was the highest in Libyan Jews 3.6%. Moroccan Arabs have more elevated SSA maternal admixture at around 21% to 36% Via L-mtDNA sequences, Highest frequencies of L-mtDNA is reported to Moroccan Arabs of The Surrounding area of El jadida at 33%

West Asia

Haplogroup L is also found in West Asia at low to moderate frequencies, most notably in Yemen where frequencies as high as 60% have been reported. It is also found at 15.50% in Bedouins from Israel, 13.68% in Palestinians, 12.55% in Jordanians, 9.48% in Iraqis, 9.15% in Syrians, 7.5% in the Hazara of Afghanistan, 6.66% in Saudi Arabians, 2.84% in Lebanese, 2.60% in Druzes from Israel, 2.44% in Kurds and 1.76% in Turks. Overall the Arab slave trade and expansion of foreign empires that encapsulated Saudi Arabia were linked to the negligible presence of haplogroup L in the Saudi Arabian gene pool.

Europe

In Europe, haplogroup L is found at low frequencies, typically less than 1% with the exception of Iberia where regional frequencies as high as 18.2% have been reported and some regions of Italy where frequencies between 2 and 3% have been found.
Overall frequency in Iberia is higher in Portugal than in Spain where frequencies are only high in the south and west of the country. Increasing frequencies are observed for Galicia and northern Portugal, through the center and to the south of Portugal. Relatively high frequencies of 7.40% and 8.30% were also reported respectively in South Spain, in the present population of Huelva and Priego de Cordoba by Casas et al. 2006. Significant frequencies were also found in the Autonomous regions of Portugal, with L haplogroups constituting about 13% of the lineages in Madeira and 3.4% in the Azores. In the Spanish archipelago of Canary Islands, frequencies have been reported at 6.6%. According to some researchers L lineages in Iberia are associated to Islamic invasions, while for others it may be due to more ancient processes as well as more recent ones through the introduction of these lineages by means of the modern slave trade. The highest frequency of Sub-Saharan lineages found so far in Europe were observed by Alvarez et al. 2010 in the comarca of Sayago in Spain and in Alcacer do Sal in Portugal.
In Italy, Haplogroup L lineages are present in some regions at frequencies between 2 and 3% in Latium, parts of Tuscany, Basilicata and Sicily.
In 2015 study found that a prehistoric episode would be the main contributor to the sub-Saharan presence in Mediterranean Europe and Iberia A 2018 study ascribed high levels of African admixture in Spain and Portugal to two separate episodes, one during the North African Islamic expansions into Iberia and one later one, possibly related to the slave trade.

The Americas

Haplogroup L lineages are found in the African diaspora of the Americas as well as indigenous Americans. Haplogroup L lineages are predominant among African Americans, Afro-Caribbeans and Afro-Latin-Americans. In Brazil, Pena et al. report that 85% of self-identified Afro-Brazilians have Haplogroup L mtDNA sequences. Haplogroup L lineages are also found at moderate frequencies in self-identified White Brazilians. Alves Silva reports that 28% of a sample of White Brazilians belong to haplogroup L. In Argentina, a minor contribution of African lineages was observed throughout the country. Haplogroup L lineages were also reported at 8% in Colombia, and at 4.50% in North-Central Mexico. In North America, haplogroup L lineages were reported at a frequency of 0.90% in White Americans of European ancestry.

Haplogroup L Frequencies (> 1%)