Monday, February 18, 2019

Harappa World Sri Lankan DNA Admixture


The Harappa DNA Ancestry project which ran from 2011-2016 has made many important discoveries including the clear DNA stratification of caste.

The Harappa World calculator which is available publicly on the Gedmatch website closely mirrors the latest South Asian DNA studies which models all South Asians primarily as a mixture of three core populations:

(1) Aboriginal hunter gatherers (AASI - Ancient Ancestral South Indians), the indigenous people of the subcontinent, and the component that makes South Asians genetically distinct from the rest of the world.

(2) Neolithic Iranian farmers (who introduced agriculture before 5000 BC and mixed with the AASI to form the Indus Valley Civilisation. These farmers likely introduced the Proto-Elamo-Dravidian languages).

(3) Bronze age Steppes pastoralists (The original 'Aryans' who entered the subcontinent circa 1500 BC).

The Neolithic Iranian farmers and Steppes pastoralists are both West Eurasian populations.



The Genomic Formation of South and Central Asia (2018)



This is a more accurate, scientific representation of the individual breakdown of South Asian populations, which in the past were erroneously described genetically as linguistic groups (e.g. racially Indo-Aryan or Dravidian, linguistic terms that should not be directly equated with genes on a one to one basis).

Harappa World DNA categories 

The seven major categories in the Harappa World calculator are the following:

South Indian
Baloch
Caucasian
NE Euro
SE Asian
Siberian
NE Asian.

These are not real populations, but the labels tell you which region these components are modal. For example, the “South Indian” component peaks in south Indian Dalits and tribals and closely correlates with the indigenous AASI component (hence why it is called 'South Indian').

The “Baloch” peaks among the Baloch people of southeastern Iran and southwest Pakistan and correlates somewhat with the Neolithic Iranian farmer component.

The “NE Euro” peaks among the eastern Baltic peoples and correlates with the Steppes people (the original Aryans).

The last three are East Asian components, running the latitude from south to north to center. They only concern populations on the eastern fringe of the subcontinent such as Bengalis.

Ancient Steppes DNA (The original Aryans) 

The following chart shows the breakdown of the DNA recovered from the skeletons of ancient Steppes people (which can be found on GEDmatch).
As it can be seen the ancient Steppes pastoralists who conquered much of Eurasia with the invention of the horse and chariot, and spread their Indo-European languages are around 60% North East European in DNA.

This is in sharp contrast to modern Iranian and Indian populations who only have a minority of Steppes DNA, despite them mainly speaking Aryan languages:

This is due to the process of elite dominance and language replacement. The Steppes people who were much smaller in number conquered these agricultural societies and replaced the local languages, whilst only contributing a relatively small amount to the gene pool.

Indeed, it is a fallacy to describe any population of modern day Iran or South Asia as being Aryan in race. They are all genetically far closer to the prehistoric farming communities that predated the Steppes migrations.

South Asian DNA

The limited data from the Harappa World calculator suggests that the Sinhalese are genetically closest to Tamils and other South Indian middle castes. These geographically close groups lie in the middle of the ‘Indian cline‘ in between South Indian Dalits and North Indian upper castes. 

The proportion of indigenous AASI genes generally rises as you go down the caste system and down the Indian subcontinent.

The South Indian Brahmins migrated from North India hence their lower proportion of ‘South Indian’ genes and higher Steppes component (North East European).

The Bengalis are clearly enriched with 'East Asian' and 'North East European' ancestry which is largely absent in Sri Lankan populations (with the exception of those recently mixed with Malays and Burghers).

This is corroborated by another recent study from 2017 which showed significant East Asian ancestry amongst Bengalis (in both the maternal and paternal lines) that is absent in most Sri Lankans:



A further study from 2015 analysed the overall autosomal DNA of 200 Sinhalese, 200 Indian Tamils, 103 Sri Lankan Tamils, 15 upper caste West Bengalis and 24 upper caste South Indians (likely Brahmins).

It found that on average the Sinhalese were genetically in between North and South Indian populations:



Harappa World DNA analysis of Sinhalese and Tamils:

https://sbarrkum.blogspot.com/2013/04/sinhalese-and-tamil-dna-admixture.html

References

1.  The Genomic Formation of South and Central Asia - Vagheesh M Narasimhan et al (2018). Available URLs:

2. Aryan migration: Everything you need to know about the new study on Indian genetics.
https://scroll.in/article/874102/aryan-migration-everything-you-need-to-know-about-the-new-study-on-indian-genetics

3. The mystery of the Sintashta people (Proto Indo-Iranians)
https://eurogenes.blogspot.com/2018/04/the-mystery-of-sintashta-people.html?fbclid=IwAR060dhe1ggEetUBycZGy7aqf7sY9d7TUHexXvKuo9DAaakWFD96XUDk4_g

4. Gedmatch database
https://www.gedmatch.com

5. Harappa DNA Ancestry project
https://www.harappadna.org/2012/05/harappaworld-admixture/

6. A genetic chronology for the Indian Subcontinent points to heavily sex-biased dispersals. Marina Silva et al. (2017)
https://bmcevolbiol.biomedcentral.com/articles/10.1186/s12862-017-0936-9





3 comments:

Sinhala DNA results - 2019 update

23andMe has by far the most specific and up to date DNA breakdowns for all South Asian populations. With a reference panel of over 83,0...