South Asian Genotype Project, Spring 2019 update
It’s been a while since I updated the South Asian Genotype Project. Well, I updated almost everyone. A few people had strangely formatted text files, so I’ll go add them tomorrow. Thanks to everyone who has submitted so far! One of the main things that I’ve been curious about is undersampled groups. I finally got […]
Swidden rice farming does not lead to high population density
I’ve been looking at the data from the recent Munda paper. Standard stuff, admixture, treemix, and f-statistics.The northern Munda samples were collected in Bangladesh. So I thought: I can test the hypothesis that the East Asian ancestry in Bangladesh is to a large part Santhal. After looking at it every which way, I think that […]
Indic civilization came to Southeast Asia because Indian people came to Southeast Asia. Lots of them
Reading Indonesia: Peoples and Histories. I selected it because unlike many books it wasn’t incredibly skewed to the early modern and postcolonial period. The author makes the interesting point that the Islamicization of western Indonesia and the rise of the great Javanese Hindu kingdom of Majapahit occurred around the same time. This, in contrast to […]
The main interesting thing about Bangladeshi genetics is how East Asian Bangladeshis are
I got a question about endogamy and Bangladeshis on of my other weblogs, as well as their relatedness to western (e.g., Iranian) and eastern (e.g., Southeast Asian) populations. Instead of talking, what do the data say? Most of you have probably seen me write about this before, but I think it might be useful to post […]
Punjabi genetic variation in 1000 Genomes: Hindu caste in the Land of the Pure?
In the 1000 Genomes, there is a Punjabi dataset. Here is the description: These cell lines and DNA samples were prepared from blood samples collected in Lahore, Pakistan. The samples are from a mix of parent- adult child trios and unrelated individuals who identified themselves and their parents as Punjabi. A few years ago I […]
How South Asian populations relate to each other
Since people asking me about this, and I’m running the South Asian Genotype Project, I thought I would post two non-PCA visualizations of how various South Asian groups relate to each other (along with a few outgroups). The radial plot above is a neighbor-joining tree visualized from pairwise Fst statistics (basically a proxy for genetic […]
South Asian Genotype Project, update
I’ve been working on the South Asian Genotype Project. Again, if you are interested: send me a 23andMe, Ancestry, or Family Tree DNA raw genotype file to contactgnxp -at- gmail.com. In the subject please put: “South Asian Genotype Project” The state/province your family is from Ethnolinguistic group If applicable, caste I changed the reference populations because the […]
South Asian Genotype Project update
Just a quick update. I know I haven’t been responsive, but I’ve been traveling and spending time with the family and working a lot for the past few weeks. I’m going to make some revisions to my pipeline as well. I will get back to generating results soon (as in a week or so). So […]
Present genetic variation is a weak guide to past genetic variation
As I’ve been harping on and on for the past few years that the patterns of contemporary genetic variation are probably only weakly tied to past patterns of genetic variation (though Henry Harpending warned me about this as far back as 2004). A …
Harappa Ancestry Project @ N ~ 50
Zack Ajmal now has over 50 participants in the Harappa Ancestry Project. This does not include the Pakistani populations in the HGDP, the HapMap Gujaratis, the Indians from the SVGP. Nevertheless, all these samples still barely cover vast heart of Sout…
Harappa Ancestry Project, update
Last week I announced the Harappa Ancestry Project. It now has its own dedicate website, http://www.harappadna.org. Additionally, it has its own Facebook page. For Zack to get his own URL he needs about 10 more “likes,” so please like it! (…
PCA, Razib around the world (a little)
I have put up a few posts warning readers to be careful of confusing PCA plots with real genetic variation. PCA plots are just ways to capture variation in large data sets and extract out the independent dimensions. Its great at detecting population substructure because the largest components of variation often track between population differences, […]