Genetic structure correlates with ethnolinguistic diversity in eastern and southern Africa

Citation:

Atkinson EG, Dalvie S, Pichkar Y, Kalungi A, Majara L, Stevenson A, Abebe T, Akena D, Alemayehu M, Ashaba FK, Atwoli L, Baker M, Chibnik LB, Creanza N, Daly MJ, Fekadu A, Gelaye B, Gichuru S, Injera WE, James R, Kariuki SM, Kigen G, Koen N, Koenen KC, Koenig Z, Kwobah E, Kyebuzibwa J, Musinguzi H, Mwema RM, Neale BM, Newman CP, Newton CRJC, Ongeri L, Ramachandran S, Ramesar R, Shiferaw W, Stein DJ, Stroud RE, Teferra S, Zingela Z, Martin AR,, R.K. Genetic structure correlates with ethnolinguistic diversity in eastern and southern Africa [Internet]. bioRxiv 2021;

Abstract:

African populations are the most diverse in the world yet are sorely underrepresented in medical genetics research. Here, we examine the structure of African populations using genetic and comprehensive multigenerational ethnolinguistic data from the Neuropsychiatric Genetics of African Populations-Psychosis study (NeuroGAP-Psychosis) consisting of 900 individuals from Ethiopia, Kenya, South Africa, and Uganda. We find that self-reported language classifications meaningfully tag underlying genetic variation that would be missed with consideration of geography alone, highlighting the importance of culture in shaping genetic diversity. Leveraging our uniquely rich multi-generational ethnolinguistic metadata, we track language transmission through the pedigree, observing the disappearance of several languages in our cohort as well as notable shifts in frequency over three generations. We further find significantly higher language transmission rates for matrilineal groups as compared to patrilineal. We highlight both the diversity of variation within the African continent, as well as how within-Africa variation can be informative for broader variant interpretation; many variants appearing rare elsewhere are common in parts of Africa. The work presented here improves the understanding of the spectrum of genetic variation in African populations and highlights the enormous and complex genetic and ethnolinguistic diversity within Africa.Competing Interest StatementA.R.M. has consulted for 23andMe and Illumina and received speaker fees from Genentech, Pfizer, and Illumina. B.M.N. is a member of the Deep Genomics Scientific Advisory Board. He also serves as a consultant for the Camp4 Therapeutics Corporation, Takeda Pharmaceutical and Biogen. M.J.D. is a founder of Maze Therapeutics. The remaining authors declare no competing interests.

Website