Please use this identifier to cite or link to this item: http://ir.mu.ac.ke:8080/jspui/handle/123456789/3562
Title: Low-coverage sequencing cost-effectively detects knownand novel variation in underrepresented populations
Authors: Martin, Alicia R.
Atkinson, Elizabeth G.
Ashaba, Fred K.
Atwoli, Lukoye
Gichuru​, Stella
Injera, Wilfred E.
Kariuki​, Symon M.
Roxanne, James​
Kigen, Gabriel
Dodge, Sheila
Stevenson, Anne
Keywords: Low-coverage sequencing
Whole genome sequencing
Study design
Issue Date: 2020
Publisher: bioRxiv
Abstract: Background Genetic studies of biomedical phenotypes in underrepresented populations identify disproportionate numbers of novel associations. However, current genomics infrastructure--including most genotyping arrays and sequenced reference panels--best serves populations of European descent. A critical step for facilitating genetic studies in underrepresented populations is to ensure that genetic technologies accurately capture variation in all populations. Here, we quantify the accuracy of low-coverage sequencing in diverse African populations. Results We sequenced the whole genomes of 91 individuals to high-coverage (≥20X) from the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study, in which participants were recruited from Ethiopia, Kenya, South Africa, and Uganda. We empirically tested two data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole genome sequencing data. We show that low-coverage sequencing at a depth of ≥4X captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1X) performed comparable to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation, with 4X sequencing detecting 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Conclusion These results indicate that low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, including those that capture variation most common in Europeans and Africans. Low-coverage sequencing effectively identifies novel variation (particularly in underrepresented populations), and presents opportunities to enhance variant discovery at a similar cost to traditional approaches.
URI: https://doi.org/10.1101/2020.04.27.064832
http://ir.mu.ac.ke:8080/jspui/handle/123456789/3562
Appears in Collections:School of Medicine

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.