Please use this identifier to cite or link to this item: http://ir.mu.ac.ke:8080/jspui/handle/123456789/3562
Full metadata record
DC FieldValueLanguage
dc.contributor.authorMartin, Alicia R.-
dc.contributor.authorAtkinson, Elizabeth G.-
dc.contributor.authorAshaba, Fred K.-
dc.contributor.authorAtwoli, Lukoye-
dc.contributor.authorGichuru​, Stella-
dc.contributor.authorInjera, Wilfred E.-
dc.contributor.authorKariuki​, Symon M.-
dc.contributor.authorRoxanne, James​-
dc.contributor.authorKigen, Gabriel-
dc.contributor.authorDodge, Sheila-
dc.contributor.authorStevenson, Anne-
dc.date.accessioned2020-10-14T08:39:23Z-
dc.date.available2020-10-14T08:39:23Z-
dc.date.issued2020-
dc.identifier.urihttps://doi.org/10.1101/2020.04.27.064832-
dc.identifier.urihttp://ir.mu.ac.ke:8080/jspui/handle/123456789/3562-
dc.description.abstractBackground Genetic studies of biomedical phenotypes in underrepresented populations identify disproportionate numbers of novel associations. However, current genomics infrastructure--including most genotyping arrays and sequenced reference panels--best serves populations of European descent. A critical step for facilitating genetic studies in underrepresented populations is to ensure that genetic technologies accurately capture variation in all populations. Here, we quantify the accuracy of low-coverage sequencing in diverse African populations. Results We sequenced the whole genomes of 91 individuals to high-coverage (≥20X) from the Neuropsychiatric Genetics of African Population-Psychosis (NeuroGAP-Psychosis) study, in which participants were recruited from Ethiopia, Kenya, South Africa, and Uganda. We empirically tested two data generation strategies, GWAS arrays versus low-coverage sequencing, by calculating the concordance of imputed variants from these technologies with those from deep whole genome sequencing data. We show that low-coverage sequencing at a depth of ≥4X captures variants of all frequencies more accurately than all commonly used GWAS arrays investigated and at a comparable cost. Lower depths of sequencing (0.5-1X) performed comparable to commonly used low-density GWAS arrays. Low-coverage sequencing is also sensitive to novel variation, with 4X sequencing detecting 45% of singletons and 95% of common variants identified in high-coverage African whole genomes. Conclusion These results indicate that low-coverage sequencing approaches surmount the problems induced by the ascertainment of common genotyping arrays, including those that capture variation most common in Europeans and Africans. Low-coverage sequencing effectively identifies novel variation (particularly in underrepresented populations), and presents opportunities to enhance variant discovery at a similar cost to traditional approaches.en_US
dc.language.isoenen_US
dc.publisherbioRxiven_US
dc.subjectLow-coverage sequencingen_US
dc.subjectWhole genome sequencingen_US
dc.subjectStudy designen_US
dc.titleLow-coverage sequencing cost-effectively detects knownand novel variation in underrepresented populationsen_US
dc.typeArticleen_US
Appears in Collections:School of Medicine

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.