SOTAVerified

A deep learning classifier for local ancestry inference

2020-11-04Code Available0· sign in to hype

Matthew Aguirre, Jan Sokol, Guhan Venkataraman, Alexander Ioannidis

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Local ancestry inference (LAI) identifies the ancestry of each segment of an individual's genome and is an important step in medical and population genetic studies of diverse cohorts. Several techniques have been used for LAI, including Hidden Markov Models and Random Forests. Here, we formulate the LAI task as an image segmentation problem and develop a new LAI tool using a deep convolutional neural network with an encoder-decoder architecture. We train our model using complete genome sequences from 982 unadmixed individuals from each of five continental ancestry groups, and we evaluate it using simulated admixed data derived from an additional 279 individuals selected from the same populations. We show that our model is able to learn admixture as a zero-shot task, yielding ancestry assignments that are nearly as accurate as those from the existing gold standard tool, RFMix.

Tasks

Reproductions