Understanding Admixture Fractions
Mason Liang, Rasmus Nielsen
doi: http://dx.doi.org/10.1101/008078
Estimation of admixture fractions has become one of the most commonly used computational tools in population genomics. However, there is remarkably little population genetic theory on their statistical properties. We develop theoretical results that can accurately predict means and variances of admixture proportions within a population using models with recombination and genetic drift. Based on established theory on measures of multilocus disequilibrium, we show that there is a set of recurrence relations that can be used to derive expectations for higher moments of the admixture fraction distribution. We obtain closed form solutions for some special cases. Using these results, we develop a method for estimating admixture parameters from estimated admixture proportion obtained from programs such as Structure or Admixture. We apply this method to HapMap data and find that the population history of African Americans, as expected, is not best explained by a single admixture event between people of European and African ancestry. A model of constant gene flow for the past 11 generations until 2 generations ago gives a better fit.
Hi Mason, we met briefly last year at SMBE — really interesting work. Did you compare your results to those of Simon Gravel (Genetics, 2012)? Specifically, I am thinking about your figure 3 and his figure 6 (more in the appendix). He derives an expression for the variance of admixture fractions when there is drift and recombination. I’m curious how your distributions are similar/different.
Thanks,
Amy
Pingback: Most viewed on Haldane’s Sieve: August 2014 | Haldane's Sieve