Evolutionary quantitative genomics of Populus trichocarpa

Posted on September 4, 2015 by schraib

Ilga Porth, Jaroslav Klapste, Athena D McKown, Jonathan La Mantia, Robert D Guy, Paer K Ingvarsson, Richard Hamelin, Shawn D Mansfield, Juergen Ehlting, Carl J Douglas, Yousry A El-Kassaby

bioRxiv doi: http://dx.doi.org/10.1101/026021

Forest trees generally show high levels of local adaptation and efforts focusing on understanding adaptation to climate will be crucial for species survival and management. Merging quantitative genetics and population genomics, we studied the molecular basis of climate adaptation in 433 Populus trichocarpa (black cottonwood) genotypes originating across western North America. Variation in 74 field-assessed traits (growth, ecophysiology, phenology, leaf stomata, wood, and disease resistance) was investigated for signatures of selection (comparing QST -FST) using clustering of individuals by climate of origin. 29,354 SNPs were investigated employing three different outlier detection methods. Narrow-sense QST for 53% of distinct field traits was significantly divergent from expectations of neutrality (indicating adaptive trait variation); 2,855 SNPs showed signals of diversifying selection, and of these, 118 SNPs (within 81 genes) were associated with adaptive traits (based on significant QST). Many SNPs were putatively pleiotropic for functionally uncorrelated adaptive traits, such as autumn phenology, height, and disease resistance. Evolutionary quantitative genomics in P. trichocarpa provides an enhanced understanding regarding the molecular basis of climate-driven selection in forest trees. We highlight that important loci underlying adaptive trait variation also show relationship to climate of origin.

Genetic evidence challenges the native status of a threatened freshwater fish (Carassius carassius) in England

Posted on September 4, 2015 by schraib

Genetic evidence challenges the native status of a threatened freshwater fish (Carassius carassius) in England

Daniel L Jeffries, Gordon H Copp, Lori-Jayne Lawson Handley, Carl D Sayer, Bernd Hänfling

bioRxiv doi: http://dx.doi.org/10.1101/026088

A fundamental consideration for the conservation of a species is the extent of its native range, however defining a native range is often challenging as changing environments drive shifts in species distributions over time. The crucian carp, Carassius carassius (L.) is a threatened freshwater fish native to much of Europe, however the extent of this range is ambiguous. One particularly contentious region is England, in which C. carassius is currently considered native on the basis of anecdotal evidence. Here, we use 13 microsatellite loci, population structure analyses and approximate bayesian computation (ABC), to empirically test the native status of C. carassius in England. Contrary to the current consensus, ABC yields strong support for introduced origins of C. carassius in England, with posterior distribution estimates placing their introduction in the 15th century, well after the loss of the doggerland landbridge. This result brings to light an interesting and timely debate surrounding our motivations for the conservation of species. We discuss this topic, and make arguments for the continued conservation of C. carassius in England, despite its non-native origins.

Comparing RADseq and microsatellites to infer complex phylogeographic patterns, a real data informed perspective in the Crucian carp, Carassius carassius, L.

Posted on September 3, 2015 by schraib

Comparing RADseq and microsatellites to infer complex phylogeographic patterns, a real data informed perspective in the Crucian carp, Carassius carassius, L.

Daniel L Jeffries, Gordon H Copp, Lori-Jayne Lawson Handley, Håkan Olsén, Carl D Sayer, Bernd Hänfling

bioRxiv doi: http://dx.doi.org/10.1101/025973

The conservation of threatened species must be underpinned by phylogeographic knowledge in order to be effective. This need is epitomised by the freshwater fish Carassius carassius, which has recently undergone drastic declines across much of its European range. Restriction Site Associated DNA sequencing (RADseq) is being increasingly used for such phylogeographic questions, however RADseq is expensive, and limitations on sample number must be weighed against the benefit of large numbers of markers. Such tradeoffs have predominantly been addressed using simulated data. Here we compare the results generated from microsatellites and RADseq to the phylogeography of C. carassius, to add real-data-informed perspectives to this important debate. These datasets, along with data from the mitochondrial cytochrome b gene, agree on broad phylogeographic patterns; showing the existence of two previously unidentified C. carassius lineages in Europe. These lineages have been isolated for approximately 2.2-2.3 M years, and should arguably be considered as separate conservation units. RADseq recovered finer population structure and stronger patterns of IBD than microsatellites, despite including only 17.6% of samples (38% of populations and 52% of samples per population). RADseq was also used along with Approximate Bayesian Computation to show that the postglacial colonisation routes of C. carassius differ from the general patterns of freshwater fish in Europe, likely as a result of their distinctive ecology.

Application of a dense genetic map for assessment of genomic responses to selection and inbreeding in Heliothis virescens.

Posted on September 3, 2015 by schraib

Application of a dense genetic map for assessment of genomic responses to selection and inbreeding in Heliothis virescens.

Megan Fritz, Sandra Paa, Jennifer Baltzegar, Fred Gould

bioRxiv doi: http://dx.doi.org/10.1101/025981

Adaptation of pest species to laboratory conditions and selection for resistance to toxins in the laboratory are expected to cause inbreeding and genetic bottlenecks that reduce genetic variation. Heliothis virescens, a major cotton pest, has been colonized in the laboratory many times, and a few laboratory colonies have been selected for Bt resistance. We developed 350 bp Double-Digest Restriction-site Associated DNA-sequencing (ddRAD-seq) molecular markers to examine and compare changes in genetic variation associated with laboratory adaptation, artificial selection, and inbreeding in this non-model insect species. We found that allelic and nucleotide diversity declined dramatically in laboratory-reared H. virescens as compared with field-collected populations. The declines were primarily due to the loss of low frequency alleles present in field-collected H. virescens. A further, albeit modest decline in genetic diversity was observed in a Bt-selected population. The greatest decline was seen in H. virescens that were sib-mated for 10 generations, where more than 80% of loci were fixed for a single allele. To determine which regions of the genome were resistant to fixation in our sib-mated and Bt-selected lines, we generated a dense intraspecific linkage map containing 3 PCR-based, and 659 ddRAD-seq markers. Markers that retained polymorphism were observed in small clusters spread over multiple linkage groups. These markers are likely associated with genomic regions under balancing selection, thus preventing fixation of deleterious alleles.

Extreme Mitogenomic Variation Without Cryptic Speciation in Chaetognaths

Posted on September 3, 2015 by schraib

Extreme Mitogenomic Variation Without Cryptic Speciation in Chaetognaths

Ferdinand Marletaz, Yannick Le Parco, Shenglin Liu, Katja Peijnenburg

bioRxiv doi: http://dx.doi.org/10.1101/025957

The extent of within-species genetic variation across the diversity of animal life is a fundamental but largely unexplored problem in ecology and evolution. The neutral theory of molecular evolution predicts that genetic variation scales positively with population size. However, the genetic diversity of mitochondrial DNA, a prominent marker used in DNA barcoding studies, shows very little variation across animal species. Here, we report an unprecedented case of extreme mitochondrial variation within natural populations of two species of chaetognaths (arrow worms). We determined that this diversity is composed of deep intraspecific mitochondrial lineages within single populations that could be as divergent as human and newt. This mitochondrial diversity is the highest ever reported in animals without evidence of cryptic speciation or allopatric divergence as supported by nuclear evidence. We sequenced 54 complete mitogenomes revealing gene order rearrangements between these intraspecific lineages. Such structural differences have never previously been reported within single species. We confirm that this divergence was not driven by positive selection, and conversely show that these lineages evolved under purifying selection, consistently with neutral expectations. Our findings question the generally accepted narrow range of genetic variation in animal mitochondria and argue for a reappraisal of DNA barcoding techniques. Furthermore, extreme levels of mitogenomic variation in chaetognaths challenge classical views regarding mitochondrial evolution and cyto-nuclear co-evolution.

Using Genetic Distance to Infer the Accuracy of Genomic Prediction

Posted on September 3, 2015 by schraib

Using Genetic Distance to Infer the Accuracy of Genomic Prediction
Marco Scutari, Ian Mackay, David Balding

The prediction of phenotypic traits using high-density genomic data has many applications such as the selection of plants and animals of commercial interest; and it is expected to play an increasing role in medical diagnostics. Statistical models used for this task are usually tested using cross-validation, which implicitly assumes that new individuals (whose phenotypes we would like to predict) originate from the same population the genomic prediction model is trained on.
In this paper we investigate the effect of increasing genetic distance between training and target populations when predicting quantitative traits. This is important for plant and animal genetics, where genomic selection programs rely on the precision of predictions in future rounds of breeding. Therefore, estimating how quickly predictive accuracy decays is important in deciding which training population to use and how often the model has to be recalibrated. We find that the correlation between true and predicted values decays approximately linearly with respect to either $\F$ or mean kinship between the training and the target populations. We illustrate this relationship using simulations and a collection of data sets from mice, wheat and human genetics.

Fundamental Properties of the Evolution of Mutational Robustness

Posted on September 3, 2015 by schraib

Fundamental Properties of the Evolution of Mutational Robustness
Lee Altenberg

Evolution on neutral networks of genotypes has been found in models to concentrate on genotypes with high mutational robustness, to a degree determined by the topology of the network. Here analysis is generalized beyond neutral networks to arbitrary selection and parent-offspring transmission. In this larger realm, geometric features determine mutational robustness: the alignment of fitness with the orthogonalized eigenvectors of the mutation matrix weighted by their eigenvalues. “House of cards” mutation is found to preclude the evolution of mutational robustness. Genetic load is shown to increase with increasing mutation in arbitrary single and multiple locus fitness landscapes. The rate of decrease in population fitness can never grow as mutation rates get higher, showing that “error catastrophes” for genotype frequencies never cause precipitous losses of population fitness. The “inclusive inheritance” approach taken here naturally extends these results to a new concept of dispersal robustness.

An in-host model of HIV incorporating latent infection and viral mutation

Posted on September 3, 2015 by schraib

An in-host model of HIV incorporating latent infection and viral mutation
Stephen Pankavich, Deborah Shutt

We construct a seven-component model of the in-host dynamics of the Human Immunodeficiency Virus Type-1 (i.e, HIV) that accounts for latent infection and the propensity of viral mutation. A dynamical analysis is conducted and a theorem is presented which characterizes the long time behavior of the model. Finally, we study the effects of an antiretroviral drug and treatment implications.

Estimating Reproducibility in Genome-Wide Association Studies

Posted on September 3, 2015 by schraib

Estimating Reproducibility in Genome-Wide Association Studies
Wei Jiang, Jing-Hao Xue, Weichuan Yu

Genome-wide association studies (GWAS) are widely used to discover genetic variants associated with diseases. To control false positives, all findings from GWAS need to be verified with additional evidences, even for associations discovered from a high power study. Replication study is a common verification method by using independent samples. An association is regarded as true positive with a high confidence when it can be identified in both primary study and replication study. Currently, there is no systematic study on the behavior of positives in the replication study when the positive results of primary study are considered as the prior information.
In this paper, two probabilistic measures named Reproducibility Rate (RR) and False Irreproducibility Rate (FIR) are proposed to quantitatively describe the behavior of primary positive associations (i.e. positive associations identified in the primary study) in the replication study. RR is a conditional probability measuring how likely a primary positive association will also be positive in the replication study. This can be used to guide the design of replication study, and to check the consistency between the results of primary study and those of replication study. FIR, on the contrary, measures how likely a primary positive association may still be a true positive even when it is negative in the replication study. This can be used to generate a list of potentially true associations in the irreproducible findings for further scrutiny. The estimation methods of these two measures are given. Simulation results and real experiments show that our estimation methods have high accuracy and good prediction performance.

There are no caterpillars in a wicked forest

Posted on September 3, 2015 by schraib

There are no caterpillars in a wicked forest
James H. Degnan, John A. Rhodes

Species trees represent the historical divergences of populations or species, while gene trees trace the ancestry of individual gene copies sampled within those populations. In cases involving rapid speciation, gene trees with topologies that differ from that of the species tree can be most probable under the standard multispecies coalescent model, making species tree inference more difficult. Such anomalous gene trees are not well understood except for some small cases. In this work, we establish one constraint that applies to trees of any size: gene trees with “caterpillar” topologies cannot be anomalous. The proof of this involves a new combinatorial object, called a population history, which keeps track of the number of coalescent events in each ancestral population.

Haldane's Sieve

Discussing preprints in population and evolutionary genetics

Author Archives: schraib

Evolutionary quantitative genomics of Populus trichocarpa

Genetic evidence challenges the native status of a threatened freshwater fish (Carassius carassius) in England

Comparing RADseq and microsatellites to infer complex phylogeographic patterns, a real data informed perspective in the Crucian carp, Carassius carassius, L.

Application of a dense genetic map for assessment of genomic responses to selection and inbreeding in Heliothis virescens.

Extreme Mitogenomic Variation Without Cryptic Speciation in Chaetognaths

Using Genetic Distance to Infer the Accuracy of Genomic Prediction

Fundamental Properties of the Evolution of Mutational Robustness

An in-host model of HIV incorporating latent infection and viral mutation

Estimating Reproducibility in Genome-Wide Association Studies

There are no caterpillars in a wicked forest

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: