Detecting the anomaly zone in species trees and evidence for a misleading signal in higher-level skink phylogeny (Squamata: Scincidae).

Detecting the anomaly zone in species trees and evidence for a misleading signal in higher-level skink phylogeny (Squamata: Scincidae).

Charles W Linkem, Vladimir N. Minin, Adam D Leache
doi: http://dx.doi.org/10.1101/012096

The anomaly zone presents a major challenge to the accurate resolution of many parts of the Tree of Life. The anomaly zone is defined by the presence of a gene tree topology that is more probable than the true species tree. This discrepancy can result from consecutive rapid speciation events in the species tree. Similar to the problem of long-branch attraction, including more data (loci) will only reinforce the support for the incorrect species tree. Empirical phylogenetic studies often implement coalescent based species tree methods to avoid the anomaly zone, but to this point these studies have not had a method for providing any direct evidence that the species tree is actually in the anomaly zone. In this study, we use 16 species of lizards in the family Scincidae to investigate whether nodes that are difficult to resolve are located within the anomaly zone. We analyze new phylogenomic data (429 loci), using both concatenation and coalescent based species tree estimation, to locate conflicting topological signal. We then use the unifying principle of the anomaly zone, together with estimates of ancestral population sizes and species persistence times, to determine whether the observed phylogenetic conflict is a result of the anomaly zone. We identify at least three regions of the Scindidae phylogeny that provide demographic signatures consistent with the anomaly zone, and this new information helps reconcile the phylogenetic conflict in previously published studies on these lizards. The anomaly zone presents a real problem in phylogenetics, and our new framework for identifying anomalous relationships will help empiricists leverage their resources appropriately for overcoming this challenge.

The infant airway microbiome in health and disease impacts later asthma development

The infant airway microbiome in health and disease impacts later asthma development

Shu Mei Teo, Danny Mok, Kym Pham, Merci Kusel, Michael Serralha, Niamh Troy, Barbara J Holt, Belinda J Hales, Michael L Walker, Elysia Hollams, Yury H Bochkov, Kristine Grindle, Sebastian L Johnston, James E Gern, Peter D Sly, Patrick G Holt, Kathryn E Holt, Michael Inouye
doi: http://dx.doi.org/10.1101/012070

The nasopharynx (NP) is a reservoir for microbes associated with acute respiratory illnesses (ARI). The development of asthma is initiated during infancy, driven by airway inflammation associated with infections. Here, we report viral and bacterial community profiling of NP aspirates across a birth cohort, capturing all lower respiratory illnesses during their first year. Most infants were initially colonized with Staphylococcus or Corynebacterium before stable colonization with Alloiococcus or Moraxella, with transient incursions of Streptococcus, Moraxella or Haemophilus marking virus-associated ARIs. Our data identify the NP microbiome as a determinant for infection spread to the lower airways, severity of accompanying inflammatory symptoms, and risk for future asthma development. Early asymptomatic colonization with Streptococcus was a strong asthma predictor, and antibiotic usage disrupted asymptomatic colonization patterns.

Most viewed on Haldane’s Sieve: November 2014

The most viewed posts on Haldane’s Sieve this month were:

Spider web DNA: a new spin on noninvasive genetics of predator and prey

Spider web DNA: a new spin on noninvasive genetics of predator and prey

Charles Cong Yang Xu, Ivy J Yen, Dean Bowman, Cameron R. Turner
doi: http://dx.doi.org/10.1101/011775

Noninvasive genetic approaches enable biomonitoring without the need to directly observe or disturb target organisms. Environmental DNA (eDNA) methods have recently extended this approach by assaying genetic material within bulk environmental samples without a priori knowledge about the presence of target biological material. This paper describes a novel and promising source of noninvasive spider DNA and insect eDNA from spider webs. Using black widow spiders (Latrodectus spp.) fed with house crickets (Acheta domesticus), we successfully extracted and amplified mitochondrial DNA sequences of both spider and prey from spider web. Detectability of spider DNA did not differ between assays with amplicon sizes from 135 to 497 base pairs. Spider DNA and prey eDNA remained detectable at least 88 days after living organisms were no longer present on the web. Spider web DNA may be an important tool in conservation research, pest management, biogeography studies, and biodiversity assessments.

Genetic landscape of populations along the Silk Road: admixture and migration patterns

Genetic landscape of populations along the Silk Road: admixture and migration patterns

Massimo Mezzavilla, Diego Vozzi, Nicola Pirastu, Giorgia Girotto, Pio D’Adamo, Paolo Gasparini, Vincenza Colonna
doi: http://dx.doi.org/10.1101/011759

Background The ancient Silk Road has been a trading route between Europe and Central Asia from the 2nd century BCE to the 15th century CE. While most populations on this route have been characterized, the genetic background of others remains poorly understood, and little is known about past migration patterns. The scientific expedition “Marco Polo” has recently collected genetic and phenotypic data in six regions (Georgia, Armenia, Azerbaijan, Uzbekistan, Kazakhstan, Tajikistan) along the Silk Road to study the genetics of a number of phenotypes. Results We characterized the genetic structure of these populations within a worldwide context. We observed a West-East subdivision albeit the existence of a genetic component shared within Central Asia and nearby populations from Europe and Near East. We observed a contribution of up to 50% from Europe and Asia to most of the populations that have been analyzed. The contribution from Asia dates back to ~25 generations and is limited to the Eastern Silk Road. Time and direction of this contribution are consistent with the Mongolian expansion era. Conclusions We clarified the genetic structure of six populations from Central Asia and suggested a complex pattern of gene flow among them. We provided a map of migration events in time and space and we quantified exchanges among populations. Altogether these novel findings will support the future studies aimed at understanding the genetics of the phenotypes that have been collected during the Marco Polo campaign, they will provide insights into the history of these populations, and they will be useful to reconstruct the developments and events that have shaped modern Eurasians genomes.

Dynamic epistasis for different alleles of the same gene

Dynamic epistasis for different alleles of the same gene

Lin Xu, Brandon Barker, Zhenglong Gu
(Submitted on 24 Nov 2014)

Epistasis refers to the phenomenon in which phenotypic consequences caused by mutation of one gene depend on one or more mutations at another gene. Epistasis is critical for understanding many genetic and evolutionary processes, including pathway organization, evolution of sexual reproduction, mutational load, ploidy, genomic complexity, speciation, and the origin of life. Nevertheless, current understandings for the genome-wide distribution of epistasis are mostly inferred from interactions among one mutant type per gene, whereas how epistatic interaction partners change dynamically for different mutant alleles of the same gene is largely unknown. Here we address this issue by combining predictions from flux balance analysis and data from a recently published high-throughput experiment. Our results show that different alleles can epistatically interact with very different gene sets. Furthermore, between two random mutant alleles of the same gene, the chance for the allele with more severe mutational consequence to develop a higher percentage of negative epistasis than the other allele is 50-70% in eukaryotic organisms, but only 20-30% in bacteria and archaea. We developed a population genetics model that predicts that the observed distribution for the sign of epistasis can speed up the process of purging deleterious mutations in eukaryotic organisms. Our results indicate that epistasis among genes can be dynamically rewired at the genome level, and call on future efforts to revisit theories that can integrate epistatic dynamics among genes in biological systems.

Tissue-specific evolution of protein coding genes in human and mouse

Tissue-specific evolution of protein coding genes in human and mouse

Nadezda Kryuchkova, Marc Robinson-Rechavi
doi: http://dx.doi.org/10.1101/011692

Protein-coding genes evolve at different rates, and the influence of different parameters, from gene size to expression level, has been extensively studied. While in yeast gene expression level is the major causal factor of gene evolutionary rate, the situation is more complex in animals. Here we investigate these relations further, especially taking in account gene expression in different organs as well as indirect correlations between parameters. We used RNA-seq data from two large datasets, covering 22 mouse tissues and 27 human tissues. Over all tissues, evolutionary rate only correlates weakly with levels and breadth of expression. The strongest explanatory factors of strong purifying selection are GC content, expression in many developmental stages, and expression in brain tissues. While the main component of evolutionary rate is purifying selection, we also find tissue-specific patterns for sites under neutral evolution and for positive selection. We observe fast evolution of genes expressed in testis, but also in other tissues, notably liver, which are explained by weak purifying selection rather than by positive selection.

Estimating the temporal and spatial extent of gene flow among sympatric lizard populations (genus Sceloporus) in the southern Mexican highlands

Estimating the temporal and spatial extent of gene flow among sympatric lizard populations (genus Sceloporus) in the southern Mexican highlands

Jared A Grummer, Martha L. Calderón, Adrián Nieto Montes-de Oca, Eric N Smith, Fausto Méndez-de la Cruz, Adam Leaché
doi: http://dx.doi.org/10.1101/008623

Interspecific gene flow is pervasive throughout the tree of life. Although detecting gene flow between populations has been facilitated by new analytical approaches, determining the timing and geography of hybridization has remained difficult, particularly for historical gene flow. A geographically explicit phylogenetic approach is needed to determine the ancestral population overlap. In this study, we performed population genetic analyses, species delimitation, simulations, and a recently developed approach of species tree diffusion to infer the phylogeographic history, timing and geographic extent of gene flow in lizards of the Sceloporus spinosus group. The two species in this group, S. spinosus and S. horridus, are distributed in eastern and western portions of Mexico, respectively, but populations of these species are sympatric in the southern Mexican highlands. We generated data consisting of three mitochondrial genes and eight nuclear loci for 148 and 68 individuals, respectively. We delimited six lineages in this group, but found strong evidence of mito-nuclear discordance in sympatric populations of S. spinosus and S. horridus owing to mitochondrial introgression. We used coalescent simulations to differentiate ancestral gene flow from secondary contact, but found mixed support for these two models. Bayesian phylogeography indicated more than 60% range overlap between ancestral S. spinosus and S. horridus populations since the time of their divergence. Isolation-migration analyses, however, revealed near-zero levels of gene flow between these ancestral populations. Interpreting results from both simulations and empirical data indicate that despite a long history of sympatry among these two species, gene flow in this group has only recently occurred.

Strong selection in the human-chimpanzee ancestor links the X chromosome to speciation

Strong selection in the human-chimpanzee ancestor links the X chromosome to speciation

Julien Y Dutheil, Kasper Munch, Thomas Mailund, Kiwoong Nam, Mikkel Schierup
doi: http://dx.doi.org/10.1101/011601

The human and chimpanzee X chromosomes are less divergent than expected based on autosomal divergence. This has led to a controversial hypothesis proposing a unique role of the X chromosome in complex human-chimpanzee speciation. Here, we study incomplete lineage sorting patterns between humans, chimpanzees and gorillas to show that this low divergence is entirely due to megabase-sized regions comprising one-third of the X chromosome, where polymorphism in the human-chimpanzee ancestral species was severely reduced. Background selection can explain 10% of this reduction at most. Instead, we show that several strong selective sweeps in the ancestral species can explain these patterns. We also report evidence of population specific sweeps of a similar magnitude in extant humans that overlap the regions of low diversity in the ancestral species. These regions further correspond to chromosomal sections shown to be devoid of Neanderthal introgression into modern humans. This suggests that these X-linked regions are directly involved in forming reproductive barriers.

The seed-bank coalescent

The seed-bank coalescent

Jochen Blath, Adrián González Casanova, Noemi Kurt, Maite Wilke-Berenguer
(Submitted on 18 Nov 2014)

We identify a new natural coalescent structure, the seed-bank coalescent, which describes the gene genealogy of populations under the influence of a strong seed-bank effect, where `dormant forms’ of individuals (such as seeds or spores) may jump a significant number of generations before joining the `active’ population. Mathematically, our seed-bank coalescent appears as scaling limit in a Wright-Fisher model with geometric seed-bank age structure if the average time of seed dormancy scales with the order of the total population size N. This extends earlier results of Kaj, Krone and Lascaux (2001) who show that the genealogy of a Wright-Fisher model in the presence of a `weak’ seed-bank effect is given by a suitably time-changed Kingman coalescent. The qualitatively new feature of the seed-bank coalescent is that ancestral lineages are independently blocked at a certain rate from taking part in coalescence events, thus strongly altering the predictions of classical coalescent models. In particular, the seed-bank coalescent `does not come down from infinity’, and the time to the most recent common ancestor of a sample of size n grows like loglogn, which is the order also observed for the Bolthausen-Sznitman coalescent. This is in line with the empirical observation that seed-banks drastically increase genetic variability in a population and indicates how they may serve as a buffer against other evolutionary forces such as genetic drift and selection.