Do Phylogenetic Tree Viewers correctly display Support Values?

Posted on December 28, 2015 by schraib

Lucas Czech, Alexandros Stamatakis

bioRxiv doi: http://dx.doi.org/10.1101/035360

Phylogenetic trees are routinely visualized to present and interpret the evolutionary relationships of the species that are being studied. Virtually all empirical evolutionary data studies contain a visualization of the inferred tree with support values using one of the popular and highly cited (e.g., TreeView, Dendroscope, FigTree, Archaeopteryx, etc.) tree viewing tools. As a consequence, programming errors or ambiguous semantics in tree file formats can lead to erroneous tree visualizations and consequently incorrect interpretations of phylogenetic analyses. Here, we discuss the problems that can and do arise when displaying branch support values on trees. Presumably for historical reasons, branch support values (e.g., bootstrap support or Bayesian posterior probabilities) are typically stored as node labels in the widely-used Newick tree format. However, support values are attributes of branches (bipartitions) in unrooted phylogenetic trees. Therefore, storing support values as node labels can potentially lead to incorrect support-value-to-bipartition mappings when re-rooting trees in tree viewers. This depends on the mostly implicit semantics of tree viewers for interpreting node labels. To assess the potential impact of these ambiguous and predominantly implicit semantics of support values, we analyzed 10 distinct tree viewers. We find that, most of them exhibit some sort of incorrect or unexpected behavior when re-rooting trees with support values. We find that Dendroscope interprets Newick node labels as simply that, node labels in Newick trees. However, if they are meant to represent branch support values, the support value to branch mapping is incorrect when re-rooting trees with Dendroscope. We illustrate such an incorrect mapping by example of an empirical phylogenetic study. As a solution, we suggest that (i) branch support values should exclusively be stored as meta-data associated to branches (and not nodes), and (ii) if this is not feasible, tree viewers should include a user dialogue that explicitly forces users to define if node labels shall be interpreted as node or branch labels, prior to tree visualization.

Host-pathogen co-evolution and the emergence of broadly neutralizing antibodies in chronic infections

Posted on December 28, 2015 by schraib

Host-pathogen co-evolution and the emergence of broadly neutralizing antibodies in chronic infections
Armita Nourmohammad, Jakub Otwinowski, Joshua B. Plotkin

The vertebrate adaptive immune system provides a flexible and diverse set of molecules to neutralize pathogens. Yet, viruses that cause chronic infections, such as HIV, can survive by evolving as quickly as the adaptive immune system, forming an evolutionary arms race within a host. Here we introduce a mathematical framework to study the co-evolutionary dynamics of antibodies with antigens within a patient. We focus on changes in the binding interactions between the antibody and antigen populations, which result from the underlying stochastic evolution of genotype frequencies driven by mutation, selection, and drift. We identify the critical viral and immune parameters that determine the distribution of antibody-antigen binding affinities. We also identify definitive signatures of co-evolution that measure the reciprocal response between the antibody and viruses, and we introduce experimentally measurable quantities that quantify the extent of adaptation during continual co-evolution of the two opposing populations. Finally, we analyze competition between clonal lineages of antibodies and characterize the fate of a given lineage dependent on the state of the antibody and viral populations. In particular, we derive the conditions that favor the emergence of broadly neutralizing antibodies, which may be used in designing a vaccine against HIV.

Finite-size effects and switching times for Moran dynamics with mutation

Posted on December 28, 2015 by schraib

Finite-size effects and switching times for Moran dynamics with mutation
Lee DeVille, Meghan Galiardi

We consider the Moran process with two populations competing under an iterated Prisoners’ Dilemma in the presence of mutation, and concentrate on the case where there are multiple Evolutionarily Stable Strategies. We perform a complete bifurcation analysis of the deterministic system which arises in the infinite population size. We also study the Master equation and obtain asymptotics for the invariant distribution and metastable switching times for the stochastic process in the case of large but finite population. We also show that the stochastic system has asymmetries in the form of a skew for parameter values where the deterministic limit is symmetric.

Reduction rules for the maximum parsimony distance on phylogenetic trees

Posted on December 28, 2015 by schraib

Reduction rules for the maximum parsimony distance on phylogenetic trees
Steven Kelk, Mareike Fischer, Vincent Moulton, Taoyang Wu

In phylogenetics, distances are often used to measure the incongruence between a pair of phylogenetic trees that are reconstructed by different methods or using different regions of genome. Motivated by the maximum parsimony principle in tree inference, we recently introduced the maximum parsimony (MP) distance, which enjoys various attractive properties due to its connection with several other well-known tree distances, such as TBR and SPR. Here we show that computing the MP distance between two trees, a NP-hard problem in general, is fixed parameter tractable in terms of the TBR distance between the tree pair. Our approach is based on two reduction rules–the chain reduction and the subtree reduction–that are widely used in computing TBR and SPR distances. More precisely, we show that reducing chains to length 4 (but not shorter) preserves the MP distance. In addition, we describe a generalization of the subtree reduction which allows the pendant subtrees to be rooted in different places, and show that this still preserves the MP distance. We conclude with an extended discussion in which we focus on similarities and differences between MP distance and TBR distance, and present a number of open problems.

An Invariants-based Method for Efficient Identification of Hybrid Species From Large-scale Genomic Data

Posted on December 18, 2015 by schraib

An Invariants-based Method for Efficient Identification of Hybrid Species From Large-scale Genomic Data

Laura Kubatko, Julia Chifman

bioRxiv doi: http://dx.doi.org/10.1101/034348

Coalescent-based species tree inference has become widely used in the analysis of genome-scale multilocus and SNP datasets when the goal is inference of a species-level phylogeny. However, numerous evolutionary processes are known to violate the assumptions of a coalescence-only model and complicate inference of the species tree. One such process is hybrid speciation, in which a species shares its ancestry with two distinct species. Although many methods have been proposed to detect hybrid speciation, only a few have considered both hybridization and coalescence in a unified framework, and these are generally limited to the setting in which putative hybrid species must be identified in advance. Here we propose a method that can examine genome-scale data for a large number of taxa and detect those taxa that may have arisen via hybridization, as well as their potential “parental” taxa. The method is based on a model that considers both coalescence and hybridization together, and uses phylogenetic invariants to construct a test that scales well in terms of computational time for both the number of taxa and the amount of sequence data. We test the method using simulated data for up 20 taxa and 100,000bp, and find that the method accurately identifies both recent and ancient hybrid species in less than 30 seconds. We apply the method to two empirical datasets, one composed of Sistrurus rattlesnakes for which hybrid speciation is not supported by previous work, and one consisting of several species of Heliconius butterflies for which some evidence of hybrid speciation has been previously found.

Independent evolution of ab- and adaxial stomatal density enables adaptation

Posted on December 18, 2015 by schraib

Independent evolution of ab- and adaxial stomatal density enables adaptation

Christopher David Muir, Miquel Àngel Conesa, Jeroni Galmés

bioRxiv doi: http://dx.doi.org/10.1101/034355

Are organisms free to reach their adaptive optima or constrained by hard-wired developmental programs? Recent evidence suggests that the arrangement of stomata on abaxial (lower) and adaxial (upper) leaf surfaces may be an important adaptation in plants, but stomatal traits on each surface likely share developmental pathways that could hamper evolution. We reviewed the quantitative genetics of stomatal density to look for loci that (1) affected ab- or adaxial density independently or (2) pleiotropically affected stomatal density on both surfaces. We also used phylogenetic comparative methods to test for independent versus correlated evolution of stomatal traits (density, size, and pore index) on each surface from 14 amphistomatous wild tomato taxa (Solanum; Solanaceae). Naturally occurring and laboratory-induced genetic variation alters stomatal density on one surface without affecting the other, indicating that development does not strongly constrain the spectrum of available mutations. Among wild tomato taxa, traits most closely related to function (stomatal pore index and density) evolved independently on each surface, whereas stomatal size was constrained by correlated evolution. Genetics and phylogenetics demonstrate mostly independent evolution of stomatal function on each leaf surface, facilitating largely unfettered access to fitness optima.

Evidence of adoption, monozygotic twinning, and low inbreeding rates in a large genetic pedigree of polar bears

Posted on December 18, 2015 by schraib

Evidence of adoption, monozygotic twinning, and low inbreeding rates in a large genetic pedigree of polar bears

René M. Malenfant, David W. Coltman, Evan S. Richardson, Nicholas J. Lunn, Ian Stirling, Elizabeth Adamowicz, Corey S. Davis

bioRxiv doi: http://dx.doi.org/10.1101/034009

Multigenerational pedigrees have been developed for free-ranging populations of many species, are frequently used to describe mating systems, and are used in studies of quantitative genetics. Here, we document the development of a 4449-individual pedigree for the Western Hudson Bay subpopulation of polar bears (Ursus maritimus), created from relationships inferred from field and genetic data collected over six generations of bears sampled between 1966 and 2011. Microsatellite genotypes for 22-25 loci were obtained for 2945 individuals, and parentage analysis was performed using the program FRANZ, including additional offspring-dam associations known only from capture data. Parentage assignments for a subset of 859 individuals were confirmed using an independent medium-density set of single nucleotide polymorphisms. To account for unsampled males in our population, we performed half-sib/full-sib analysis to reconstruct males using the program COLONY, resulting in a final pedigree containing 2957 assigned maternities and 1861 assigned paternities with only one observed case of inbreeding between close relatives. During genotyping, we identified two independently captured two-year-old males with identical genotypes at all 25 loci, showing–for the first time–a case of monozygotic twinning among polar bears. In addition, we documented six new cases of cub adoption, which we attribute to cub misidentification or misdirected maternal care by a female bereaved of her young. Importantly, none of these adoptions could be attributed to reduced female vigilance caused by immobilization to facilitate scientific handling, as has previously been suggested.

An evolutionary hourglass of herbivore-induced transcriptomic responses in Nicotiana attenuata

Posted on December 18, 2015 by schraib

An evolutionary hourglass of herbivore-induced transcriptomic responses in Nicotiana attenuata

Matthew Durrant, Justin Boyer, Ian T. Boldwin, Shuqing Xu

bioRxiv doi: http://dx.doi.org/10.1101/034603

Herbivore induced defences are robust, evolve rapidly and activated in plants when specific elicitors, frequently found in the herbivores’ oral secretions (OS) are introduced into wounds during attack. How these complex induced defences evolve remains unclear. Here, we show that herbivore-induced transcriptomic responses in a wild tobacco, Nicotiana attenuata, display an evolutionary hourglass: the pattern that characterises the transcriptomic evolution of embryogenesis in animals, plants, and fungi. While relatively young and rapidly evolving genes involved in signal perception and processing to regulate defence metabolite biosynthesis are recruited both early (1 h) and late (9-21 h) in the defence elicitation process, a group of highly conserved and older genes involved in transcriptomic regulation are activated in the middle stage (5 h). The appearance of the evolutionary hourglass architecture in both developmental and defence elicitation processes may reflect the importance of robustness and evolvability in the signalling of these important biological processes.

What kind of maternal effects are selected for in fluctuating environments?

Posted on December 18, 2015 by schraib

What kind of maternal effects are selected for in fluctuating environments?

Stephen R Proulx, Henrique Teotonio

bioRxiv doi: http://dx.doi.org/10.1101/034546

Adaptation to temporally fluctuating environments can be achieved through direct phenotypic evolution, by phenotypic plasticity (either developmental plasticity or trans-generational plasticity), or by randomizing offspring phenotypes (often called diversifying bet-hedging). Theory has long held that plasticity can evolve when information about the future environment is reliable while bet-hedging can evolve when mixtures of phenotypes have high average fitness (leading to low among generation variance in fitness). To date, no study has studied the evolutionary routes that lead to the evolution of randomized offspring phenotypes on the one hand or deterministic maternal effects on the other. We develop simple, yet general, models of the evolution of maternal effects and are able to directly compare selection for deterministic and randomizing maternal effects and can also incorporate the notion of differential maternal costs of producing offspring with alternative phenotypes. We find that only a small set of parameters allow bet hedging type strategies to outcompete deterministic maternal effects. Not only must there be little or no informative cues available, but also the frequency with which different environments are present must fall within a narrow range. By contrast, when we consider the joint evolution of the maternal strategy and the set of offspring phenotypes we find that deterministic maternal effects can always invade the ancestral state (lacking any form of maternal effect). The long-term ESS may, however, involve some form of offspring randomization, but only if the phenotypes evolve extreme differences in environment-specific fitness. Overall we conclude that deterministic maternal effects are much more likely to evolve than offspring randomization, and offspring randomization will only be maintained if it results in extreme differences in environment-specific fitness.

BrowseVCF: a web-based application and workflow to quickly prioritise disease-causative variants in VCF files.

Posted on December 18, 2015 by schraib

BrowseVCF: a web-based application and workflow to quickly prioritise disease-causative variants in VCF files.

Silvia Salatino, Varun Ramraj, Stefano Lise

bioRxiv doi: http://dx.doi.org/10.1101/034769

As sequencing cost continues to decrease with fast advancing Next Generation Sequencing (NGS) technologies, variant discovery is becoming a more affordable and popular analysis method among research laboratories. Following variant calling and annotation, accurate variant filtering is a crucial step to extract meaningful biological information from sequencing data and to investigate disease etiology. However, the standard variant call file format (VCF) used to store this valuable information is not easy to handle without bioinformatics skills, thus preventing investigators from directly analysing their data. Here, we present BrowseVCF, an easy-to-use stand-alone software that enables researchers to browse, query and filter millions of variants in a few seconds. Top features include the possibility to store intermediate search results, to query user-defined gene lists, to group samples for family or tumour/normal studies, to download a report of the filters applied, and to export the filtered variants in spreadsheet format. Additionally, BrowseVCF is suitable for any DNA variant analysis (exome, whole-genome and targeted sequencing), can be used also for non-diploid genomes, and is able to discriminate between Single Nucleotide Polymorphisms (SNPs), Insertions/Deletions (InDels), and Multiple Nucleotide Polymorphisms (MNPs). Thanks to its portable implementation, BrowseVCF can be used either on personal computers or as part of automated analysis pipelines. The software can be initialised with a few clicks on any operating system without any special administrative or installation permissions, and is freely available for download from https://github.com/BSGOxford/BrowseVCF.

Haldane's Sieve

Discussing preprints in population and evolutionary genetics

Monthly Archives: December 2015

Do Phylogenetic Tree Viewers correctly display Support Values?

Host-pathogen co-evolution and the emergence of broadly neutralizing antibodies in chronic infections

Finite-size effects and switching times for Moran dynamics with mutation

Reduction rules for the maximum parsimony distance on phylogenetic trees

An Invariants-based Method for Efficient Identification of Hybrid Species From Large-scale Genomic Data

Independent evolution of ab- and adaxial stomatal density enables adaptation

Evidence of adoption, monozygotic twinning, and low inbreeding rates in a large genetic pedigree of polar bears

An evolutionary hourglass of herbivore-induced transcriptomic responses in Nicotiana attenuata

What kind of maternal effects are selected for in fluctuating environments?

BrowseVCF: a web-based application and workflow to quickly prioritise disease-causative variants in VCF files.

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this:

Share this: