Efficient Quartet Representations of Trees and Applications to Supertree and Summary Methods

Efficient Quartet Representations of Trees and Applications to Supertree and Summary Methods
Ruth Davidson, MaLyn Lawhorn, Joseph Rusinko, Noah Weber

Quartet trees which are displayed by larger phylogenetic trees have long been used as inputs for species tree and supertree reconstruction. Computational constraints prevent the use of all displayed quartets when the number of genes or number of taxa is large. We introduce the Efficient Quartet System (EQS) to represent a phylogenetic tree with a subset of the quartets displayed by the tree. We show mathematically that the set of quartets obtained from a tree via EQS contains all of the combinatorial information of the tree itself. We also demonstrate via performance tests on some simulated datasets that the use of EQS to reduce the number of quartets input to quartet-based species tree methods (including summary methods) and supertree methods only corresponds to small reductions in accuracy.

Amplifiers for the Moran Process

Amplifiers for the Moran Process
Andreas Galanis, Andreas Göbel, Leslie Ann Goldberg, John Lapinskas, David Richerby

The Moran process, as studied by Lieberman, Hauert and Nowak, is a stochastic process modelling the spread of genetic mutations in populations. It has an underlying graph in which vertices correspond to individuals. Initially, one individual (chosen uniformly at random) possesses a mutation, with fitness r>1. All other individuals have fitness 1. At each step of the discrete-time process, an individual is chosen with probability proportional to its fitness, and its state (mutant or non-mutant) is passed on to an out-neighbour chosen u.a.r. If the underlying graph is strongly connected, the process will eventually reach fixation (all individuals are mutants) or extinction (no individuals are mutants). We define an infinite family of directed graphs to be strongly amplifying if, for every r>1, the extinction probability tends to 0 as the number n of vertices increases. Strong amplification is a rather surprising property – the initial mutant only has a fixed selective advantage, independent of n, which is “amplified” to give a fixation probability tending to 1. Strong amplifiers have received quite a bit of attention. Lieberman et al. proposed two potential families of them: superstars and metafunnels. It has been argued heuristically that some infinite families of superstars are strongly amplifying. The same has been claimed for metafunnels. We give the first rigorous proof that there is a strongly amplifying family of directed graphs which we call “megastars”. We show that the extinction probability of n-vertex graphs in this family of megastars is roughly n−1/2, up to logarithmic factors, and that all infinite families of superstars and metafunnels have larger extinction probabilities as a function of n. Our analysis of megastars is tight, up to logarithmic factors. We also clarify the literature on the isothermal theorem of Lieberman et al.

Inferring protein-protein interaction networks from inter-protein sequence co-evolution

Inferring protein-protein interaction networks from inter-protein sequence co-evolution
Christoph Feinauer, Hendrik Szurmant, Martin Weigt, Andrea Pagnani

Interaction between proteins is a fundamental mechanism that underlies virtually all biological processes. Many important interactions are conserved across a large variety of species. The need to maintain interaction leads to a high degree of co-evolution between residues in the interface between partner proteins. The inference of protein-protein interaction networks from the rapidly growing sequence databases is one of the most formidable tasks in systems biology today. We propose here a novel approach based on the Direct-Coupling Analysis of the co-evolution between inter-protein residue pairs. We use ribosomal and trp operon proteins as test cases: For the small resp. large ribosomal subunit our approach predicts protein-interaction partners at a true-positive rate of 70% resp. 90% within the first 10 predictions, with areas of 0.69 resp. 0.81 under the ROC curves for all predictions. In the trp operon, it assigns the two largest interaction scores to the only two interactions experimentally known. On the level of residue interactions we show that for both the small and the large ribosomal subunit our approach predicts interacting residues in the system with a true positive rate of 60% and 85% in the first 20 predictions. We use artificial data to show that the performance of our approach depends crucially on the size of the joint multiple sequence alignments and analyze how many sequences would be necessary for a perfect prediction if the sequences were sampled from the same model that we use for prediction. Given the performance of our approach on the test data we speculate that it can be used to detect new interactions, especially in the light of the rapid growth of available sequence data.

The impact of natural selection on the distribution of cis-regulatory variation across the genome of an outcrossing plant

The impact of natural selection on the distribution of cis-regulatory variation across the genome of an outcrossing plant

Kim A Steige, Benjamin Laenen, Johan Reimegård, Douglas Scofield, Tanja Slotte

Evolutionary dynamics of cytoplasmic segregation and fusion: Mitochondrial mixing facilitated the evolution of sex at the origin of eukaryotes

Evolutionary dynamics of cytoplasmic segregation and fusion: Mitochondrial mixing facilitated the evolution of sex at the origin of eukaryotes

Arunas L Radzvilavicius

Changes in the relative abundance of two Saccharomyces species from oak forests to wine fermentations

Changes in the relative abundance of two Saccharomyces species from oak forests to wine fermentations

Sofia Dashko, Ping Liu, Helena Volk, Lorena Butinar, Jure Piskur, Justin C. Fay

Human copy number variants are enriched in regions of low-mappability

Human copy number variants are enriched in regions of low-mappability

Jean Monlong, Caroline Meloche, Guy Rouleau, Patrick Cossette, Simon Louis Girard, Guillaume Bourque

When three traits make a line: Evolution of phenotypic plasticity and genetic assimilation through linear reaction norms in stochastic environments

When three traits make a line: Evolution of phenotypic plasticity and genetic assimilation through linear reaction norms in stochastic environments

Torbjorn Ergon, Rolf Ergon

Genomic and Chemical Diversity in Cannabis

Genomic and Chemical Diversity in Cannabis

Ryan C Lynch, Daniela Vergara, Silas Tittes, Kristin White, C.J. Schwartz, Matthew J Gibbs, Travis C Ruthenburg, Kymron deCesare, Donald P Land, Nolan C Kane

Mutation rates and the evolution of germline structure

Mutation rates and the evolution of germline structure

Aylwyn Scally