The origin and evolution of species ranges remains a central focus of historical biogeography and the advent of likelihood methods based on phylogenies has revolutionized the way in which range evolution has been studied. A decade ago, the first elements of what turned out to be a popular inference approach of ancestral ranges based on the processes of Dispersal, local Extinction and Cladogenesis (DEC) was proposed. The success of the DEC model lies in its use of a flexible statistical framework known as a Continuous Time Markov Chain and since, several conceptual and computational improvements have been proposed using this as a baseline approach. In the spirit of the original version of DEC, we introduce DEC eXtended (DECX) by accounting for rapid expansion and local extinction as possible anagenetic events on the phylogeny but without increasing model complexity (i.e. in the number of free parameters). Classical vicariance as a cladogenetic event is also incorporated by making use of temporally flexible constraints on the connectivity between any two given areas in accordance with the movement of landmasses and dispersal opportunity over time. DECX is built upon a previous implementation in C/C++ and can analyze phylogenies on the order of several thousand tips in a few minutes. We test our model extensively on Pseudo Observed Datasets and on well-curated and recently published data from various island clades and a worldwide phylogeny of Amphibians (3309 species). We also propose the very first implementation of the DEC model that can specifically account for trees with fossil tips (i.e. non-ultrametric) using the phylogeny of palpimanoid spiders as a case study. In this paper, we argue in favour of the proposed improvements, which have the advantage of being computationally efficient while toeing the line of increased biological realism.
Monthly Archives: February 2016
Evolutionary dynamics of selfish DNA generates pseudo-linguistic features of genomes
Evolutionary dynamics of selfish DNA generates pseudo-linguistic features of genomes
Michael Sheinman, Anna Ramisch, Florian Massip, Peter F. Arndt
(Submitted on 4 Feb 2016)
Since the sequencing of large genomes, many statistical features of their sequences have been found. One intriguing feature is that certain subsequences are much more abundant than others. In fact, abundances of subsequences of a given length are distributed with a scale-free power-law tail, resembling properties of human texts, such as the Zipf’s law. Despite recent efforts, the understanding of this phenomenon is still lacking. Here we find that selfish DNA elements, such as those belonging to the Alu family of repeats, dominate the power-law tail. Interestingly, for the Alu elements the power-law exponent increases with the length of the considered subsequences. Motivated by these observations, we develop a model of selfish DNA expansion. The predictions of this model qualitatively and quantitatively agree with the empirical observations. This allows us to estimate parameters for the process of selfish DNA spreading in a genome during its evolution. The obtained results shed light on how evolution of selfish DNA elements shapes non-trivial statistical properties of genomes.