A novel method to model read counts in genomic data to reduce false positive identification of heterozygotes

Steven H Wu, Rachel S Schwartz, David J Winter, Don Conrad, Reed A Cartwright

Natural selection reduces linked neutral divergence between distantly related species

Tanya Phung, Christian Huber, Kirk Lohmueller

A new model of human dispersal

Trevor G Underwood

Detecting Heterogeneity in Population Structure Across the Genome in Admixed Populations

Caitlin McHugh, Timothy A Thornton, Lisa Brown

Strong Selection is Necessary for Evolution of Blindness in Cave Dwellers

Rachel S Schwartz, Alexandra L. Merry, Megan M. Howell, Reed Cartwright

Marker-based estimates reveal significant non-additive effects in clonally propagated cassava (Manihot esculenta): implications for the prediction of total genetic value and the selection of varieties

Marnin Wolfe, Peter Kulakow, Ismail Y Rabbi, Jean-Luc Jannink

Population genetic analyses of metagenomes reveal extensive strain-level variation in prevalent human-associated bacteria

Stephen Nayfach, Katherine S Pollard
doi: http://dx.doi.org/10.1101/031757

Deep sequencing has the potential to shed light on the functional and phylogenetic heterogeneity of microbial populations in the environment. Here we present PhyloCNV, an integrated computational pipeline for quantifying species abundance and strain-level genomic variation from shotgun metagenomes. Our method leverages a comprehensive database of >30,000 reference genomes, which we accurately clustered into species groups using a panel of universal single-copy genes. Given a shotgun metagenome, PhyloCNV rapidly and automatically identifies gene copy number variants and single-nucleotide variants present in abundant bacterial species. We applied PhyloCNV to >500 faecal metagenomes from the United States, Europe, China, Peru, and Tanzania and present the first global analysis of strain-level variation and biogeography in the human gut microbiome. On average, strain-level nucleotide diversity is 8.5x greater between different individuals than within individuals, with elevated strain-level diversity in hosts from Peru and Tanzania who live rural lifestyles. For many, but not all, common gut species, a significant proportion of inter-sample strain-level genetic diversity is explained by host geography. Eubacterium rectale, for example, has a highly structured population that tracks with host country, while strains of Bacteroides uniformis and other species are structured independently of their hosts. Finally, we discovered that the gene content of some bacterial strains diverges at short evolutionary timescales during which few nucleotide variants accumulate. These findings shed light on the recent evolutionary history of microbes in the human gut and highlight the extensive differences in the gene content of closely related bacterial strains. PhyloCNV is freely available at: https://github.com/snayfach/PhyloCNV.

Evolutionary analysis across mammals reveals distinct classes of long noncoding RNAs

Jenny Chen, Alexander A. Shishkin, Xiaopeng Zhu, Sabah Kadri, Itay Maza, Jacob H Hanna, Aviv Regev, Manuel Garber

An evaluation of transcriptome-based exon capture for frog phylogenomics across multiple scales of divergence (Class: Amphibia, Order: Anura)

Daniel Portik, Lydia Smith, Ke Bi

Rapid Genotype Refinement for Whole-Genome Sequencing Data using Multi-Variate Normal Distributions

Rudy Arthur, Jared O’Connell, Ole Schulz-Trieglaff, Anthony J Cox