Our paper: A genetic variant near olfactory receptor genes influences cilantro preference

For our next guest post Nick Eriksson (@nkeriks) writes about his ArXived paper with other 23andMe folks: A genetic variant near olfactory receptor genes influences cilantro preference ArXived here

First a little background about research at 23andMe. We have over 150,000 genotyped customers, a large proportion of whom answer surveys online. We run GWAS on pretty much everything trait you can think of (at least everything that is easily reported and possibly related to genetics). Around 2010, we started to ask a couple of questions about cilantro: if people like it, and if they perceive a soapy taste to it.

Fast forward a couple of years, and we have tens of thousands of people answering these questions. We start to see an interesting finding: one SNP significantly associated with both cilantro dislike and perceiving a soapy taste. Best of all, it was in a cluster of olfactory receptor genes.

The sense of smell is pretty cool. Humans have hundreds of olfactory receptor genes that encode G protein-coupled receptors. We perceive smells due to the binding of specific chemicals (“odorants”) to these receptors. There are maybe 1000 total olfactory receptors in various mammalian genomes, but it’s not totally clear which are pseudogenes. There has probably been some loss of these genes in humans as our sense of smell has become less critical. These genes appear in clusters in the genome, which makes it pretty hard for GWAS to pick out a specific gene. For example, in the first 23andMe paper, we identified a variant in a different cluster of olfactory receptors that affected whether you perceive a certain smell in your urine after eating asparagus. However, we still don’t know what the true functional variant in that region is.

Luckily, one of the olfactory receptors near our cilantro SNP turns out to be very well studied. It is known to bind to about 30 different aldehydes, including some of the chemicals that give cilantro its famous odor. So at the core this is a pretty simple paper. We found one significant association; it has as good of a functional story as you’ll see in nearly any GWAS. There are a couple of complications, however. First, we studied two related traits: soapy taste detection and cilantro dislike. They’re relatively correlated (r^2 about 0.33), and they are both associated with the same SNP. It looks like the association is stronger with soapy taste detection (and this trait seemed like it would be less influenced by environment than cilantro dislike), so we used soapy taste as the main phenotype.

The second complicated story is our heritability calculation. We saw about 9% heritability (tagged by the SNPs on our array). However, the confidence interval was pretty huge (-3% to 21%). Roughly, you could think of things falling into three heritability classes: high (height, celiac, type 1 diabetes), medium (type 2 diabetes, Crohn’s) and low (lung, colorectal, and maybe breast cancer). I think that’s about as accurate as the current heritability numbers can get. Our calculation puts cilantro soapy-taste detection into the low heritability group. There is the complication that this is only additive heritability tagged by common SNPs, so this phenotype could actually be very heritable, with most of the action coming from rare variants. But in my opinion, that’s doubtful.

Coming out of mathematics, I’ve always posted my papers to preprint servers. Luckily, this fits in well with 23andMe’s mission of making research faster, more participatory, and more fun. We’ve published all our papers so far in open access journals and have posted a couple of them to Nature Preceedings (before it shut down). I also write everything in LaTeX, so posting to the arXiv is a refreshing change (as compared to most biology journals where you have to undergo a conversion from LaTeX to word that makes everything look terrible (a particular pet peeve of mine with PLOS journals, which I otherwise love)).

I’m very curious to see how posting to the arXiv will affect publicity. Our papers tend to get a fair bit of press. However, I don’t know how the press will deal with one opportunity to report on the paper now (when the results are fresh and novel, but published on a site reporters will mostly not know about) and then another opportunity when the paper gets “blessed” via peer review. Because most of our papers are relatively straightforward GWAS (and we have a lot of coauthors here who have read and written a huge number of such papers), I think getting the data out on a preprint server is particularly important. However, we really need a Genetics category in q-bio!

Feedback on the paper would be most welcome. I’d love to see a replication or a nice functional study to followup, of course. I also think this is a good example for teaching people about genetics. A number of the issues that come up in this paper are a little tricky, but are good examples for understanding the how difficult it is to predict something based on genetics. On the technical side, I’m most curious if there are methods that might give a nice way of analyzing these two correlated traits together. We’ve tried a few regression based approaches for this sort of problem, but haven’t thought of anything entirely satisfactory.

Nick Eriksson

9 thoughts on “Our paper: A genetic variant near olfactory receptor genes influences cilantro preference

  1. a particular pet peeve of mine with PLOS journals, which I otherwise love

    Amen. The final (TeX) version I send in to PLoS of my articles is invariably better formatted than the published version. The other awful thing is how they make you split out each supplementary figure into a separate file, instead of letting you make a single Supplementary Information file with both text and figures. I often feel like I’m spending large amounts of time making my papers look worse.

    • Agreed on both accounts. This is especially pressing as PLoS seems to be cutting back on copy editing etc. I think with our paper Joe, we just told them no and submitted a single SOM, is that right? perhaps they’ve gotten more stringent, or may be that is just the power of the Pritchard.

      • Oh yeah, good point, I’d forgotten about that. I guess I haven’t tried to insist on a single file, maybe I’ll try that next time.

        I once tried to print the Supplementary of the fineSTRUCTURE paper to read on a plane. Absolutely impossible (do they expect people to download 50 files and then put together the pieces like it’s a puzzle?). Luckily the authors had a preprint on their website in a readable format!

  2. Seems like this variant/paper provides quite a nice example for discussing genetics in class room setting. The paper’s intro also has some fun historical quotes showing hatred/love of cilantro over the ages.

    You have an easy to assay phenotype, “do you like cilantro”, which can then be correlated with a more specific phenotype “does cilantro taste soapy to you”. By a simple show of hands you could assess the correlation between the phenotypes, and then talk through the correlation between the phenotypes and the genotype at this genetic locus.

    Also thinking for my evolution class that we could talk through what would happen if cilantro turned poisonous tomorrow [with no one noticing, e.g. 5% of folks die from choosing to eat it]. Some folks would be protected by not liking cilantro, but only a small subset due carrying the soapy taste allele. We could then work through the series of correlations to calculate the small change in allele frequency that would occur at this locus [I’d likely ignore the issue that this is not the causal variant and only in LD with it for the ease of discussion]. I usually try and kill off at least 50% of my class in describing some thought experiment to them. Sadly due to my love of all things cilantro I might perish in this one.

    While this is still fresh in my mind [bad pun intended], I might work through these numbers and put them up here and on my teaching site.

  3. It is really suggestive, at least to me with my probably well-known bias, that rs72921001 is right in the middle of a LINE-L2 transposable element which we have observed to be commonly used as promoters and enhancers for nearby genes. Perhaps rs72921001 is located in a regulatory element.

    • Nick, I’m guessing that you automatically check for whether the GWAS SNP is an known eQTLs? Perhaps this is tricky for olfactory genes, because of the paralogy and because the tissue specificity is likely a problem.

  4. I have a paper in press at a PLoS journal that was submitted in LaTeX, and I never had to convert it to word. Maybe things have changed. I agree, however, about their policy on supplemental files.

  5. Pingback: Most viewed on Haldane’s Sieve: August-September 2012 | Haldane's Sieve

  6. Pingback: Haldane’s Sieve sifts through 2012 | Haldane's Sieve

Leave a comment