No Evidence for Phylostratigraphic Bias Impacting Inferences on Patterns of Gene Emergence and Evolution

dc.contributor.authorDomazet-Lošo, Tomislavca
dc.contributor.authorCarvunis, Anne-Ruxandraca
dc.contributor.authorAlbà Soler, Marca
dc.contributor.authorŠestak, Martin Sebastijanca
dc.contributor.authorBakarić, Robertca
dc.contributor.authorNeme, Rafikca
dc.contributor.authorTautz, Diethardca
dc.date.accessioned2017-05-23T08:06:07Z
dc.date.available2017-05-23T08:06:07Z
dc.date.issued2017
dc.description.abstractPhylostratigraphy is a computational framework for dating the emergence of DNA and protein sequences in a phylogeny. It has been extensively applied to make inferences on patterns of genome evolution, including patterns of disease gene evolution, ontogeny and de novo gene origination. Phylostratigraphy typically relies on BLAST searches along a species tree, but new simulation studies have raised concerns about the ability of BLAST to detect remote homologues and its impact on phylostratigraphic inferences. Here, we re-assessed these simulations. We found that, even with a possible overall BLAST false negative rate between 11–15%, the large majority of sequences assigned to a recent evolutionary origin by phylostratigraphy is unaffected by technical concerns about BLAST. Where the results of the simulations did cast doubt on previously reported findings, we repeated the original analyses but now excluded all questionable sequences. The originally described patterns remained essentially unchanged. These new analyses strongly support phylostratigraphic inferences, including: genes that emerged after the origin of eukaryotes are more likely to be expressed in the ectoderm than in the endoderm or mesoderm in Drosophila, and the de novo emergence of protein-coding genes from non-genic sequences occurs through proto-gene intermediates in yeast. We conclude that BLAST is an appropriate and sufficiently sensitive tool in phylostratigraphic analysis that does not appear to introduce significant biases into evolutionary pattern inferences.
dc.description.sponsorshipWe thank the following funding organizations for support of our work: TD-L: City of Zagreb and Adris Foundation grants; A-RC: National Institute of Health (NIH) grant K99 GM108865; MA: grant BFU2015-65235-P from MINECO/FEDER, EU; DT: ERC grant NewGenes, 322564.
dc.format.mimetypeapplication/pdfca
dc.identifier.citationDomazet-Lošo T, Carvunis A-R, Albà Soler M, Šestak MS, Bakarić R, Neme R, Tautz D. No Evidence for Phylostratigraphic Bias Impacting Inferences on Patterns of Gene Emergence and Evolution. Molecular Biology and Evolution. 2017;34(4):843-56. DOI: 10.1093/molbev/msw284
dc.identifier.doihttp://dx.doi.org/10.1093/molbev/msw284
dc.identifier.issn0737-4038
dc.identifier.urihttp://hdl.handle.net/10230/32144
dc.language.isoeng
dc.publisherOxford University Pressca
dc.relation.ispartofMolecular Biology and Evolution. 2017;34(4):843-56
dc.relation.projectIDinfo:eu-repo/grantAgreement/ES/1PE/BFU2015-65235-P
dc.rights© The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
dc.rights.accessRightsinfo:eu-repo/semantics/openAccess
dc.rights.urihttp://creativecommons.org/licenses/by-nc/4.0/
dc.subject.keywordGenome analysis
dc.subject.keywordPhylostratigraphy
dc.subject.keywordBLAST
dc.subject.keywordGene age estimation
dc.titleNo Evidence for Phylostratigraphic Bias Impacting Inferences on Patterns of Gene Emergence and Evolutionca
dc.typeinfo:eu-repo/semantics/article
dc.type.versioninfo:eu-repo/semantics/publishedVersion

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Alba_molbioevo_noev.pdf
Size:
1.16 MB
Format:
Adobe Portable Document Format

License

Rights