Welcome to the UPF Digital Repository

A benchmark of transposon insertion detection tools using real data

Show simple item record

dc.contributor.author Vendrell Mir, Pol
dc.contributor.author Barteri, Fabio
dc.contributor.author Merenciano, Miriam
dc.contributor.author González, Josefa
dc.contributor.author Casacuberta, Josep M.
dc.contributor.author Castanera, Raúl
dc.date.accessioned 2020-07-09T06:46:04Z
dc.date.available 2020-07-09T06:46:04Z
dc.date.issued 2019
dc.identifier.citation Vendrell-Mir P, Barteri F, Merenciano M, González J, Casacuberta JM, Castanera R. A benchmark of transposon insertion detection tools using real data. Mob DNA. 2019; 10:53. DOI: 10.1186/s13100-019-0197-9
dc.identifier.issn 1759-8753
dc.identifier.uri http://hdl.handle.net/10230/45092
dc.description.abstract Background: Transposable elements (TEs) are an important source of genomic variability in eukaryotic genomes. Their activity impacts genome architecture and gene expression and can lead to drastic phenotypic changes. Therefore, identifying TE polymorphisms is key to better understand the link between genotype and phenotype. However, most genotype-to-phenotype analyses have concentrated on single nucleotide polymorphisms as they are easier to reliable detect using short-read data. Many bioinformatic tools have been developed to identify transposon insertions from resequencing data using short reads. Nevertheless, the performance of most of these tools has been tested using simulated insertions, which do not accurately reproduce the complexity of natural insertions. Results: We have overcome this limitation by building a dataset of insertions from the comparison of two high-quality rice genomes, followed by extensive manual curation. This dataset contains validated insertions of two very different types of TEs, LTR-retrotransposons and MITEs. Using this dataset, we have benchmarked the sensitivity and precision of 12 commonly used tools, and our results suggest that in general their sensitivity was previously overestimated when using simulated data. Our results also show that, increasing coverage leads to a better sensitivity but with a cost in precision. Moreover, we found important differences in tool performance, with some tools performing better on a specific type of TEs. We have also used two sets of experimentally validated insertions in Drosophila and humans and show that this trend is maintained in genomes of different size and complexity. Conclusions: We discuss the possible choice of tools depending on the goals of the study and show that the appropriate combination of tools could be an option for most approaches, increasing the sensitivity while maintaining a good precision.
dc.description.sponsorship This work was supported in part by grants from the Ministerio de Economia y Competitividad (AGL2016–78992-R). Fabio Barteri and Pol Vendrell hold a FPI (Formación de Personal Investigador) fellowship from the Spanish Ministerio de Economia y Competitividad. Raúl Castanera holds a Juan de la Cierva Postdoctoral fellowship from the Spanish Ministerio de Economia y Competitividad. JG is funded by the European Commission (H2020-ERC-2014-CoG-647900) and the Spanish Ministerio de Ciencia, Innovación y Universidades/AEI/FEDER, EU (BFU2017–82937-P).
dc.format.mimetype application/pdf
dc.language.iso eng
dc.publisher BioMed Central
dc.relation.ispartof Mob DNA. 2019; 10:53
dc.rights © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
dc.rights.uri http://creativecommons.org/licenses/by/4.0/
dc.title A benchmark of transposon insertion detection tools using real data
dc.type info:eu-repo/semantics/article
dc.identifier.doi http://dx.doi.org/10.1186/s13100-019-0197-9
dc.subject.keyword Benchmark
dc.subject.keyword Polymorphism
dc.subject.keyword Resequencing
dc.subject.keyword Transposable elements
dc.subject.keyword Transposon insertion
dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/647900
dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/AGL2016–78992-R
dc.relation.projectID info:eu-repo/grantAgreement/ES/2PE/BFU2017–82937-P
dc.rights.accessRights info:eu-repo/semantics/openAccess
dc.type.version info:eu-repo/semantics/publishedVersion

Thumbnail

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account

Statistics

In collaboration with Compliant to Partaking