Redundans: an assembly pipeline for highly heterozygous genomes
- dc.contributor.author Pryszcz, Leszek Piotr, 1985-
- dc.contributor.author Gabaldón Estevan, Juan Antonio, 1973-
- dc.date.accessioned 2017-01-25T07:42:42Z
- dc.date.available 2017-01-25T07:42:42Z
- dc.date.issued 2016
- dc.description.abstract Many genomes display high levels of heterozygosity (i.e. the presence of different alleles at the same loci in homologous chromosomes), with those of hybrid organisms being an extreme case. The assembly of highly heterozygous genomes from short sequencing reads is a challenging task because it is difficult to accurately recover the different haplotypes. When confronted with highly heterozygous genomes, the standard assembly process tends to collapse homozygous regions and report heterozygous regions in alternative contigs. The boundaries between homozygous and heterozygous regions result in multiple assembly paths that are hard to resolve, which leads to highly fragmented assemblies with a total size larger than expected. This, in turn, causes numerous problems in downstream analyses such as fragmented gene models, wrong gene copy number, or broken synteny. To circumvent these caveats we have developed a pipeline that specifically deals with the assembly of heterozygous genomes by introducing a step to recognise and selectively remove alternative heterozygous contigs. We tested our pipeline on simulated and naturally-occurring heterozygous genomes and compared its accuracy to other existing tools. Our method is freely available at https://github.com/Gabaldonlab/redundans.
- dc.description.sponsorship Spanish Ministry of Economy and Competitiveness grants, ‘Centro de Excelencia Severo Ochoa 2013–2017’ [SEV-2012-0208, BFU2015-67107 to TG group] co-funded by the European Regional Development Fund (ERDF); European Union and ERC Seventh Framework Programme [FP7/2007-2013] under grant agreements [FP7-PEOPLE-2013-ITN-606786 and ERC-2012-StG-310325]; European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie [H2020-MSCA-ITN-2014-642095]; La Caixa-CRG International Fellowship Program (to L.P.P.). Funding for open access charge: ERC Seventh Framework Programme [FP7/2007-2013] under grant agreements [FP7-PEOPLE-2013-ITN-606786 and ERC-2012-StG-310325]; European Union's Horizon 2020 research and innovation programme under the Marie Sklodowska-Curie [H2020-MSCA-ITN-2014-642095]; Spanish Ministry of Economy and Competitiveness grants, ‘Centro de Excelencia Severo Ochoa 2013–2017’ [SEV-2012-0208, BFU2015-67107] co-funded by ERDF; Catalan Research Agency (AGAUR) [SGR857].
- dc.format.mimetype application/pdf
- dc.identifier.citation Pryszcz LP, Gabaldón Estevan JA. Redundans: an assembly pipeline for highly heterozygous genomes. Nucleic Acids Research. 2016;44(12):e113. DOI: 10.1093/nar/gkw294
- dc.identifier.doi http://dx.doi.org/10.1093/nar/gkw294
- dc.identifier.issn 0305-1048
- dc.identifier.uri http://hdl.handle.net/10230/27979
- dc.language.iso eng
- dc.publisher Oxford University Press
- dc.relation.ispartof Nucleic Acids Research. 2016;44(12):e113
- dc.relation.projectID info:eu-repo/grantAgreement/ES/3PN/SEV2012-0208
- dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/BFU2015-67107
- dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/310325
- dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/606786
- dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/642095
- dc.rights © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited.
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.rights.uri http://creativecommons.org/licenses/by-nc/4.0/
- dc.subject.keyword Genome
- dc.subject.keyword Heterozygote
- dc.title Redundans: an assembly pipeline for highly heterozygous genomes
- dc.type info:eu-repo/semantics/article
- dc.type.version info:eu-repo/semantics/publishedVersion