Alignathon: a competitive assessment of whole-genome alignment methods

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Earl, Dent
  • dc.contributor.author Kemena, Carsten, 1983-
  • dc.contributor.author Chang, Jia-Ming, 1978-
  • dc.contributor.author Erb, Ionas
  • dc.contributor.author Notredame, Cedric
  • dc.contributor.author Paten, Benedict
  • dc.date.accessioned 2023-12-18T06:51:41Z
  • dc.date.available 2023-12-18T06:51:41Z
  • dc.date.issued 2014
  • dc.description.abstract Multiple sequence alignments (MSAs) are a prerequisite for a wide variety of evolutionary analyses. Published assessments and benchmark data sets for protein and, to a lesser extent, global nucleotide MSAs are available, but less effort has been made to establish benchmarks in the more general problem of whole-genome alignment (WGA). Using the same model as the successful Assemblathon competitions, we organized a competitive evaluation in which teams submitted their alignments and then assessments were performed collectively after all the submissions were received. Three data sets were used: Two were simulated and based on primate and mammalian phylogenies, and one was comprised of 20 real fly genomes. In total, 35 submissions were assessed, submitted by 10 teams using 12 different alignment pipelines. We found agreement between independent simulation-based and statistical assessments, indicating that there are substantial accuracy differences between contemporary alignment tools. We saw considerable differences in the alignment quality of differently annotated regions and found that few tools aligned the duplications analyzed. We found that many tools worked well at shorter evolutionary distances, but fewer performed competitively at longer distances. We provide all data sets, submissions, and assessment programs for further study and provide, as a resource for future benchmarking, a convenient repository of code and data for reproducing the simulation assessments.
  • dc.description.sponsorship We would like to thank the Howard Hughes Medical Institute, Dr. and Mrs. Gordon Ringold, the NIH (grant 2U41 HG002371-13), and the NHGRI/NIH (grant 5U01HG004695) for providing funding. We would like to thank the Genome 10K organizers for providing a venue to discuss an early version of these findings. J.M. is supported by NIH grant R21HG006464, NIH grant R01HG007352, NSF grant 1054309, and NSF grant 1262575. M.H. is supported by NIH grant R15HG005913. C.K., J.-M.C., I.E., and C.N. were supported by the Spanish Ministry of Economy and Competitiveness (grant no. BFU2011-28575); the Secretariat of Universities and Research, Department of Economy and Knowledge of the Government of Catalonia (2009 SGR 951); the ‘‘la Caixa’’ International Fellowship Programme for a predoctoral fellowship at the CRG (to J.-M.C.); the European Research Council (ERC-2008-AdG no. 232947 to J.-M.C.); and the Spanish Ministry of Economy and Competitiveness, ‘‘Centro de Excelencia Severo Ochoa 2013-2017,’’ SEV-2012-0208. This work was supported by the Wellcome Trust (grant no. WT095908) and the European Molecular Biology Laboratory. The research leading to these results has received funding from the European Community’s Seventh Framework Programme (FP7/2007-2013) under grant agreement no. 222664 (‘‘Quantomics’’). This publication reflects only the authors’ views and the European Community is not liable for any use that may be made of the information contained herein. We thank three anonymous referees who provided a great deal of helpful feedback and suggestions.
  • dc.format.mimetype application/pdf
  • dc.identifier.citation Earl D, Nguyen N, Hickey G, Harris RS, Fitzgerald S, Beal K, et al. Alignathon: a competitive assessment of whole-genome alignment methods. Genome Res. 2014 Oct 1;24(12):2077-89. DOI: 10.1101/gr.174920.114
  • dc.identifier.doi http://dx.doi.org/10.1101/gr.174920.114
  • dc.identifier.issn 1088-9051
  • dc.identifier.uri http://hdl.handle.net/10230/58545
  • dc.language.iso eng
  • dc.publisher Cold Spring Harbor Laboratory Press (CSHL Press)
  • dc.relation.ispartof Genome Research. 2014 Oct 1;24(12):2077-89
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/232947
  • dc.relation.projectID info:eu-repo/grantAgreement/ES/3PN/BFU2011-28575
  • dc.relation.projectID info:eu-repo/grantAgreement/ES/1PE/SEV-2012-0208
  • dc.relation.projectID info:eu-repo/grantAgreement/EC/FP7/222664
  • dc.rights © 2014 Earl et al.; Published by Cold Spring Harbor Laboratory Press. This article, published in Genome Research, is available under a Creative Commons License (Attribution 4.0 International), as described at http://creativecommons.org/licenses/by/4.0.
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by/4.0
  • dc.subject.other Genomes
  • dc.subject.other Nucleòtids
  • dc.subject.other Proteïnes
  • dc.title Alignathon: a competitive assessment of whole-genome alignment methods
  • dc.type info:eu-repo/semantics/article
  • dc.type.version info:eu-repo/semantics/publishedVersion