Understanding sequencing data as compositions: an outlook and review
Mostra el registre complet Registre parcial de l'ítem
- dc.contributor.author Quinn, Thomas P.
- dc.contributor.author Erb, Ionas
- dc.contributor.author Richardson, Mark F.
- dc.contributor.author Crowley, Tamsyn M.
- dc.date.accessioned 2019-05-24T07:43:23Z
- dc.date.available 2019-05-24T07:43:23Z
- dc.date.issued 2018
- dc.description.abstract Motivation: Although seldom acknowledged explicitly, count data generated by sequencing platforms exist as compositions for which the abundance of each component (e.g. gene or transcript) is only coherently interpretable relative to other components within that sample. This property arises from the assay technology itself, whereby the number of counts recorded for each sample is constrained by an arbitrary total sum (i.e. library size). Consequently, sequencing data, as compositional data, exist in a non-Euclidean space that, without normalization or transformation, renders invalid many conventional analyses, including distance measures, correlation coefficients and multivariate statistical models. Results: The purpose of this review is to summarize the principles of compositional data analysis (CoDA), provide evidence for why sequencing data are compositional, discuss compositionally valid methods available for analyzing sequencing data, and highlight future directions with regard to this field of study. Supplementary information: Supplementary data are available at Bioinformatics online.
- dc.format.mimetype application/pdf
- dc.identifier.citation Quinn TP, Erb I, Richardson MF, Crowley TM. Understanding sequencing data as compositions: an outlook and review. Bioinformatics. 2018; 34(16):2870-2878. DOI 10.1093/bioinformatics/bty175
- dc.identifier.doi http://dx.doi.org/10.1093/bioinformatics/bty175
- dc.identifier.issn 1367-4803
- dc.identifier.uri http://hdl.handle.net/10230/37289
- dc.language.iso eng
- dc.publisher Oxford University Press
- dc.relation.ispartof Bioinformatics. 2018; 34(16):2870-2878
- dc.rights © The Author(s) 2018. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
- dc.rights.accessRights info:eu-repo/semantics/openAccess
- dc.rights.uri http://creativecommons.org/licenses/by-nc/4.0/
- dc.title Understanding sequencing data as compositions: an outlook and review
- dc.type info:eu-repo/semantics/article
- dc.type.version info:eu-repo/semantics/publishedVersion