Greenacre, MichaelUniversitat Pompeu Fabra. Departament d'Economia i Empresa2024-11-142024-11-142023-01-02http://hdl.handle.net/10230/68648The approach to analysing compositional data with a fixed sum constraint has been dominated by the use of logratio transformations, to ensure exact subcompositional coherence and, in some situations, exact isometry as well. A problem with this approach is that data zeros, found in most applications, have to be replaced to permit the logarithmic transformation. A simpler approach is to use the chi-square standardization that is inherent in correspondence analysis. Combined with the Box-Cox power transformation, this standardization defines chi-square distances that tend to logratio distances for strictly positive data as the power parameter tends to zero, and can thus be considered equivalent to transforming to logratios. For data with zeros, a value of the power can be identified that brings the chi-square standardization as close as possible to transforming by logratios, without having to substitute the zeros. Especially in the field of high-dimensional "omics" data, this alternative presents such a high level of coherence and isometry as to be a valid, and much simpler, approach to the analysis of compositional data.application/pdfengL'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative CommonsThe chi-square standardization, combined with Box-Cox transformation, is a valid alternative to transforming to logratios in compositional data analysis<resourceType xmlns="http://datacite.org/schema/kernel-3" resourceTypeGeneral="Other">info:eu-repo/semantics/workingPaper</resourceType><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">box-cox transformation</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">chi-square distance</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">correspondence analysis</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">isometry</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">logratios</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">procrustes analysis</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">subcompositional coherence</subject><subject xmlns="http://datacite.org/schema/kernel-3" subjectScheme="keyword">Statistics, Econometrics and Quantitative Methods</subject><rights xmlns="http://datacite.org/schema/kernel-3">info:eu-repo/semantics/openAccess</rights>