Towards a pragmatic approach to compositional data analysis

Mostra el registre complet Registre parcial de l'ítem

  • dc.contributor.author Greenacre, Michael
  • dc.contributor.other Universitat Pompeu Fabra. Departament d'Economia i Empresa
  • dc.date.accessioned 2020-05-25T09:26:59Z
  • dc.date.available 2020-05-25T09:26:59Z
  • dc.date.issued 2017-01-01
  • dc.date.modified 2017-07-23T02:18:03Z
  • dc.description.abstract Compositional data are nonnegative data with the property of closure: that is, each set of values on their components, or so-called parts, has a fixed sum, usually 1 or 100%. The approach to compositional data analysis originated by John Aitchison uses ratios of parts as the fundamental starting point for description and modeling. I show that a compositional data set can be effectively replaced by a set of ratios, one less than the number of parts, and that these ratios describe an acyclic connected graph of all the parts. Contrary to recent literature, I show that the additive log-ratio transformation can be an excellent substitute for the original data set, as shown in an archaeological data set as well as in three other examples. I propose further that a smaller set of ratios of parts can be determined, either by expert choice or by automatic selection, which explains as much variance as required for all practical purposes. These part ratios can then be validly summarized and analyzed by conventional univariate methods, as well as multivariate methods, where the ratios are preferably log-transformed.
  • dc.format.mimetype application/pdf*
  • dc.identifier https://econ-papers.upf.edu/ca/paper.php?id=1554
  • dc.identifier.citation
  • dc.identifier.uri http://hdl.handle.net/10230/44738
  • dc.language.iso eng
  • dc.relation.ispartofseries Economics and Business Working Papers Series; 1554
  • dc.rights L'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons
  • dc.rights.accessRights info:eu-repo/semantics/openAccess
  • dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/3.0/es/
  • dc.subject.keyword compositional data
  • dc.subject.keyword log-ratio transformation
  • dc.subject.keyword log-ratio analysis
  • dc.subject.keyword log-ratio distance
  • dc.subject.keyword multivariate analysis
  • dc.subject.keyword ratios
  • dc.subject.keyword subcompositional coherence
  • dc.subject.keyword univariate statistics.
  • dc.subject.keyword Statistics, Econometrics and Quantitative Methods
  • dc.title Towards a pragmatic approach to compositional data analysis
  • dc.title.alternative
  • dc.type info:eu-repo/semantics/workingPaper