dc.contributor
Universitat Pompeu Fabra. Departament d'Economia i Empresa
dc.contributor.author
Greenacre, Michael
dc.date.issued
2017-07-26T10:50:51Z
dc.date.issued
2017-07-26T10:50:51Z
dc.date.issued
2016-08-01
dc.date.issued
2017-07-23T02:18:00Z
dc.identifier
https://econ-papers.upf.edu/ca/paper.php?id=1551
dc.identifier
http://hdl.handle.net/10230/32637
dc.description.abstract
Compositional data are nonnegative data with the property of closure: that is, each set
of values on their components, or so-called parts, has a fixed sum, usually 1 or 100%.
Compositional data cannot be analyzed by conventional statistical methods, since the value of
any part depends on the choice of the other parts of the composition of interest. For example,
reporting the mean and standard deviation of a specific part makes no sense, neither does the
correlation between two parts. I propose that a small set of ratios of parts can be determined,
either by expert choice or by automatic selection, which effectively replaces the compositional
data set. This set can be determined to explain 100% of the variance in the compositional data,
or as close to 100% as required. These part ratios can then be validly summarized and analyzed
by conventional univariate methods, as well as multivariate methods, where the ratios are
preferably log-transformed.
dc.format
application/pdf
dc.format
application/pdf
dc.relation
Economics and Business Working Papers Series; 1551
dc.rights
L'accés als continguts d'aquest document queda condicionat a l'acceptació de les condicions d'ús establertes per la següent llicència Creative Commons
dc.rights
http://creativecommons.org/licenses/by-nc-nd/3.0/es/
dc.rights
info:eu-repo/semantics/openAccess
dc.subject
compositional data
dc.subject
logarithmic transformation
dc.subject
log-ratio analysis
dc.subject
multivariate analysis
dc.subject
univariate statistics.
dc.title
Selection and statistical analysis of compositional ratios
dc.type
info:eu-repo/semantics/workingPaper