2024-07-23T06:37:55Z
2023
We propose an estimation procedure for covariation in wide compositional data sets. For compositions, widely used logratio variables are interdependent because they share a common reference. Logratio uncorrelated compositions are linearly independent before the unit-sum constraint is imposed. We show how they can be used to construct bespoke shrinkage targets for logratio covariance matrices, and we test a simple procedure for partial correlation estimates on both a simulated and a single-cell gene expression data set. For the underlying counts, different zero imputations are evaluated. The partial correlation induced by the closure is derived analytically. Data and code are available from GitHub.
Article
Published version
English
Compositional covariance structure; Logratio analysis; Partial correlation; James-Stein shrinkage
Statistical Institute of Catalonia
SORT-Statistics and Operations Research Transactions. 2023;47(2):245-68
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (http://creativecommons.org/licenses/by-nc-nd/4.0).
http://creativecommons.org/licenses/by-nc-nd/4.0/