Document de travail

Log-Free Divergence and Covariance matrix for Compositional Data I: The Affiine/Barycentric Approach

Olivier Faugeras

Résumé

The presence of zeroes in Compositional Data (CoDa) is a thorny issue for Aitchison’s classical log-ratio analysis. Building upon our previous geometric approach (Faugeras (2023)), we study the full CoDa simplex from the perspective of affine geometry. This view allows to regard CoDa as points (and not vectors), naturally expressed in barycentric coordinates. A decomposition formula for the displacement vector of two CoDa points yields a novel family of barycentric dissimilarity measures. In turn, these barycentric divergences allow to define i) Fréchet means and their variants, ii) isotropic and anisotropic analogues of the Gaussian distribution, and importantly iii) variance and covariance matrices. All together, the new tools introduced in this paper provide a log-free, direct and unified way to deal with the whole CoDa space, exploiting the linear affine structure of CoDa, and effectively handling zeroes. A strikingly related approach based on the projective viewpoint and the exterior product will be studied in the separate companion paper Faugeras (2024a).

Mots-clés

Compositional data; Affine geometry, Barycentric coordinates,; Barycentric divergence; Fréchet mean, Gaussian distribution; Barycentric variation matrix.;

Référence

Olivier Faugeras, « Log-Free Divergence and Covariance matrix for Compositional Data I: The Affiine/Barycentric Approach », TSE Working Paper, n° 24-1580, octobre 2024.

Voir aussi

Publié dans

TSE Working Paper, n° 24-1580, octobre 2024