Abstract
Invariant coordinate selection (ICS) is a multivariate data transformation and a dimension reduction method that can be useful in many different contexts. It can be used for outlier detection or cluster identification, and can be seen as an independent component or a non-Gaussian component analysis method. The usual implementation of ICS is based on a joint diagonalization of two scatter matrices, and may be numerically unstable in some ill-conditioned situations. We focus on one-step M-scatter matrices and propose a new implementation of ICS based on a pivoted QR factorization of the centered data set. This factorization avoids the direct computation of the scatter matrices and their inverse and brings numerical stability to the algorithm. Furthermore, the row and column pivoting leads to a rank revealing procedure that allows computation of ICS when the scatter matrices are not full rank. Several artificial and real data sets illustrate the interest of using the new implementation compared to the original one.
Replaces
Aurore Archimbaud, Fériel Boulfani, Xavier Gendre, Klaus Nordhausen, Anne Ruiz-Gazen, and Joni Virta, “ICS for multivariate functional anomaly detection with applications to predictive maintenance and quality control”, TSE Working Paper, n. 21-1182, January 2021, revised March 2022.
Reference
Aurore Archimbaud, Fériel Boulfani, Xavier Gendre, Klaus Nordhausen, Anne Ruiz-Gazen, and Joni Virta, “ICS for multivariate functional anomaly detection with applications to predictive maintenance and quality control”, Econometrics and Statistics, 2024, forthcoming.
Published in
Econometrics and Statistics, 2024, forthcoming