Random selection of factors preserves the correlation structure in a linear factor model to a high degree

Published in PloS one 13 (12), 2018

Recommended citation: Tanskanen, A.J., Lukkarinen, J., Vatanen, K. (2007). "Random selection of factors preserves the correlation structure in a linear factor model to a high degree" PloS one 13 (12) https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0206551

In a very high-dimensional vector space, two randomly-chosen vectors are almost orthogonal with high probability. Starting from this observation, we develop a statistical factor model, the random factor model, in which factors are chosen stochastically based on the random projection method. Randomness of factors has the consequence that correlation and covariance matrices are well preserved in a linear factor representation. It also enables derivation of probabilistic bounds for the accuracy of the random factor representation of time-series, their cross-correlations and covariances. As an application, we analyze reproduction of time-series and their cross-correlation coefficients in the well-diversified Russell 3,000 equity index.

Recommended citation: Tanskanen, A.J., Lukkarinen, J., Vatanen, K. (2007). “Random selection of factors preserves the correlation structure in a linear factor model to a high degree” PloS one 13 (12)