Abstract
Collinearity discovery through diagnostic tools is an important analysis step when performing linear regression. Despite their wide-spread use, collinearity indices such as the variance inflation factor and the condition number have limitations and may not be effective in some applications. In this article, we will contribute to the study of conventional collinearity indices through theoretical and empirical work. We will present mcvis, a new framework that uses resampling techniques to repeatedly learn from these conventional collinearity indices to better understand the causes of collinearity. Our framework is made available in R through the mcvis package which includes new collinearity measures and visualizations, in particular a bipartite plot that informs on the degree and structure of collinearity. Supplementary materials for this article are available online.
Original language | English |
---|---|
Pages (from-to) | 125-132 |
Number of pages | 8 |
Journal | Journal of Computational and Graphical Statistics |
Volume | 30 |
Issue number | 1 |
Early online date | 30 Jul 2020 |
DOIs | |
Publication status | Published - 2021 |
Externally published | Yes |
Keywords
- High correlations
- Multicollinearity
- Resampling
- Variance inflation factor