0

In the context of multi-regression, I am wondering if there is a way to decompose $$VIF_i = 1/(1-R_i^2)$$ where $R_i^2$ is the r squared obtained from the regression of dependent variable = i and independent variables are all other factors.

I want to decompose $VIF_i$ or $R_i^2$ into individual factors to see how much each individual factor contributes to the $VIF_i$ or $R_i^2$

Someone recommended using the square of partial correlation coefficient and that value is linearly related to $R_i^2$. My undestanding is that partial correlation coefficient measures the correlation between two variables, holding the other variables constant. Is this a viable option?

JungleDiff
  • 217
  • 1
  • 4
  • 1
    Decomposing the R-squared (e.g. by looking at squared semi-partial correlations) *and* gain the information you are interested in is difficult. Imagine e.g. a situation with three highly correlated covariables $z_1$, $z_2$, $z_3$. How would you want to decompose the R-squared corresponding to the VIF of $z_1$ in contributions of $z_2$ and $z_3$? – Michael M Aug 29 '18 at 16:27
  • @MichaelM well.. I do not know how to decompose the R-squared corresponding to the VIF of z1 in contributions of z2 and z3 and that's why I posted the question to get an answer. – JungleDiff Aug 29 '18 at 16:47
  • https://datascience.stackexchange.com/a/64415/71442 – Peter Jan 10 '21 at 13:52
  • Similar question here: https://stats.stackexchange.com/questions/400122/decomposing-r2-into-independent-variables – Ralph Winters May 21 '22 at 21:00

1 Answers1

0

I suggest calculating R-squared and VIF for each permutation of variable combinations.

Also consider the fact that variable interactions may influence your R-squared and VIF values. Depending on what programming language you are using there are many ways to build interactions into your regression model, and you can iteratively test for effects from there.

bstrain
  • 401
  • 3
  • 6
  • From my understanding, Cook's Distance is used to identify an influential observation in regression. I am not trying to identify influential points. I want to identify influential "variables" (aka x's) – JungleDiff Sep 04 '18 at 12:06