I just read on several blogs something at the form: Variable Importance using permutation will lead to a bias if the variables exhibit correlation. It is for instance stated by https://blog.methodsconsultants.com/posts/be-aware-of-bias-in-rf-variable-importance-metrics/ that
"The mean decrease in impurity and permutation importance computed from random forest models spread importance across collinear variables. For example, if you duplicate a feature and re-evaluate importance, the duplicated feature pulls down the importance of the original, so they are close to equal in importance."
In the article by Strobl et. al (2008): https://bmcbioinformatics.biomedcentral.com/articles/10.1186/1471-2105-9-307 it is argued that correlated variables will show too high variable importance, where it is stated
"We know that the original permutation importance overestimates the importance of correlated predictor variables."
Furthermore, it is described in https://scikit-learn.org/stable/modules/permutation_importance.html that
"When two features are correlated and one of the features is permuted, the model will still have access to the feature through its correlated feature. This will result in a lower importance value for both features, where they might actually be important."
The three quotes seem rather contradicting. The second quote states that correlated variable will show too high variable importance, where the third states that the variable importance will be too low. The quote agrees with this.
Do anyone know what is true? Is the variable importance overestimated or underestimated when variables are correlated?



