The PermutationImportance
object has some nice attributes such as feature_importances_
and feature_importances_std_
.
To visualize in an HTML style this attributes I used eli5.show_weights
function. However, I noticed that the displayed standard deviation does not agree with the values in feature_importances_std_
.
More specifically, I can see that the displayed HTML values are equal to feature_importances_std_ * 2
. Why is that ?
Code:
from sklearn import datasets
import eli5
from eli5.sklearn import PermutationImportance
from sklearn.svm import SVC, SVR
# import some data to play with
iris = datasets.load_iris()
X = iris.data[:, :2] # we only take the first two features.
y = iris.target
clf = SVC()
perms = PermutationImportance(clf, n_iter=1000, cv=10, random_state=0).fit(X, y)
print(perms.feature_importances_)
# this is the actual SD
print(perms.feature_importances_std_)
# These are the displayed values
print(perms.feature_importances_std_* 2)
[0.39527333 0.17178 ] # the actual mean
[0.13927548 0.11061278] # the actual SD
[0.27855095 0.22122556] # the displayed values by `show_weights()`
eli5.show_weights(perms)
We can see that the diplayed standard deviation is doupled i.e. 2 * perms.feature_importances_std_
.
Is this a bug maybe?
Found the *2
:
It's in the template generating the feature importances html table in the following page
{% if not fw.std is none %}
± {{ "%0.4f"|format(2 * fw.std) }}
{% endif %}
It's clearly put by hand