Sklearn & SHAP

Pre-requirements

pip install shap, xgboost, scikit-learn

Cleaning

import pandas as pd

# label encoder
df['ocean_proximity_encoded'], labels = pd.factorize(df['ocean_proximity'])
df = df.drop("ocean_proximity", axis=1)

# onehot encoder
one_hot_encoded = pd.get_dummies(df['ocean_proximity'])
df = pd.concat([df, one_hot_encoded], axis=1).drop("ocean_proximity", axis=1)

# train_test_split
x_train, x_test, y_train, y_test = train_test_split(df, test_size=0.2, random_state=0)

Modeling

Classification

Regression

Evaluation

SHAP

waterfall
scatter
beeswarm
bar

References

Last updated