Pixel classification explained with SHAP

Pixel classification explained with SHAP#

SHapley Additive exPlanations (SHAP) is a technique for visualizing how, for example random forest classifiers work. In this example we use a random forest classifier for pixel classification.

Generating a feature stack#

Pixel classifiers such as the random forest classifier takes multiple images as input. We typically call these images a feature stack because for every pixel exist now multiple values (features). In the following example we create a feature stack containing three features:

The original pixel value
The pixel value after a Gaussian blur
The pixel value of the Gaussian blurred image processed through a Sobel operator.

Thus, we denoise the image and detect edges. All three images serve the pixel classifier to differentiate positive and negative pixels.

feature_names = ["original", "top_hat(10)", "gaussian_sobel(1)", "random"]

feature_stack = generate_feature_stack(image, feature_names)
feature_stack.shape

(4, 60, 60)

visualize_image_list(feature_stack, feature_names)

../_images/2ffcbacabec1cef2911b996cad6e3ea576e317ba24349a06b2e1de478d059db4.png

Formatting data#

We now need to format the input data so that it fits to what scikit learn expects. Scikit-learn asks for an array of shape (n, m) as input data and (n) annotations. n corresponds to number of pixels and m to number of features. In our case m = 3.

X, y = format_data(feature_stack, annotation)

print("input shape", X.shape)
print("annotation shape", y.shape)

input shape (969, 4)
annotation shape (969,)

Training the random forest classifier#

We now train the random forest classifier by providing the feature stack X and the annotations y.

classifier = RandomForestClassifier(max_depth=2, random_state=0)
classifier.fit(X, y)

RandomForestClassifier(max_depth=2, random_state=0)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

Predicting pixel classes#

After the classifier has been trained, we can use it to predict pixel classes for whole images. Note in the following code, we provide feature_stack.T which are more pixels then X in the commands above, because it also contains the pixels which were not annotated before.

prediction = classifier.predict(np.asarray([f.ravel() for f in feature_stack]).T).reshape(image.shape)
stackview.animate_curtain(image, prediction, zoom_factor=4)

SHAP#

SHAP analysis allows us to visualize to what degree features contribute to decisions the classifier makes.

def visualize_shap(classifier, feature_names, target_class=-1):
    import shap
    
    # Create SHAP explainer
    explainer = shap.TreeExplainer(classifier)
    
    # Calculate SHAP values 
    shap_values = explainer.shap_values(X)[...,target_class]

    # Create a new figure with larger size for better visibility
    plt.figure(figsize=(40, 8))
    
    # Create SHAP summary plot with feature names
    shap.summary_plot(shap_values, X, feature_names=feature_names, show=False)
    
    # Style plot and show it 
    #plt.title('SHAP Feature Importance and Impact', pad=20)
    plt.xlabel("SHAP value")
    plt.tight_layout()

visualize_shap(classifier, feature_names)

../_images/3deb005941a01dcbcb649c666bd4c76c4c25ba2542ca504addbc7c524a70f44a.png

The plot above can be read like:

The top-hat filtered image is the most crucial for the segmentation of the objects. If top-hat filtered pixel values are high, the classifier sees the pixel as positive (red).
The original and the Gaussian-blurred image contribute to the decision as well, but not as prominently because the SHAP values are closer to 0.
The random image does not contribute to the classification.

To interpret the plot above more easily, we show the feature images again:

visualize_image_list(feature_stack, feature_names)

Beware of correlation#

We will execute the same procedure again, but this time with strongly correlating features.

feature_names = ["original"] + [f"top_hat({r})" for r in range(6, 14, 2)] + ["gaussian_sobel(1)"]

feature_stack = generate_feature_stack(image, feature_names)
visualize_image_list(feature_stack, feature_names)

../_images/7505525aa3ac889ec9d309a59b1af463e6628302151176d5b5c2267a6b0b53d3.png

X, y = format_data(feature_stack, annotation)

classifier = RandomForestClassifier(max_depth=2, random_state=0)
classifier.fit(X, y)

RandomForestClassifier(max_depth=2, random_state=0)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

prediction = classifier.predict(np.asarray([f.ravel() for f in feature_stack]).T).reshape(image.shape)
stackview.animate_curtain(image, prediction, zoom_factor=4)

In this shap plot it seems the Gaussian blurred image and the original are less useful compared to the SHAP plot above. However, the strongly correlating top-hat features might mislead our perception.

visualize_shap(classifier, feature_names)

../_images/1e52e7d5045bcac5b03fa1dc44188d3694d5f3e3da4cb8dd50bb9752305f5049.png

import seaborn as sns

# Create DataFrame
df = pd.DataFrame(X, columns=feature_names)

# Calculate correlation matrix
correlation_matrix = df.corr()

# Create heatmap
plt.figure(figsize=(5, 5))
sns.heatmap(correlation_matrix, 
            cmap='PRGn',  # Purple-Green diverging colormap
            center=0,     # Center the colormap at 0
            vmin=-1,      # Set minimum value
            vmax=1,       # Set maximum value
            annot=False,   # Show correlation values
            fmt='.2f')    # Format numbers to 2 decimal places
plt.title('Feature Correlation Matrix') 
plt.tight_layout()
plt.show()

../_images/c5fab2f2d384295170d6682444d09e9246f6d0a83b3e81ec25e146428ee4f3a8.png

The SHAP values are defined for all classes. In case of a binary classification, the two SHAP plots show oppsing values. Hence, showing one is enough. For completeness, here we see the two SHAP plots. The first is for predicing the class 0 (blue) and the second for class 1 (orange).

visualize_shap(classifier, feature_names, target_class=0)

../_images/d82220608165b5eae634a99b5789416a63ceb9417a603265cb3a87566941e8a7.png

visualize_shap(classifier, feature_names, target_class=1)

../_images/c93b16507a444494852c208817ed5bc64cd816460da91660b4420e8362c829fa.png

Exercise#

Interpret the features in this SHAP plot.

feature_names = ["original", "gaussian(1)", "laplace", "gaussian_laplace(1)"]

feature_stack = generate_feature_stack(image, feature_names)
visualize_image_list(feature_stack, feature_names)

X, y = format_data(feature_stack, annotation)

classifier = RandomForestClassifier(max_depth=2, random_state=0)
classifier.fit(X, y)

visualize_shap(classifier, feature_names, target_class=0)

../_images/1fe44acee86fc909f47f33b334bfd097df1418ffa4375368c3fc28111bbce30e.png

../_images/5e5ad345c3c6773e708d42147a349291351003ad3b712455453e411a69fda373.png

Exercise#

Execute the procedure demonstrated above to segment the edges of the objects in the image.

Pixel classification explained with SHAP

Contents

Pixel classification explained with SHAP#

Generating a feature stack#

Formatting data#

Training the random forest classifier#

Predicting pixel classes#

SHAP#

Beware of correlation#

Exercise#

Exercise#