Diffusion Models for Image Regression Counterfactuals: Bridging Sparsity and Quality

Models & Research

The Engineer

28 Mar 2025 · 3 min read

Researchers Trung Duc Ha and Sidney Bender introduce diffusion models for generating high-quality, sparse counterfactuals in image regression tasks, expanding the use of these techniques beyond classification.

In a recent paper titled "Diffusion Counterfactuals for Image Regressors," Trung Duc Ha and Sidney Bender explore the application of diffusion models to generate counterfactual explanations for image regression tasks. This work is significant because while counterfactuals have been widely used in classification, their use in regression has remained relatively underexplored. The authors propose two methods: one based on a Denoising Diffusion Probabilistic Model (DDPM) operating directly in pixel space and another using a Diffusion Autoencoder (DAE) in latent space. Both methods aim to produce realistic, semantic, and smooth counterfactuals that provide interpretable insights into the decision-making process of regression models.

Why It Matters

Counterfactual explanations help us understand why a model made a particular prediction by showing how small changes in input can alter the output. For image regression tasks, this is particularly useful for identifying spurious correlations and ensuring the model's decisions are robust and interpretable. The authors' methods address key challenges such as sparsity (minimal changes needed to affect predictions) and quality (realism of generated images).

Key Technical Details

Pixel Space Method (DDPM):
- Architecture: Uses a DDPM that denoises the image step-by-step, starting from random noise.
- Training: The model is trained to predict the noise added at each step, gradually refining the image.
- Counterfactual Generation: For a given input image and target prediction, the model generates a counterfactual by iteratively applying denoising steps while guiding the output towards the desired regression value.
- Advantages: Produces sparse counterfactuals, meaning fewer changes are needed to achieve the target prediction.
Latent Space Method (DAE):
- Architecture: Combines a Variational Autoencoder (VAE) with a DDPM. The VAE encodes images into a latent space, and the DDPM operates in this space.
- Training: The VAE is trained to encode and decode images, while the DDPM learns to denoise in the latent space.
- Counterfactual Generation: For an input image and target prediction, the model first encodes the image into the latent space. It then generates a counterfactual by applying denoising steps in the latent space and decoding back to the pixel space.
- Advantages: Produces higher quality counterfactuals with larger semantic changes, making it more suitable for significant shifts in predicted values.

Benchmarks and Results

Datasets:
- CelebA-HQ: A high-resolution dataset of celebrity faces.
- Synthetic Dataset: Custom-generated images to test specific scenarios.
Findings:
- Both methods produce realistic and interpretable counterfactuals, but they differ in their sparsity and quality:
  - Pixel Space (DDPM): Generates more sparse counterfactuals, requiring fewer changes to achieve the target prediction. This is useful for understanding small, local perturbations.
  - Latent Space (DAE): Produces higher quality images with larger semantic changes, making it better suited for significant shifts in predicted values. However, this comes at the cost of increased complexity and computational requirements.
Challenges:
- For regression counterfactuals, large semantic changes are often necessary to achieve significant shifts in predicted values. This makes finding sparse counterfactuals more challenging compared to classification tasks.
- The region of the predicted value influences the required feature changes, adding another layer of complexity to the generation process.

Implications

The methods proposed by Ha and Bender open new avenues for understanding and improving image regression models. By providing interpretable insights into model decisions, these counterfactuals can help identify and mitigate spurious correlations, leading to more robust and fair models. The trade-offs between sparsity and quality in the two methods offer practitioners flexibility in choosing the most appropriate approach based on their specific needs.