
Share
Researchers Trung Duc Ha and Sidney Bender introduce diffusion models for generating high-quality, sparse counterfactuals in image regression tasks, expanding the use of these techniques beyond classification.
In a recent paper titled "Diffusion Counterfactuals for Image Regressors," Trung Duc Ha and Sidney Bender explore the application of diffusion models to generate counterfactual explanations for image regression tasks. This work is significant because while counterfactuals have been widely used in classification, their use in regression has remained relatively underexplored. The authors propose two methods: one based on a Denoising Diffusion Probabilistic Model (DDPM) operating directly in pixel space and another using a Diffusion Autoencoder (DAE) in latent space. Both methods aim to produce realistic, semantic, and smooth counterfactuals that provide interpretable insights into the decision-making process of regression models.
Counterfactual explanations help us understand why a model made a particular prediction by showing how small changes in input can alter the output. For image regression tasks, this is particularly useful for identifying spurious correlations and ensuring the model's decisions are robust and interpretable. The authors' methods address key challenges such as sparsity (minimal changes needed to affect predictions) and quality (realism of generated images).
Pixel Space Method (DDPM):
Latent Space Method (DAE):

Datasets:
Findings:
Challenges:
The methods proposed by Ha and Bender open new avenues for understanding and improving image regression models. By providing interpretable insights into model decisions, these counterfactuals can help identify and mitigate spurious correlations, leading to more robust and fair models. The trade-offs between sparsity and quality in the two methods offer practitioners flexibility in choosing the most appropriate approach based on their specific needs.
Tags
Original Sources
About the author
Kai built ML infrastructure at a Bay Area startup before developing an obsession with transformer architectures and inference optimisation that eventually pulled him out of product work entirely. A stint at a compute research lab sharpened his instinct for what actually matters in a model release versus what is marketing. He writes from the inside — from the perspective of someone who has debugged the systems he is describing at three in the morning. He is allergic to hype and instinctively drawn to the unglamorous plumbing questions that everyone else skips over.
More from The Engineer →This Week's Edition
28 March 2025
88 articles
Related Articles
Related Articles
More Stories