Teaching Humans Subtle Differences with DIFFusion

*Equal contribution
Columbia, Berkeley, Caltech
In Submission

DIFFusion

Human expertise depends on the ability to recognize subtle visual differences, such as distinguishing diseases, species, or celestial phenomena. We propose a new method to teach novices how to differentiate between nuanced categories in specialized domains. Our method uses generative models to visualize the minimal change in features to transition between classes, i.e., counterfactuals, and performs well even in domains where data is sparse, examples are unpaired, and category boundaries are not easily explained by text. By manipulating the conditioning space of diffusion models, our proposed method DIFFusion disentangles category structure from instance identity, enabling high-fidelity synthesis even in challenging domains. Experiments across six domains show accurate transitions even with limited and unpaired examples across categories. User studies confirm that our generated counterfactuals outperform unpaired examples in teaching perceptual expertise, showing the potential of generative models for specialized visual learning.
Black Hole: We learn that the SANE simulation tends to have more uniform wisps (yellow). The MAD simulation tends to have a more prominent photon ring (blue).
Teaser Image 1 Teaser Gif 1 Teaser Image 2 Teaser Gif 2
Butterfly: We learn that Viceroy has a cross-sectional line across its' wings, whereas Monarchs don't (yellow). Monarchs have a larger head with spots on it (magenta). Finally, Viceroy's splots along the edge of the wings can be described as more 'scaley', or 'gothic' (blue).
Teaser Image 3 Teaser Gif 3 Teaser Image 4 Teaser Gif 4
Retina: We learn that Retinas with drusen have bumps along the horizontal cross-section (yellow).
Teaser Image 5 Teaser Gif 5 Teaser Image 6 Teaser Gif 6

Method

Our approach leverages diffusion models to create smooth transitions between visual categories, helping novices learn subtle discriminative features. By carefully manipulating the conditioning space, we maintain instance identity while traversing category boundaries.
Method Diagram

Interpolation Results

DIFFusion generates smooth transitions that highlight key discriminative features between categories.
Interpolation Results

User Study

Our user studies demonstrate that DIFFusion significantly improves novices' ability to distinguish subtle visual differences.
User Study Results User Study Results Top User Study Results Bottom
Acknowledgements

We thank our user study participants and collaborators for their insights. This work is supported by the Carver Mead New Adventures Fund, a Pritzker Award, an AI4Science Amazon Discovery Grant, the NSF AI Institute for Artificial and Natural Intelligence (ARNI), NSF CAREER #2046910, NSF RETTL #2202578, DARPA ECOLE, and a Google Fellowship. Views are ours, not necessarily our sponsors'.