StepX โ€” Attention Explorer

Visualize how text and image attention evolve across Stable Diffusion denoising steps

๐Ÿ”— DAAM
Cross-attention maps โ€” see which image regions each prompt word attends to
๐Ÿ” DAAM-I2I
Self-attention maps โ€” see how pixels attend to each other during generation
๐Ÿ”ญ TITAN
Object discovery โ€” detect and annotate objects with bounding boxes

Select a different model only if you want to switch from the default SD 1.5. The model loads automatically when you click Generate.

Model

Text-to-Image Cross-Attention Maps

Visualize how text tokens attend to image regions during generation.

๐Ÿ“ Input

10 100

๐Ÿ–ผ๏ธ Generated Image

๐Ÿ“ Depth Map (ZoeDepth)

Metric depth from the generated image. Included in Export All ZIP.


๐ŸŽ›๏ธ Attention Controls

Focus Word
0 50
0.1 1

๐ŸŽฌ GIF Animation

0.1 2

๐Ÿ“ฆ Export All

๐Ÿ”ฅ Attention Map

๐Ÿ’ก Note: Use the 'Download GIF' button to download the generated GIF. Use Export All to get one ZIP with every step ร— token image + CSV and per-token GIFs.