SimEdit: Conditioning Matters — Stabilizing Inversion and Attention in Diffusion Image Editing

Official implementation of our ECML PKDD 2026 paper "Conditioning Matters: Stabilizing Inversion and Attention in Diffusion Image Editing".

SimEdit is a training-free framework for inversion-based diffusion image editing. It is built on the observation that the precision and structural alignment of textual conditioning strongly affect both inversion stability and cross-branch attention consistency. SimEdit has two complementary components:

Conditioning Refinement (CR): expands the original source/target prompts with additional image-grounded details while preserving a shared semantic structure, stabilizing the diffusion velocity field and improving background preservation.
Token-wise Cross-Branch Attention Control (TCAC): uses an LCS-based token alignment to separate structure-preserving and edit-driving tokens, and modulates their attention contributions asymmetrically during editing.

Installation

# Python 3.10 recommended
pip install -r requires.txt

The implementation is based on FLUX.1-dev (via diffusers). A GPU with sufficient memory is required (we use NVIDIA A800; FLUX.1-dev needs about 48GB for 512x512 images).

Quick Start

The full pipeline is in SimEdit.ipynb. Open it and run the cells in order. It will:

(optionally) generate refined source/target prompts via an LLM,
reconstruct the source image from inversion,
produce the final edited result,
save all intermediate outputs to folders.

Conditioning Refinement (API key)

Conditioning refinement calls a vision-language model through the OpenRouter API. Before running the refinement cell, set your own key:

os.environ["OPENROUTER_API_KEY"] = "your-key-here"

This step is optional: you can also skip refinement and run editing with the prompts provided in Mapping_file_for_PIEBench.json, or supply your own refined prompts. The system prompt we use for refinement is in system_prompt.txt.

Repository Contents

File	Description
`SimEdit.ipynb`	Complete SimEdit pipeline (conditioning refinement + token-wise cross-branch attention control), with reconstruction and editing outputs.
`system_prompt.txt`	System prompt used for conditioning refinement; expands the original prompts with additional image-grounded details.
`Mapping_file_for_PIEBench.json`	Source/target prompt pairs used in our final experiments, generated with our system prompt and Gemini-2.5-Pro.
`CLIP*_mapping.json`	Source/target prompt pairs used to compute the CLIPSim* metric reported in the paper.
`calc_for_L_and_directional_deviation.py`	Estimation of the empirical Lipschitz constant `L` and the directional deviation used in the motivation analysis.
`src/model/`	Model code: diffuser pipeline, attention manipulation, and CLIP utilities.
`src/util/`	Utilities: token alignment (LCS), prompt running, metrics, and attention visualization.
`example.jpg`	Example input image used in the notebook.
`requires.txt`	Required Python packages.

Citation

If you find this work useful, please consider citing our paper:

@inproceedings{zhan2026simedit,
  title     = {Conditioning Matters: Stabilizing Inversion and Attention in Diffusion Image Editing},
  author    = {Zhan, Zheyuan and Li, Hongchen and Wang, Can and Ma, Yinfei and Huang, Mingzhen and Bai, Ruoshi and Chen, Jiawei and Lyu, Siwei and Chen, Defang},
  booktitle = {ECML PKDD},
  year      = {2026}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SimEdit: Conditioning Matters — Stabilizing Inversion and Attention in Diffusion Image Editing

Installation

Quick Start

Conditioning Refinement (API key)

Repository Contents

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
.gitignore		.gitignore
CLIP*_mapping.json		CLIP*_mapping.json
LICENSE		LICENSE
Mapping_file_for_PIEBench.json		Mapping_file_for_PIEBench.json
README.md		README.md
SimEdit.ipynb		SimEdit.ipynb
calc_for_L_and_directional_deviation.py		calc_for_L_and_directional_deviation.py
example.jpg		example.jpg
requires.txt		requires.txt
supplementary.pdf		supplementary.pdf
system_prompt.txt		system_prompt.txt

Folders and files

Latest commit

History

Repository files navigation

SimEdit: Conditioning Matters — Stabilizing Inversion and Attention in Diffusion Image Editing

Installation

Quick Start

Conditioning Refinement (API key)

Repository Contents

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages