Introduction

March 3, 2026

Yinan Chen 1★ . Jiangning Zhang 1,2★ . Yali Bi 3 . Xiaobin Hu 2 . Teng Hu 4 .
Zhucun Xue 1 . Ran Yi 4 . Yong Liu 1† . Ying Tai 5

1 College of Control Science and Engineering, Zhejiang University     2 YouTu Lab, Tencent     3 College of Computer and Information Science, Southwest University
4 Department of Computer Science & Engineering, Shanghai Jiao Tong University     5 School of Intelligence Science and Technology, Nanjing University

arXiv PDF

Introduction

This repository is a comprehensive collection of resources for Image Inversion. If you find any work missing or have any suggestions, feel free to open a pull request or contact us. We will promptly add the missing papers to this repository.

✨Highlight!!!

1. Comprehensive Coverage of Image Inversion Techniques: Includes methods ranging from GANs and diffusion models to emerging frameworks like DiT and rectified flow.

2. Mainstream Applications: Supports applications such as object editing, attribute editing, style transfer, image restoration, and personalized generation.

3. Other Domain Generative Model Inversion: Extends to other domains, showcasing the versatility of generative model inversion techniques.

✨Survey pipeline

Summary of Contents

Image Inversion Methods

Diffusion Model

Training-free

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2025 | NIPS | Object & Attribute Editing | FreeInv: Free Lunch for Improving DDIM Inversion | Code |
| 2025 | ICCV | Object & Attribute Editing | EEdit: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing | Code |
| 2025 | CVPR | Style Transfer | StyleSSP: Sampling StartPoint Enhancement for Training-free Diffusion-based Method for Style Transfer | Code |
| 2025 | ICLR | Object & Attribute Editing | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | Code |
| 2025 | ICLR | Image Restoration | HD-Painter: High-Resolution and Prompt-Faithful Text-Guided Image Inpainting with Diffusion Models | Code |
| 2025 | ICLR | Object & Attribute Editing | GNRI: Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models | Code |
| 2025 | ICML | Object & Attribute Editing | EasyInv: Toward Fast and Better DDIM Inversion | Code |
| 2025 | ICML | Object & Attribute Editing | FireFlow: Fast Inversion of Rectified Flow for Image Semantic Editing | Code |
| 2025 | AAAI | Spatial-Aware Editing | DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing | Code |
| 2024 | CVPR | Object & Attribute Editing | An Edit Friendly DDPM Noise Space: Inversion and Manipulations | Code |
| 2024 | NN | Object & Attribute Editing | PFB-Diff: Progressive Feature Blending Diffusion for Text-driven Image Editing | Code |
| 2024 | NIPS | Object & Attribute Editing | Energy-Based Cross Attention for Bayesian Context Update in Text-to-Image Diffusion Models | Code |
| 2024 | WACV | Object & Attribute Editing | ProxEdit: Improving Tuning-Free Real Image Editing with Proximal Guidance | Code |
| 2024 | ECCV | Object & Attribute Editing | Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models | Code |
| 2024 | ECCV | Object & Attribute Editing | ReNoise: Real Image Inversion Through Iterative Noising | - |
| 2024 | ECCV | Object & Attribute Editing | Exact Diffusion Inversion via Bi-directional Integration Approximation | Code |
| 2024 | ICLR | Object & Attribute Editing | Magicremover: Tuning-free Text-guided Image inpainting with Diffusion Models | - |
| 2024 | ICLR | Object & Attribute Editing | PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code | Code |
| 2024 | ICLR | Object & Attribute Editing | Object-aware Inversion and Reassembly for Image Editing | Code |
| 2024 | CVPR | Object & Attribute Editing | LEDITS++: Limitless Image Editing using Text-to-Image Models | Code |
| 2024 | CVPR | Object & Attribute Editing | Contrastive Denoising Score for Text-guided Latent Diffusion Image Editing | Code |
| 2024 | CVPR | Object & Attribute Editing | Focus on Your Instruction: Fine-grained and Multi-instruction Image Editing by Attention Modulation | Code |
| 2024 | ICLR | Object & Attribute Editing | Noise Map Guidance: Inversion with Spatial Context for Real Image Editing | Code |
| 2024 | CVPR | Object & Attribute Editing | Towards Understanding Cross and Self-Attention in Stable Diffusion for Text-Guided Image Editing | Code |
| 2024 | Arxiv | Object & Attribute Editing | Ground-A-Score: Scaling Up the Score Distillation for Multi-Attribute Editing | Code |
| 2024 | ACM MM | Object & Attribute Editing | LoMOE: Localized Multi-Object Editing via Multi-Diffusion | Code |
| 2024 | ICLR | Spatial-Aware Editing | DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models | Code |
| 2024 | CVPR | Style Transfer | Z∗: Zero-shot Style Transfer via Attention Rearrangement | Code |
| 2024 | CVPR | Style Transfer | Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer | Code |
| 2024 | CVPR | Controllable Image Generation | FreeControl: Training-Free Spatial Control of Any Text-to-Image Diffusion Model with Any Condition | Code |
| 2024 | FG | Object & Attribute Editing | Discovering Interpretable Directions in the Semantic Latent Space of Diffusion Models | Code |
| 2024 | NIPS | Image Restoration | Blind Image Restoration via Fast Diffusion Inversion | Code |
| 2024 | SIGGRAPH | Image Fusion | Cross-Image Attention for Zero-Shot Appearance Transfer | Code |
| 2024 | ECCV | Image Fusion | Tuning-Free Image Customization with Image and Text Guidance | Code |
| 2024 | CVPR | Image Generation | Self-Discovering Interpretable Diffusion Latent Directions for Responsible Text-to-Image Generation | Code |
| 2024 | ACM MM | Personalized Generation | Pick-and-Draw: Training-free Semantic Guidance for Text-to-Image Personalization | - |
| 2024 | CVPR | Personalized Generation | DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization | Code |
| 2023 | ICLR | Object & Attribute Editing | Prompt-to-Prompt Image Editing with Cross Attention Control | Code |
| 2023 | ICLR | Object & Attribute Editing | DiffEdit: Diffusion-based semantic image editing with mask guidance | - |
| 2023 | CVPR | Object & Attribute Editing | Null-text Inversion for Editing Real Images using Guided Diffusion Models | Code |
| 2023 | CVPR | Object & Attribute Editing | EDICT: Exact Diffusion Inversion via Coupled Transformations | Code |
| 2023 | CVPR | Object & Attribute Editing | Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation | Code |
| 2023 | CVPR | Object & Attribute Editing | Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models | Code |
| 2023 | Arxiv | Object & Attribute Editing | Negative-prompt Inversion: Fast Image Inversion for Editing with Text-guided Diffusion Models | - |
| 2023 | ICCV | Object & Attribute Editing | Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models | Code |
| 2023 | Arxiv | Object & Attribute Editing | LEDITS: Real Image Editing with DDPM Inversion and Semantic Guidance | Code |
| 2023 | ICCV | Object & Attribute Editing | Effective Real Image Editing with Accelerated Iterative Diffusion Inversion | - |
| 2023 | NIPS | Object & Attribute Editing | Dynamic Prompt Learning: Addressing Cross-Attention Leakage for Text-Based Image Editing | Code |
| 2023 | ICCV | Attribute Editing | Localizing Object-level Shape Variations with Text-to-Image Diffusion Models | Code |
| 2023 | ICCV | Attribute Editing | MasaCtrl: Tuning-Free Mutual Self-Attention Control for Consistent Image Synthesis and Editing | Code |
| 2023 | PRCV | Attribute Editing | KV Inversion: KV Embeddings Learning for Text-Conditioned Real Image Action Editing | - |
| 2023 | AAAI | Attribute Editing | Tuning-Free Inversion-Enhanced Control for Consistent Image Editing | - |
| 2023 | TOG | Image Restoration | Blended Latent Diffusion | Code |
| 2023 | Arxiv | Image Restoration | Differential Diffusion: Giving Each Pixel Its Strength | Code |
| 2023 | ICCV | Object & Attribute Editing | TF-ICON: Diffusion-Based Training-Free Cross-Domain Image Composition | Code |
| 2023 | NIPS | Spatial-Aware Editing | Diffusion Self-Guidance for Controllable Image Generation | - |
| 2023 | ICCV | Object & Attribute Editing | Unifying Diffusion Models' Latent Space, with Applications to CycleDiffusion and Guidance | Code |
| 2023 | ICLR | Personalized Generation | An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion | Code |
| 2023 | Arxiv | Personalized Generation | Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion | Code |
| 2023 | Arxiv | Personalized Generation | P+: Extended Textual Conditioning in Text-to-Image Generation | Code |
| 2022 | CVPR | Image Restoration | Blended Diffusion for Text-driven Editing of Natural Images | Code |
| 2022 | NIPS | Image Restoration | High-Resolution Image Editing via Multi-Stage Blended Diffusion | Code |

Fine-tune

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2025 | WACV | Personalized Generation | A Data Perspective on Enhanced Identity Preservation for Diffusion Personalization | - |
| 2024 | ICLR | Spatial-Aware Editing | DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing | Code |
| 2024 | ICLR | Personalized Generation | DisenBooth: Identity-Preserving Disentangled Tuning for Subject-Driven Text-to-Image Generation | Code |
| 2024 | NIPS | Personalized Generation | Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion Models | Code |
| 2024 | CVPR | Personalized Generation | FaceChain-SuDe: Building Derived Class to Inherit Category Attributes for One-shot Subject-Driven Generation | Code |
| 2023 | CVPR | Object & Attribute Editing | Imagic: Text-Based Real Image Editing with Diffusion Models | - |
| 2023 | TOG | Object & Attribute Editing | UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image | Code |
| 2023 | CVPR | Object & Attribute Editing | SINE: SINgle Image Editing with Text-to-Image Diffusion Models | Code |
| 2023 | Arxiv | Object & Attribute Editing | Forgedit: Text Guided Image Editing via Learning and Forgetting | Code |
| 2023 | NIPS | Image Fusion | Photoswap: Personalized Subject Swapping in Images | Code |
| 2023 | TMLR | Image Fusion | DreamEdit: Subject-driven Image Editing | Code |
| 2023 | CVPR | Personalized Generation | DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation | Code |
| 2023 | CVPR | Personalized Generation | Multi-Concept Customization of Text-to-Image Diffusion | Code |
| 2023 | ICML | Personalized Generation | Cones: Concept Neurons in Diffusion Models for Customized Generation | - |
| 2023 | ICCV | Personalized Generation | SVDiff: Compact Parameter Space for Diffusion Fine-Tuning | Code |
| 2023 | CVPR | Personalized Generation | Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models | - |

Extra Trainable Module

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2025 | CVPR | Image Restoration | Arbitrary-steps Image Super-resolution via Diffusion Inversion | Code |
| 2025 | CVM | Object & Attribute Editing | StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing | Code |
| 2024 | CVPR | Object & Attribute Editing | ZONE: Zero-Shot Instruction-Guided Local Editing | Code |
| 2024 | CVPR | Object & Attribute Editing | Doubly Abductive Counterfactual Inference for Text-based Image Editing | Code |
| 2024 | ECCV | Object & Attribute Editing | TurboEdit: Instant text-based image editing | - |
| 2024 | CVPR | Spatial-Aware Editing | DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing | Code |
| 2024 | AAAI | Personalized Generation | Decoupled Textual Embeddings for Customized Image Generation | Code |
| 2024 | CVPR | Image Concept Decoupling | CLiC: Concept Learning in Context | Code |
| 2023 | ICLR | Object & Attribute Editing | Diffusion Models Already Have A Semantic Latent Space | Code |
| 2023 | Arxiv | Object & Attribute Editing | Region-Aware Diffusion for Zero-shot Text-driven Image Editing | Code |
| 2023 | ICCV | Object & Attribute Editing | Delta Denoising Score | Code |
| 2023 | CVPR | Style Transfer | Inversion-Based Style Transfer With Diffusion Models | Code |
| 2023 | SIGGRAPH Asia | Personalized Generation | A Neural Space-Time Representation for Text-to-Image Personalization | Code |
| 2023 | Arxiv | Personalized Generation | ViCo: Plug-and-play Visual Condition for Personalized Text-to-image Generation | Code |
| 2023 | SIGGRAPH Asia | Image Concept Decoupling | Break-A-Scene: Extracting Multiple Concepts from a Single Image | Code |
| 2022 | Arxiv | Personalized Generation | DreamArtist: Towards Controllable One-Shot Text-to-Image Generation via Positive-Negative Prompt-Tuning | Code |

GANs

Hybrid Table

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2024 | AAAI | Attribute Editing | Spatial-Contextual Discrepancy Information Compensation for GAN Inversion | Code |
| 2024 | IJCV | Image Fusion | One-Shot Neural Face Reenactment via Finding Directions in GAN’s Latent Space | - |
| 2024 | CVPR | Attribute Editing | The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing | Code |
| 2023 | WACV | Attribute Editing | DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing | - |
| 2023 | AAAI | Attribute Editing | ReGANIE: Rectifying GAN Inversion Errors for Accurate Real Image Editing | - |
| 2022 | ECCV | Style Transfer | JoJoGAN: One Shot Face Stylization | Code |
| 2022 | CVPR | Attribute Editing | Spatially-Adaptive Multilayer Selection for GAN Inversion and Editing | Code |
| 2022 | ECCV | Attribute Editing | Editing Out-of-Domain GAN Inversion via Differential Activations | - |
| 2022 | NIPS | Object & Attribute Editing | Generalized One-shot Domain Adaptation of Generative Adversarial Networks | Code |
| 2016 | ECCV | Attribute Editing | Generative Visual Manipulation on the Natural Image Manifold | Code |

Latent Optimization Table

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2024 | AAAI | Object & Attribute Editing | HyperEditor: Achieving Both Authenticity and Cross-Domain Capability in Image Editing via Hypernetworks | Code |
| 2023 | CVPR | Attribute Editing | Balancing Reconstruction and Editing Quality of GAN Inversion for Real Image Editing with StyleGAN Prior Latent Space | - |
| 2022 | TOG | Attribute Editing | Pivotal Tuning for Latent-based Editing of Real Images | Code |
| 2022 | ECCV | Attribute Editing | Chunkmogrify: Real image inversion via Segments | Code |
| 2022 | CVPR | Attribute Editing | Overparameterization Improves StyleGAN Inversion | - |
| 2019 | ICCV | Attribute Editing | Image2StyleGAN: How to Embed Images Into the StyleGAN Latent Space? | Code |
| 2016 | NIPS | Image Generation | Inverting the generator of a generative adversarial network | Code |

Encoder-based Table

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2024 | AAAI | Attribute Editing | Gradual Residuals Alignment: A Dual-Stream Framework for GAN Inversion and Image Attribute Editing | - |
| 2023 | CVPR | Image Fusion | Fine-Grained Face Swapping via Regional GAN Inversion | Code |
| 2023 | CVPR | Attribute Editing | Delving StyleGAN Inversion for Image Editing: A Foundation Latent Space Viewpoint | Code |
| 2023 | CVPR | Attribute Editing | StyleRes: Transforming the Residuals for Real Image Editing with StyleGAN | Code |
| 2023 | ICCV | Attribute Editing | Diverse Inpainting and Editing with GAN Inversion | - |
| 2023 | TOG | Object & Attribute Editing | CLIP-Guided StyleGAN Inversion for Text-Driven Real Image Editing | Code |
| 2022 | CVPR | Attribute Editing | HyperInverter: Improving StyleGAN Inversion via Hypernetwork | Code |
| 2022 | CVPR | Attribute Editing | Style Transformer for Image Inversion and Editing | Code |
| 2022 | ECCV | Attribute Editing | High-fidelity GAN Inversion with Padding Space | Code |
| 2022 | ACM MM | Attribute Editing | Everything is There in Latent Space: Attribute Editing and Attribute Style Manipulation by StyleGAN Latent Space Exploration | - |
| 2022 | NIPS | Image Restoration | Semantic uncertainty intervals for disentangled latent spaces | Code |
| 2022 | ECCV | Attribute Editing | IntereStyle: Encoding an Interest Region for Robust StyleGAN Inversion | - |
| 2021 | SIGGRAPH | Attribute Editing | Designing an Encoder for StyleGAN Image Manipulation | Code |
| 2016 | Arxiv | Attribute Editing | Invertible conditional GANs for image editing | Code |

Promising Technologies

DiT

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2025 | CVPR | Object & Attribute Editing | Stable Flow: Vital Layers for Training-Free Image Editing | Code |
| 2024 | AAAI | Object & Attribute Editing | DiT4Edit: Diffusion Transformer for Image Editing | Code |

Rectified Flow

| Year | Venue | Task | Paper Title | Code |
| --- | --- | --- | --- | --- |
| 2025 | NIPS | Object & Attribute Editing | DNAEdit: Direct Noise Alignment for Text-Guided Rectified Flow Editing | Code |
| 2025 | ICCV | Object Editing | KV-Edit: Training-Free Image Editing for Precise Background Preservation | Code |
| 2025 | ICLR | Object & Attribute Editing | Semantic Image Inversion and Editing using Rectified Stochastic Differential Equations | Code |
| 2025 | ICLR | Object & Attribute Editing | Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models | Code |
| 2025 | ICML | Object & Attribute Editing | Taming Rectified Flow for Inversion and Editing | Code |

Related Research Domains

Video

| Year | Venue | Category | Task | Paper | Code |
| --- | --- | --- | --- | --- | --- |
| 2025 | CVPR | DM | Video Editing | VideoDirector: Precise Video Editing via Text-to-Video Models | Code |
| 2025 | NIPS | DM | Dynamic View Synthesis | Dynamic View Synthesis as an Inverse Problem | - |
| 2025 | ICLR | DM | Video Editing | VideoGrain: Modulating Space-Time Attention for Multi-Grained Video Editing | Code |
| 2025 | ICML | DM | Video & Image Editing | Editable Noise Map Inversion: Encoding Target-image into Noise For High-Fidelity Image Manipulation | - |
| 2024 | CVPR | GAN | Video Editing | In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing | Code |
| 2024 | CVPR | DM | Video Editing | Video-P2P: Video Editing with Cross-attention Control | Code |
| 2024 | CVPR | DM | Video Editing | Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer | Code |
| 2024 | CVPR | DM | Video Editing | A Video is Worth 256 Bases: Spatial-Temporal Expectation-Maximization Inversion for Zero-Shot Video Editing | Code |
| 2024 | Arxiv | DM | Video Editing | Motion Inversion for Video Customization | Code |
| 2024 | ECCV | DM | Video Editing | DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion | Code |
| 2024 | ECCV | DM | Video Editing | Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Code |
| 2023 | CVPR | GAN | Video Editing | VIVE3D: Viewpoint-Independent Video Editing using 3D-Aware GANs | Code |
| 2023 | ICCV | DM | Video Editing | FateZero: Fusing Attentions for Zero-shot Text-based Video Editing | Code |
| 2023 | ICCV | GAN | Video Editing | RIGID: Recurrent GAN Inversion and Editing of Real Face Videos | Code |
| 2023 | ICCV | GAN | Video Editing | StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation | Code |
| 2023 | ICCV | DM | Video Editing | Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation | Code |
| 2023 | Arxiv | DM | Video Editing | Dreamix: Video Diffusion Models are General Video Editors | - |
| 2023 | ICCV | DM | Video Editing | Pix2Video: Video Editing using Image Diffusion | Code |
| 2023 | ICCV | DM | Video Editing | Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators | Code |
| 2022 | ECCV | GAN | Video Editing | Temporally Consistent Semantic Video Editing | - |

3D

| Year | Venue | Category | Task | Paper | Code |
| --- | --- | --- | --- | --- | --- |
| 2025 | CVPR | RF | 3D Object Editing | SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis | Code |
| 2024 | CVPR | GAN | 3D Face Reconstruction | In-N-Out: Faithful 3D GAN Inversion with Volumetric Decomposition for Face Editing | Code |
| 2024 | CVPR | GAN | 3D Face Reconstruction | Diffusion-driven GAN Inversion for Multi-Modal Face Image Generation | - |
| 2024 | CVPR | DM | 3D Object Editing | SHAP-EDITOR: Instruction-Guided Latent 3D Editing in Seconds | Code |
| 2024 | ECCV | DM | 3D Scene Editing | LatentEditor: Text Driven Local Editing of 3D Scenes | Code |
| 2024 | ECCV | GAN + DM | 3D Face Reconstruction | Real-Time 3D-Aware Portrait Editing from a Single Image | Code |
| 2023 | WACV | GAN | 3D Face Reconstruction | 3D GAN Inversion with Pose Optimization | Code |
| 2023 | CVPR | GAN | 3D Face Reconstruction | High-Fidelity 3D GAN Inversion by Pseudo-Multi-View Optimization | Code |
| 2023 | CVPR | GAN | 3D Face Reconstruction | 3D GAN Inversion With Facial Symmetry Prior | Code |
| 2023 | CVPR | GAN | 3D Face Reconstruction | Self-Supervised Geometry-Aware Encoder for Style-Based 3D GAN Inversion | Code |
| 2023 | ICCV | DM | 3D Scene Editing | Instruct-NeRF2NeRF: Editing 3D Scenes with Instructions | Code |

Audio

| Year | Venue | Category | Task | Paper | Code |
| --- | --- | --- | --- | --- | --- |
| 2024 | IJCAI | DM | Audio Editing | MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models | Code |
| 2024 | ICML | DM | Audio Editing | Zero-Shot Unsupervised and Text-Based Audio Editing Using DDPM Inversion | Code |
| 2024 | ICML | DM | Audio Editing | Prompt-guided Precise Audio Editing with Diffusion Models | - |
| 2024 | Arxiv | DM | Audio Editing | MEDIC: Zero-shot Music Editing with Disentangled Inversion Control | - |
| 2024 | Arxiv | DM | Audio Editing | AudioEditor: A Training-Free Diffusion-Based Audio Editing Framework | Code |
| 2023 | ICASSP | DM | Audio Restoration | Solving Audio Inverse Problems with a Diffusion Model | Code |

Cite The Survey

If you find our survey and repository useful for your research projects, please consider citing our paper:

@article{chen2025imageinversion,
      title={Image Inversion: A Survey from GANs to Diffusion and Beyond}, 
      author={Yinan Chen and Jiangning Zhang and Yali Bi and Xiaobin Hu and Teng Hu and Zhucun Xue and Ran Yi and Yong Liu and Ying Tai},
      year={2025},
      journal={CoRR},
      url={https://arxiv.org/abs/2502.11974}, 
}

Contact

yinanchencs@outlook.com
186368@zju.edu.cn