SOTAVerified

Image Manipulation

Image Manipulation is the process of altering or transforming an existing image to achieve a desired effect or to modify its content. This can involve various techniques and tools to enhance, modify, or create images based on specific requirements.

Papers

Showing 150 of 427 papers

TitleStatusHype
Emerging Properties in Unified Multimodal PretrainingCode9
Visual Agentic Reinforcement Fine-TuningCode7
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
Step1X-Edit: A Practical Framework for General Image EditingCode7
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
Guiding Instruction-based Image Editing via Multimodal Large Language ModelsCode4
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and GenerationCode4
Differentiable Data Augmentation with KorniaCode3
ReNoise: Real Image Inversion Through Iterative NoisingCode3
MaskGIT: Masked Generative Image TransformerCode3
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and LocalizationCode3
Pivotal Tuning for Latent-based Editing of Real ImagesCode3
Inversion-Free Image Editing with Language-Guided Diffusion ModelsCode3
Paint by Example: Exemplar-based Image Editing with Diffusion ModelsCode3
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative FrameworkCode3
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel MethodsCode3
Deep PCB To COCO ConvertorCode2
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and LocalizationCode2
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image ManipulationCode2
StyleCLIP: Text-Driven Manipulation of StyleGAN ImageryCode2
DeltaEdit: Exploring Text-free Training for Text-Driven Image ManipulationCode2
StyleGAN2 Distillation for Feed-forward Image ManipulationCode2
PUMA: Empowering Unified MLLM with Multi-granular Visual GenerationCode2
Open-World Entity SegmentationCode2
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationCode2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlCode2
Inversion-Free Image Editing with Natural LanguageCode2
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding TransformerCode2
IML-ViT: Benchmarking Image Manipulation Localization by Vision TransformerCode2
Closed-Form Factorization of Latent Semantics in GANsCode2
Diffusion Models already have a Semantic Latent SpaceCode2
ClickDiffusion: Harnessing LLMs for Interactive Precise Image EditingCode2
Interpreting the Latent Space of GANs for Semantic Face EditingCode2
GAN Inversion: A SurveyCode2
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and ManipulationCode2
MMFusion: Combining Image Forensic Filters for Visual Manipulation Detection and LocalizationCode1
A Disentangling Invertible Interpretation Network for Explaining Latent RepresentationsCode1
FacialGAN: Style Transfer and Attribute Manipulation on Synthetic FacesCode1
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and ResultsCode1
A Style-aware Discriminator for Controllable Image TranslationCode1
Exploiting Deep Generative Prior for Versatile Image Restoration and ManipulationCode1
Fake face detection via adaptive manipulation traces extraction networkCode1
EdiBERT, a generative model for image editingCode1
Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space NavigationCode1
Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?Code1
Drop the GAN: In Defense of Patches Nearest Neighbors as Single Image Generative ModelsCode1
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pix2PixHD-SIALPIPS (S1)0.44Unverified
2TPSLPIPS (S1)0.12Unverified