SOTAVerified

Image Manipulation

Image Manipulation is the process of altering or transforming an existing image to achieve a desired effect or to modify its content. This can involve various techniques and tools to enhance, modify, or create images based on specific requirements.

Papers

Showing 125 of 427 papers

TitleStatusHype
Emerging Properties in Unified Multimodal PretrainingCode9
Visual Agentic Reinforcement Fine-TuningCode7
Step1X-Edit: A Practical Framework for General Image EditingCode7
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and GenerationCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
Guiding Instruction-based Image Editing via Multimodal Large Language ModelsCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative FrameworkCode3
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and LocalizationCode3
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
ReNoise: Real Image Inversion Through Iterative NoisingCode3
Inversion-Free Image Editing with Language-Guided Diffusion ModelsCode3
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel MethodsCode3
Paint by Example: Exemplar-based Image Editing with Diffusion ModelsCode3
MaskGIT: Masked Generative Image TransformerCode3
Pivotal Tuning for Latent-based Editing of Real ImagesCode3
Differentiable Data Augmentation with KorniaCode3
ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and LocalizationCode2
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding TransformerCode2
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationCode2
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and ManipulationCode2
PUMA: Empowering Unified MLLM with Multi-granular Visual GenerationCode2
Show:102550
← PrevPage 1 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pix2PixHD-SIALPIPS (S1)0.44Unverified
2TPSLPIPS (S1)0.12Unverified