SOTAVerified

Image Manipulation

Image Manipulation is the process of altering or transforming an existing image to achieve a desired effect or to modify its content. This can involve various techniques and tools to enhance, modify, or create images based on specific requirements.

Papers

Showing 150 of 427 papers

TitleStatusHype
Emerging Properties in Unified Multimodal PretrainingCode9
Visual Agentic Reinforcement Fine-TuningCode7
Step1X-Edit: A Practical Framework for General Image EditingCode7
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and GenerationCode4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
Guiding Instruction-based Image Editing via Multimodal Large Language ModelsCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative FrameworkCode3
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and LocalizationCode3
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
ReNoise: Real Image Inversion Through Iterative NoisingCode3
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel MethodsCode3
Inversion-Free Image Editing with Language-Guided Diffusion ModelsCode3
Paint by Example: Exemplar-based Image Editing with Diffusion ModelsCode3
MaskGIT: Masked Generative Image TransformerCode3
Pivotal Tuning for Latent-based Editing of Real ImagesCode3
Differentiable Data Augmentation with KorniaCode3
ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and LocalizationCode2
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding TransformerCode2
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationCode2
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and ManipulationCode2
PUMA: Empowering Unified MLLM with Multi-granular Visual GenerationCode2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlCode2
ClickDiffusion: Harnessing LLMs for Interactive Precise Image EditingCode2
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image ManipulationCode2
Inversion-Free Image Editing with Natural LanguageCode2
IML-ViT: Benchmarking Image Manipulation Localization by Vision TransformerCode2
DeltaEdit: Exploring Text-free Training for Text-Driven Image ManipulationCode2
Diffusion Models already have a Semantic Latent SpaceCode2
Deep PCB To COCO ConvertorCode2
Open-World Entity SegmentationCode2
StyleCLIP: Text-Driven Manipulation of StyleGAN ImageryCode2
GAN Inversion: A SurveyCode2
Closed-Form Factorization of Latent Semantics in GANsCode2
StyleGAN2 Distillation for Feed-forward Image ManipulationCode2
Interpreting the Latent Space of GANs for Semantic Face EditingCode2
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation DetectionCode1
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image EditingCode1
Wavelet-Driven Generalizable Framework for Deepfake Face Forgery DetectionCode1
Click2Mask: Local Editing with Dynamic Mask GenerationCode1
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and ResultsCode1
DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News DetectionCode1
Harmfully Manipulated Images Matter in Multimodal Misinformation DetectionCode1
TGIF: Text-Guided Inpainting Forgery DatasetCode1
Deformation-Recovery Diffusion Model (DRDM): Instance Deformation for Image Manipulation and SynthesisCode1
Deep Image Composition Meets Image ForgeryCode1
Unveiling Implicit Deceptive Patterns in Multi-Modal Fake News via Neuro-Symbolic ReasoningCode1
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pix2PixHD-SIALPIPS (S1)0.44Unverified
2TPSLPIPS (S1)0.12Unverified