SOTAVerified

Image Manipulation

Image Manipulation is the process of altering or transforming an existing image to achieve a desired effect or to modify its content. This can involve various techniques and tools to enhance, modify, or create images based on specific requirements.

Papers

Showing 150 of 427 papers

TitleStatusHype
Beyond Fully Supervised Pixel Annotations: Scribble-Driven Weakly-Supervised Framework for Image Manipulation Localization0
Towards Reliable Identification of Diffusion-based Image Manipulations0
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and GenerationCode4
Weakly-supervised Localization of Manipulated Image Regions Using Multi-resolution Learned Features0
RBench-V: A Primary Assessment for Visual Reasoning Models with Multi-modal Outputs0
My Face Is Mine, Not Yours: Facial Protection Against Diffusion Model Face Swapping0
Visual Agentic Reinforcement Fine-TuningCode7
Emerging Properties in Unified Multimodal PretrainingCode9
Anti-Inpainting: A Proactive Defense against Malicious Diffusion-based Inpainters under Unknown Conditions0
ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and LocalizationCode2
3D-Fixup: Advancing Photo Editing with 3D Priors0
IntrinsicEdit: Precise generative image manipulation in intrinsic space0
MDE-Edit: Masked Dual-Editing for Multi-Object Image Editing via Diffusion Models0
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
Step1X-Edit: A Practical Framework for General Image EditingCode7
Can GPT tell us why these images are synthesized? Empowering Multimodal Large Language Models for Forensics0
AnimeDL-2M: Million-Scale AI-Generated Anime Image Detection and Localization in Diffusion Era0
Towards Explainable Partial-AIGC Image Quality Assessment0
Pro-DG: Procedural Diffusion Guidance for Architectural Facade Generation0
Patronus: Bringing Transparency to Diffusion Models with PrototypesCode0
Context-Aware Weakly Supervised Image Manipulation Localization with SAM Refinement0
LEGION: Learning to Ground and Explain for Synthetic Image Detection0
Advancing 3D Gaussian Splatting Editing with Complementary and Consensus Information0
Deepfake Detection of Face Images based on a Convolutional Neural Network0
EmoAgent: A Multi-Agent Framework for Diverse Affective Image Manipulation0
PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models0
IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning0
LLM-guided Instance-level Image Manipulation with Diffusion U-Net Cross-Attention MapsCode0
Joint Learning of Depth and Appearance for Portrait Image Animation0
Data-Driven Fairness Generalization for Deepfake Detection0
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding TransformerCode2
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationCode2
Prompt Augmentation for Self-supervised Text-guided Image Manipulation0
Instruction-based Image Manipulation by Watching How Things Move0
Detecting Facial Image Manipulations with Multi-Layer CNN Models0
EditScout: Locating Forged Regions from Diffusion-based Edited Images with Multimodal LLM0
Unleashing In-context Learning of Autoregressive Models for Few-shot Image Manipulation0
ForgerySleuth: Empowering Multimodal Large Language Models for Image Manipulation DetectionCode1
Omni-IML: Towards Unified Image Manipulation Localization0
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and ManipulationCode2
Visual question answering based evaluation metrics for text-to-image generation0
HRGR: Enhancing Image Manipulation Detection via Hierarchical Region-aware Graph Reasoning0
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative FrameworkCode3
Training Free Guided Flow Matching with Optimal Control0
SLIC: Secure Learned Image Codec through Compressed Domain Watermarking to Defend Image Manipulation0
PUMA: Empowering Unified MLLM with Multi-granular Visual GenerationCode2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlCode2
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image EditingCode1
AIM 2024 Challenge on Video Super-Resolution Quality Assessment: Methods and Results0
Exploring Strengths and Weaknesses of Super-Resolution Attack in Deepfake Detection0
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pix2PixHD-SIALPIPS (S1)0.44Unverified
2TPSLPIPS (S1)0.12Unverified