SOTAVerified

Image Manipulation

Image Manipulation is the process of altering or transforming an existing image to achieve a desired effect or to modify its content. This can involve various techniques and tools to enhance, modify, or create images based on specific requirements.

Papers

Showing 150 of 427 papers

TitleStatusHype
Emerging Properties in Unified Multimodal PretrainingCode9
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
Step1X-Edit: A Practical Framework for General Image EditingCode7
Visual Agentic Reinforcement Fine-TuningCode7
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image EditingCode4
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and GenerationCode4
Guiding Instruction-based Image Editing via Multimodal Large Language ModelsCode4
Detectron2 Object Detection & Manipulating Images using CartoonizationCode4
Paint by Example: Exemplar-based Image Editing with Diffusion ModelsCode3
ReNoise: Real Image Inversion Through Iterative NoisingCode3
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel MethodsCode3
Differentiable Data Augmentation with KorniaCode3
Pivotal Tuning for Latent-based Editing of Real ImagesCode3
MaskGIT: Masked Generative Image TransformerCode3
Inversion-Free Image Editing with Language-Guided Diffusion ModelsCode3
Kandinsky 3: Text-to-Image Synthesis for Multifunctional Generative FrameworkCode3
GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and LocalizationCode3
IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & LocalizationCode3
Deep PCB To COCO ConvertorCode2
Visual Text Processing: A Comprehensive Review and Unified EvaluationCode2
ForensicHub: A Unified Benchmark & Codebase for All-Domain Fake Image Detection and LocalizationCode2
Diffusion Models already have a Semantic Latent SpaceCode2
StyleGAN2 Distillation for Feed-forward Image ManipulationCode2
StyleCLIP: Text-Driven Manipulation of StyleGAN ImageryCode2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance ControlCode2
PUMA: Empowering Unified MLLM with Multi-granular Visual GenerationCode2
Mesoscopic Insights: Orchestrating Multi-scale & Hybrid Architecture for Image Manipulation LocalizationCode2
Open-World Entity SegmentationCode2
Closed-Form Factorization of Latent Semantics in GANsCode2
IML-ViT: Benchmarking Image Manipulation Localization by Vision TransformerCode2
Inversion-Free Image Editing with Natural LanguageCode2
GAN Inversion: A SurveyCode2
DeltaEdit: Exploring Text-free Training for Text-Driven Image ManipulationCode2
HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and ManipulationCode2
ClickDiffusion: Harnessing LLMs for Interactive Precise Image EditingCode2
Can We Get Rid of Handcrafted Feature Extractors? SparseViT: Nonsemantics-Centered, Parameter-Efficient Image Manipulation Localization through Spare-Coding TransformerCode2
Interpreting the Latent Space of GANs for Semantic Face EditingCode2
DreamSampler: Unifying Diffusion Sampling and Score Distillation for Image ManipulationCode2
MMFusion: Combining Image Forensic Filters for Visual Manipulation Detection and LocalizationCode1
A Disentangling Invertible Interpretation Network for Explaining Latent RepresentationsCode1
FacialGAN: Style Transfer and Attribute Manipulation on Synthetic FacesCode1
AIM 2024 Challenge on Compressed Video Quality Assessment: Methods and ResultsCode1
A Style-aware Discriminator for Controllable Image TranslationCode1
Fake face detection via adaptive manipulation traces extraction networkCode1
Diffusion Cocktail: Mixing Domain-Specific Diffusion Models for Diversified Image GenerationsCode1
DiffusionCLIP: Text-Guided Diffusion Models for Robust Image ManipulationCode1
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image EditingCode1
Can We Find Neurons that Cause Unrealistic Images in Deep Generative Networks?Code1
Dense Interspecies Face EmbeddingCode1
Show:102550
← PrevPage 1 of 9Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Pix2PixHD-SIALPIPS (S1)0.44Unverified
2TPSLPIPS (S1)0.12Unverified