Emu3: Next-Token Prediction is All You Need Sep 27, 2024 All
Code Code Available 3PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions Sep 23, 2024 Image Generation Image Restoration
Code Code Available 3Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models Sep 11, 2024 3D Generation 3D Reconstruction
Code Code Available 3VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation Sep 6, 2024 Image Generation
Code Code Available 3Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation Aug 19, 2024 Image Generation Video Generation
Code Code Available 3Scaling Diffusion Transformers to 16 Billion Parameters Jul 16, 2024 Attribute Conditional Image Generation
Code Code Available 3Consistency Flow Matching: Defining Straight Flows with Velocity Consistency Jul 2, 2024 Image Generation
Code Code Available 3StyleShot: A Snapshot on Any Style Jul 1, 2024 Image Generation Style Transfer
Code Code Available 3Consistency Models Made Easy Jun 20, 2024 Computational Efficiency GPU
Code Code Available 3DF40: Toward Next-Generation Deepfake Detection Jun 19, 2024 DeepFake Detection Face Reenactment
Code Code Available 3GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual Generation Jun 19, 2024 Benchmarking Image Generation
Code Code Available 3MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance Jun 11, 2024 Image Generation Text to Image Generation
Code Code Available 3Image and Video Tokenization with Binary Spherical Quantization Jun 11, 2024 Decoder Image Generation
Code Code Available 3An Image is Worth 32 Tokens for Reconstruction and Generation Jun 11, 2024 Image Generation Image Reconstruction
Code Code Available 3Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization Jun 6, 2024 Denoising Image Generation
Code Code Available 3AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation Jun 3, 2024 Image Generation
Code Code Available 3Deciphering Oracle Bone Language with Diffusion Models Jun 2, 2024 Decipherment Image Generation
Code Code Available 3DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis May 23, 2024 Image Generation Mamba
Code Code Available 3On the Trajectory Regularity of ODE-based Diffusion Sampling May 18, 2024 Denoising Image Generation
Code Code Available 3Inf-DiT: Upsampling Any-Resolution Image with Memory-Efficient Diffusion Transformer May 7, 2024 Image Generation Super-Resolution
Code Code Available 3ImageInWords: Unlocking Hyper-Detailed Image Descriptions May 5, 2024 Image Generation Specificity
Code Code Available 3U-DiTs: Downsample Tokens in U-Shaped Diffusion Transformers May 4, 2024 Image Generation Inductive Bias
Code Code Available 3Taming Stable Diffusion for Text to 360° Panorama Image Generation Apr 11, 2024 Denoising Image Generation
Code Code Available 3MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation Apr 8, 2024 Image Generation Image-to-Image Translation
Code Code Available 3Towards Realistic Scene Generation with LiDAR Diffusion Models Mar 31, 2024 3D geometry Image Generation
Code Code Available 3Self-Rectifying Diffusion Sampling with Perturbed-Attention Guidance Mar 26, 2024 Deblurring Denoising
Code Code Available 3FlashFace: Human Image Personalization with High-fidelity Identity Preservation Mar 25, 2024 Face Swapping Image Generation
Code Code Available 3DesignEdit: Multi-Layered Latent Decomposition and Fusion for Unified & Accurate Image Editing Mar 21, 2024 Image Generation spatial-aware image editing
Code Code Available 3Generic 3D Diffusion Adapter Using Controlled Multi-View Editing Mar 18, 2024 3D Generation Image Generation
Code Code Available 3Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model Mar 12, 2024 Image Generation Text to Image Generation
Code Code Available 3Scaling Rectified Flow Transformers for High-Resolution Image Synthesis Mar 5, 2024 Image Generation
Code Code Available 3Behavior Generation with Latent Actions Mar 5, 2024 Autonomous Driving Decision Making
Code Code Available 3ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models Mar 4, 2024 Denoising Image Generation
Code Code Available 3VisionLLaMA: A Unified LLaMA Backbone for Vision Tasks Mar 1, 2024 Image Classification Image Generation
Code Code Available 3Trajectory Consistency Distillation: Improved Latent Consistency Distillation by Semi-Linear Consistency Function with Trajectory Mapping Feb 29, 2024 Image Generation
Code Code Available 3Coarse-to-Fine Latent Diffusion for Pose-Guided Person Image Synthesis Feb 28, 2024 Decoder Image Generation
Code Code Available 3Visual Style Prompting with Swapping Self-Attention Feb 20, 2024 Denoising Image Generation
Code Code Available 3FiT: Flexible Vision Transformer for Diffusion Model Feb 19, 2024 Computational Efficiency Image Cropping
Code Code Available 3DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation Feb 19, 2024 Image Generation
Code Code Available 3Magic-Me: Identity-Specific Video Customized Diffusion Feb 14, 2024 Image Generation Text to Image Generation
Code Code Available 3Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models Feb 7, 2024 counterfactual Image Generation
Code Code Available 3Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models Jan 1, 2024 Image Generation Text to Image Generation
Code Code Available 3SEED-Bench: Benchmarking Multimodal Large Language Models Jan 1, 2024 Benchmarking Image Generation
Code Code Available 3Style Aligned Image Generation via Shared Attention Dec 4, 2023 Image Generation
Code Code Available 3UniGS: Unified Representation for Image Generation and Segmentation Dec 4, 2023 Image Generation Segmentation
Code Code Available 3VBench: Comprehensive Benchmark Suite for Video Generative Models Nov 29, 2023 Image Generation Video Generation
Code Code Available 3Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models Nov 20, 2023 Image Generation
Code Code Available 3Multimodal Foundation Models: From Specialists to General-Purpose Assistants Sep 18, 2023 Image Generation Survey
Code Code Available 3InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation Sep 12, 2023 GPU Image Generation
Code Code Available 3Pixel-Aware Stable Diffusion for Realistic Image Super-resolution and Personalized Stylization Aug 28, 2023 Image Enhancement Image Generation
Code Code Available 3