Next Patch Prediction for Autoregressive Visual Generation Dec 19, 2024 Image Generation Prediction
Code Code Available 2Causal Diffusion Transformers for Generative Modeling Dec 16, 2024 Decoder Image Generation
Code Code Available 2Financial Fine-tuning a Large Time Series Model Dec 13, 2024 Image Generation Prediction
Code Code Available 2Simple Guidance Mechanisms for Discrete Diffusion Models Dec 13, 2024 Image Generation
Code Code Available 2Diffusion-Enhanced Test-time Adaptation with Text and Image Augmentation Dec 12, 2024 Image Augmentation Image Generation
Code Code Available 2LAION-SG: An Enhanced Large-Scale Dataset for Training Complex Image-Text Models with Structural Annotations Dec 11, 2024 Attribute Image Generation
Code Code Available 2Proactive Agents for Multi-Turn Text-to-Image Generation Under Uncertainty Dec 9, 2024 Image Generation Text to Image Generation
Code Code Available 2EMOv2: Pushing 5M Vision Model Frontier Dec 9, 2024 Image Generation model
Code Code Available 2ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality Dec 5, 2024 Image Generation
Code Code Available 2Generative Photography: Scene-Consistent Camera Control for Realistic Text-to-Image Synthesis Dec 3, 2024 Image Generation
Code Code Available 2TextSSR: Diffusion-based Data Synthesis for Scene Text Recognition Dec 2, 2024 Image Generation Optical Character Recognition (OCR)
Code Code Available 2TinyFusion: Diffusion Transformers Learned Shallow Dec 2, 2024 Image Generation
Code Code Available 2X-Prompt: Towards Universal In-Context Image Generation in Auto-Regressive Vision Language Foundation Models Dec 2, 2024 Image Generation In-Context Learning
Code Code Available 2OmniFlow: Any-to-Any Generation with Multi-Modal Rectified Flows Dec 2, 2024 Audio Synthesis Image Generation
Code Code Available 2Playable Game Generation Dec 1, 2024 GPU Image Generation
Code Code Available 2TexGaussian: Generating High-quality PBR Material via Octree-based 3D Gaussian Splatting Nov 29, 2024 Denoising Image Generation
Code Code Available 2TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Nov 27, 2024 Garment Reconstruction Image Generation
Code Code Available 2Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints Nov 26, 2024 Denoising Image Generation
Code Code Available 2Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient Nov 26, 2024 GPU Image Generation
Code Code Available 2What Makes a Scene ? Scene Graph-based Evaluation and Feedback for Controllable Generation Nov 23, 2024 Image Generation Scene Generation
Code Code Available 2AnyText2: Visual Text Generation and Editing With Customizable Attributes Nov 22, 2024 Image Generation Text Generation
Code Code Available 2MMGenBench: Evaluating the Limits of LMMs from the Text-to-Image Generation Perspective Nov 21, 2024 Image Comprehension Image Generation
Code Code Available 2RAW-Diffusion: RGB-Guided Diffusion Models for High-Fidelity RAW Image Generation Nov 20, 2024 Image Generation object-detection
Code Code Available 2HyperGAN-CLIP: A Unified Framework for Domain Adaptation, Image Synthesis and Manipulation Nov 19, 2024 Domain Adaptation Image Generation
Code Code Available 2From Text to Pose to Image: Improving Diffusion Model Control and Quality Nov 19, 2024 Image Generation Prompt Engineering
Code Code Available 2M-VAR: Decoupled Scale-wise Autoregressive Modeling for High-Quality Image Generation Nov 15, 2024 Image Generation Mamba
Code Code Available 2Physics Informed Distillation for Diffusion Models Nov 13, 2024 Dataset Generation Image Generation
Code Code Available 2TIPO: Text to Image with Text Presampling for Prompt Optimization Nov 12, 2024 Image Generation Language Modeling
Code Code Available 2Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis Nov 11, 2024 Attribute Image Generation
Code Code Available 2GrounDiT: Grounding Diffusion Transformers via Noisy Patch Transplantation Oct 27, 2024 Image Generation Text to Image Generation
Code Code Available 2Adversarial Score identity Distillation: Rapidly Surpassing the Teacher in One Step Oct 19, 2024 Conditional Image Generation GPU
Code Code Available 2SeaS: Few-shot Industrial Anomaly Image Generation with Separation and Sharing Fine-tuning Oct 19, 2024 Image Generation
Code Code Available 2HiCo: Hierarchical Controllable Diffusion Model for Layout-to-image Generation Oct 18, 2024 Disentanglement Image Generation
Code Code Available 2BiGR: Harnessing Binary Latent Codes for Image Generation and Improved Visual Representation Capabilities Oct 18, 2024 Conditional Image Generation Image Generation
Code Code Available 2ARKit LabelMaker: A New Scale for Indoor 3D Scene Understanding Oct 17, 2024 3D Semantic Segmentation Image Generation
Code Code Available 2PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Oct 17, 2024 Diversity Image Generation
Code Code Available 2Fluid: Scaling Autoregressive Text-to-image Generative Models with Continuous Tokens Oct 17, 2024 Image Generation Text to Image Generation
Code Code Available 2Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective Oct 16, 2024 Conditional Image Generation Image Generation
Code Code Available 2TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control Oct 14, 2024 Disentanglement Image Generation
Code Code Available 2High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity Oct 14, 2024 Denoising Dichotomous Image Segmentation
Code Code Available 2EvolveDirector: Approaching Advanced Text-to-Image Generation with Large Vision-Language Models Oct 9, 2024 Image Generation Text to Image Generation
Code Code Available 2Think While You Generate: Discrete Diffusion with Planned Denoising Oct 8, 2024 Denoising Image Generation
Code Code Available 2Dynamic Diffusion Transformer Oct 4, 2024 Image Generation
Code Code Available 2A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation Oct 2, 2024 Image Generation Quantization
Code Code Available 2Effective Diffusion Transformer Architecture for Image Super-Resolution Sep 29, 2024 Image Generation Image Super-Resolution
Code Code Available 2Conditional Image Synthesis with Diffusion Models: A Survey Sep 28, 2024 Denoising Diversity
Code Code Available 2FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner Sep 26, 2024 Image Generation Text to Image Generation
Code Code Available 2Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation Sep 26, 2024 Image Generation Object
Code Code Available 2MonoFormer: One Transformer for Both Diffusion and Autoregression Sep 24, 2024 Image Generation Text Generation
Code Code Available 2MaskBit: Embedding-free Image Generation via Bit Tokens Sep 24, 2024 Conditional Image Generation Image Generation
Code Code Available 2