| Sora Generates Videos with Stunning Geometrical Consistency | Feb 27, 2024 | 3D Reconstruction, Video Generation | Unverified | 0 |
| Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Feb 22, 2024 | Video Generation | Unverified | 0 |
| Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | Feb 22, 2024 | Image Generation, Text-to-Video Generation | Unverified | 0 |
| Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Feb 21, 2024 | Video Generation, Video Reconstruction | Unverified | 0 |
| VGMShield: Mitigating Misuse of Video Generative Models | Feb 20, 2024 | Video Generation | Code Available | 0 |
| Denoising Diffusion Probabilistic Models in Six Simple Steps | Feb 6, 2024 | Denoising, Video Generation | Unverified | 0 |
| Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion | Feb 5, 2024 | Object, Video Generation | Unverified | 0 |
| NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties | Feb 2, 2024 | Contrastive Learning, SSIM | Unverified | 0 |
| A Survey on Generative AI and LLM for Video Generation, Understanding, and Streaming | Jan 30, 2024 | Video Generation, Video Understanding | Unverified | 0 |
| Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling | Jan 29, 2024 | Image to Video Generation, Video Generation | Unverified | 0 |
| Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Jan 18, 2024 | Denoising, Position | Unverified | 0 |
| CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects | Jan 18, 2024 | Object, Text-to-Video Generation | Unverified | 0 |
| WorldDreamer: Towards General World Models for Video Generation via Predicting Masked Tokens | Jan 18, 2024 | Video Editing, Video Generation | Unverified | 0 |
| Inflation with Diffusion: Efficient Temporal Adaptation for Text-to-Video Super-Resolution | Jan 18, 2024 | Super-Resolution, Video Generation | Unverified | 0 |
| UniVG: Towards UNIfied-modal Video Generation | Jan 17, 2024 | Video Generation | Unverified | 0 |
| Towards A Better Metric for Text-to-Video Generation | Jan 15, 2024 | Mixture-of-Experts, Text-to-Video Generation | Unverified | 0 |
| 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Jan 12, 2024 | Video Generation | Unverified | 0 |
| RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks | Jan 11, 2024 | Generative Adversarial Network, Optical Flow Estimation | Unverified | 0 |
| MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation | Jan 9, 2024 | MORPH, Video Generation | Unverified | 0 |
| Neural Rendering and Its Hardware Acceleration: A Review | Jan 6, 2024 | 3D Reconstruction, Deep Learning | Unverified | 0 |
| PromptCoT: Align Prompt Distribution via Adapted Chain-of-Thought | Jan 1, 2024 | Computational Efficiency, Prompt Engineering | Unverified | 0 |
| SNED: Superposition Network Architecture Search for Efficient Video Diffusion Model | Jan 1, 2024 | Video Generation | Unverified | 0 |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Jan 1, 2024 | GPU, Image Animation | Unverified | 0 |
| DiffPerformer: Iterative Learning of Consistent Latent Guidance for Diffusion-based Human Video Generation | Jan 1, 2024 | Video Generation | Unverified | 0 |
| On the Content Bias in Frechet Video Distance | Jan 1, 2024 | Video Generation | Unverified | 0 |
| FlashVideo: A Framework for Swift Inference in Text-to-Video Generation | Dec 30, 2023 | Text-to-Video Generation, Video Generation | Unverified | 0 |
| A Recipe for Scaling up Text-to-Video Generation with Text-free Videos | Dec 25, 2023 | Image Generation, Text to Image Generation | Unverified | 0 |
| Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models | Dec 21, 2023 | Synthetic Data Generation, Video Generation | Unverified | 0 |
| VideoPoet: A Large Language Model for Zero-Shot Video Generation | Dec 21, 2023 | Decoder, Language Modeling | Unverified | 0 |
| InstructVideo: Instructing Video Diffusion Models with Human Feedback | Dec 19, 2023 | Video Generation | Unverified | 0 |
| VideoLCM: Video Latent Consistency Model | Dec 14, 2023 | Computational Efficiency, Image Generation | Unverified | 0 |
| Photorealistic Video Generation with Diffusion Models | Dec 11, 2023 | Super-Resolution, Text-to-Video Generation | Unverified | 0 |
| DreaMoving: A Human Video Generation Framework based on Diffusion Models | Dec 8, 2023 | Video Generation | Unverified | 0 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Dec 7, 2023 | Spatial Reasoning, Text-to-Video Generation | Unverified | 0 |
| GenDeF: Learning Generative Deformation Field for Video Generation | Dec 7, 2023 | Disentanglement, Video Editing | Unverified | 0 |
| GenTron: Diffusion Transformers for Image and Video Generation | Dec 7, 2023 | Text-to-Video Generation, Video Generation | Unverified | 0 |
| MEVG: Multi-event Video Generation with Text-to-Video Models | Dec 7, 2023 | Video Generation | Unverified | 0 |
| DreamVideo: Composing Your Dream Videos with Customized Subject and Motion | Dec 7, 2023 | Image Generation, Video Generation | Unverified | 0 |
| NewMove: Customizing text-to-video models with novel motions | Dec 7, 2023 | Text-to-Video Generation, Video Generation | Unverified | 0 |
| FAAC: Facial Animation Generation with Anchor Frame and Conditional Control for Superior Fidelity and Editability | Dec 6, 2023 | Face Model, Video Generation | Unverified | 0 |
| LivePhoto: Real Image Animation with Text-guided Motion Control | Dec 5, 2023 | Image Animation, Text-to-Video Generation | Unverified | 0 |
| Fine-grained Controllable Video Generation via Object Appearance and Context | Dec 5, 2023 | Text-to-Video Generation, Video Generation | Unverified | 0 |
| DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance | Dec 5, 2023 | Image to Video Generation, Video Generation | Unverified | 0 |
| Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models | Dec 3, 2023 | Image Generation, Text to Image Generation | Unverified | 0 |
| VideoBooth: Diffusion-based Video Generation with Image Prompts | Dec 1, 2023 | Video Generation | Unverified | 0 |
| MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation | Nov 30, 2023 | Image Generation, Text to Image Generation | Unverified | 0 |
| VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Nov 30, 2023 | Semantic Segmentation, Video Editing | Unverified | 0 |
| ARTV: Auto-Regressive Text-to-Video Generation with Diffusion Models | Nov 30, 2023 | Text-to-Video Generation, Video Generation | Unverified | 0 |
| MotionZero: Exploiting Motion Priors for Zero-shot Text-to-Video Generation | Nov 28, 2023 | Disentanglement, Text-to-Video Generation | Unverified | 0 |
| FlowZero: Zero-Shot Text-to-Video Synthesis with LLM-Driven Dynamic Scene Syntax | Nov 27, 2023 | Video Generation | Unverified | 0 |