| TrailBlazer: Trajectory Control for Diffusion-Based Video Generation | Dec 31, 2023 | Video Generation | Code Available | 1 |
| Free-Editor: Zero-shot Text-driven 3D Scene Editing | Dec 21, 2023 | 3D scene Editing, Style Transfer | Code Available | 1 |
| Towards Accurate Guided Diffusion Sampling through Symplectic Adjoint Method | Dec 19, 2023 | Video Generation | Code Available | 1 |
| PEEKABOO: Interactive Video Generation via Masked-Diffusion | Dec 12, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| MotionCrafter: One-Shot Motion Customization of Diffusion Models | Dec 8, 2023 | Disentanglement, Motion Disentanglement | Code Available | 1 |
| WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Dec 5, 2023 | Autonomous Driving, Diversity | Code Available | 1 |
| MagicStick: Controllable Video Editing via Control Handle Transformations | Dec 5, 2023 | Video Editing, Video Generation | Code Available | 1 |
| BIVDiff: A Training-Free Framework for General-Purpose Video Synthesis via Bridging Image and Video Diffusion Models | Dec 5, 2023 | Image Generation, Model Selection | Code Available | 1 |
| DragVideo: Interactive Drag-style Video Editing | Dec 3, 2023 | Video Editing, Video Generation | Code Available | 1 |
| VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models | Dec 1, 2023 | Video Editing, Video Generation | Code Available | 1 |
| MagDiff: Multi-Alignment Diffusion for High-Fidelity Video Generation and Editing | Nov 29, 2023 | Denoising, Image to Video Generation | Code Available | 1 |
| FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline | Nov 22, 2023 | SSIM, Text-to-Video Generation | Code Available | 1 |
| FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation | Nov 3, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| CVPR 2023 Text Guided Video Editing Competition | Oct 24, 2023 | Video Editing, Video Generation | Code Available | 1 |
| FreeNoise: Tuning-Free Longer Video Diffusion via Noise Rescheduling | Oct 23, 2023 | Video Generation | Code Available | 1 |
| EvalCrafter: Benchmarking and Evaluating Large Video Generation Models | Oct 17, 2023 | Benchmarking, Language Modelling | Code Available | 1 |
| ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | Oct 11, 2023 | Image Generation, Text to Image Generation | Code Available | 1 |
| Diverse and Aligned Audio-to-Video Generation via Text-to-Video Model Adaptation | Sep 28, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Sep 26, 2023 | Super-Resolution, Text-to-Video Generation | Code Available | 1 |
| Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator | Sep 25, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| Reuse and Diffuse: Iterative Denoising for Text-to-Video Generation | Sep 7, 2023 | Action Recognition, Decoder | Code Available | 1 |
| StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation | Aug 31, 2023 | Style Transfer, Unconditional Video Generation | Code Available | 1 |
| StoryBench: A Multifaceted Benchmark for Continuous Story Visualization | Aug 22, 2023 | Story Continuation, Story Generation | Code Available | 1 |
| Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer | Jul 15, 2023 | motion prediction, Pose Transfer | Code Available | 1 |
| DDLP: Unsupervised Object-Centric Video Prediction with Deep Dynamic Latent Particles | Jun 9, 2023 | Object, Position | Code Available | 1 |
| Learn the Force We Can: Enabling Sparse Motion Control in Multi-Object Video Generation | Jun 6, 2023 | Object, Video Generation | Code Available | 1 |
| Video Diffusion Models with Local-Global Context Guidance | Jun 5, 2023 | Future prediction, Prediction | Code Available | 1 |
| DirecT2V: Large Language Models are Frame-Level Directors for Zero-Shot Text-to-Video Generation | May 23, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation | May 18, 2023 | Image Generation, Text to Image Generation | Code Available | 1 |
| Sketching the Future (STF): Applying Conditional Control Techniques to Text-to-Video Models | May 10, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models | Apr 18, 2023 | Image Generation, Super-Resolution | Code Available | 1 |
| Generative Disco: Text-to-Video Generation for Music Visualization | Apr 17, 2023 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| Mask-conditioned latent diffusion for generating gastrointestinal polyp images | Apr 11, 2023 | Image Generation, Image Segmentation | Code Available | 1 |
| Generative Recommendation: Towards Next-generation Recommender Paradigm | Apr 7, 2023 | Recommendation Systems, Retrieval | Code Available | 1 |
| MoStGAN-V: Video Generation with Temporal Motion Styles | Apr 5, 2023 | Video Generation | Code Available | 1 |
| Fine-grained Audible Video Description | Mar 27, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Feature-Conditioned Cascaded Video Diffusion Models for Precise Echocardiogram Synthesis | Mar 22, 2023 | Image Generation, Video Generation | Code Available | 1 |
| Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers | Mar 20, 2023 | Video Generation | Code Available | 1 |
| MOSO: Decomposing MOtion, Scene and Object for Video Prediction | Mar 7, 2023 | Object, Unconditional Video Generation | Code Available | 1 |
| SeqDiffuSeq: Text Diffusion with Encoder-Decoder Transformers | Dec 20, 2022 | Decoder, Denoising | Code Available | 1 |
| Towards Smooth Video Composition | Dec 14, 2022 | Image Generation, single-image-generation | Code Available | 1 |
| VIDM: Video Implicit Diffusion Models | Dec 1, 2022 | Generative Adversarial Network, Video Generation | Code Available | 1 |
| Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation | Nov 23, 2022 | Text-to-Video Generation, Video Generation | Code Available | 1 |
| SinFusion: Training Diffusion Models on a Single Image or Video | Nov 21, 2022 | Diversity, Image Manipulation | Code Available | 1 |
| INR-V: A Continuous Representation Space for Video-based Generative Tasks | Oct 29, 2022 | Video Generation, Video Inpainting | Code Available | 1 |
| Make-A-Video: Text-to-Video Generation without Text-Video Data | Sep 29, 2022 | Decoder, Image Generation | Code Available | 1 |
| StyleFaceV: Face Video Generation via Decomposing and Recomposing Pretrained StyleGAN3 | Aug 16, 2022 | Image Generation, Video Generation | Code Available | 1 |
| 3D-Aware Video Generation | Jun 29, 2022 | Image Generation, Video Generation | Code Available | 1 |
| Diffusion Models for Video Prediction and Infilling | Jun 15, 2022 | Prediction, Video Generation | Code Available | 1 |
| Patch-based Object-centric Transformers for Efficient Video Generation | Jun 8, 2022 | Object, Video Editing | Code Available | 1 |