| Exposing AI-generated Videos: A Benchmark Dataset and a Local-and-Global Temporal Defect Based Detection Method | May 7, 2024 | Video Generation | Unverified | 0 |
| Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | May 7, 2024 | Face Generation, Talking Face Generation | Unverified | 0 |
| Vidu: a Highly Consistent, Dynamic and Skilled Text-to-Video Generator with Diffusion Models | May 7, 2024 | Video Generation, Video Prediction | Unverified | 0 |
| Matten: Video Generation with Mamba-Attention | May 5, 2024 | Mamba, Video Generation | Unverified | 0 |
| Bridge to Non-Barrier Communication: Gloss-Prompted Fine-grained Cued Speech Gesture Generation with Diffusion Model | Apr 30, 2024 | Descriptive, Gesture Generation | Unverified | 0 |
| Synthesizing Audio from Silent Video using Sequence to Sequence Modeling | Apr 25, 2024 | Decoder, Diversity | Code Available | 0 |
| MotionMaster: Training-free Camera Motion Transfer For Video Generation | Apr 24, 2024 | Disentanglement, Motion Disentanglement | Unverified | 0 |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Apr 22, 2024 | Denoising, GPU | Unverified | 0 |
| Motion-aware Latent Diffusion Models for Video Frame Interpolation | Apr 21, 2024 | Motion Estimation, Video Frame Interpolation | Unverified | 0 |
| Music Consistency Models | Apr 20, 2024 | Computational Efficiency, Music Generation | Unverified | 0 |
| PhysDreamer: Physics-Based Interaction with 3D Objects via Video Generation | Apr 19, 2024 | motion prediction, Object | Unverified | 0 |
| AniClipart: Clipart Animation with Text-to-Video Priors | Apr 18, 2024 | Image to Video Generation, Text-to-Video Generation | Unverified | 0 |
| SparseDM: Toward Sparse Efficient Diffusion Models | Apr 16, 2024 | GPU, Video Generation | Unverified | 0 |
| Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model | Apr 15, 2024 | GPU, Image Generation | Unverified | 0 |
| LoopAnimate: Loopable Salient Object Animation | Apr 14, 2024 | GPU, Object | Unverified | 0 |
| Action-conditioned video data improves predictability | Apr 8, 2024 | Video Generation | Unverified | 0 |
| AnimateZoo: Zero-shot Video Generation of Cross-Species Animation via Subject Alignment | Apr 7, 2024 | Video Editing, Video Generation | Unverified | 0 |
| Grid Diffusion Models for Text-to-Video Generation | Mar 30, 2024 | GPU, Image Generation | Unverified | 0 |
| A Review of Multi-Modal Large Language and Vision Models | Mar 28, 2024 | Image Captioning, Prompt Engineering | Unverified | 0 |
| Frame by Familiar Frame: Understanding Replication in Video Diffusion Models | Mar 28, 2024 | Image Generation, Video Generation | Unverified | 0 |
| Annotated Biomedical Video Generation using Denoising Diffusion Probabilistic Models and Flow Fields | Mar 26, 2024 | Cell Segmentation, Denoising | Code Available | 0 |
| TC4D: Trajectory-Conditioned Text-to-4D Generation | Mar 26, 2024 | Scene Generation, Video Generation | Unverified | 0 |
| Tutorial on Diffusion Models for Imaging and Vision | Mar 26, 2024 | Image Generation, Text to Image Generation | Unverified | 0 |
| A Survey on Long Video Generation: Challenges, Methods, and Prospects | Mar 25, 2024 | Survey, Video Generation | Unverified | 0 |
| TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models | Mar 25, 2024 | Image to Video Generation, Relational Reasoning | Unverified | 0 |
| Opportunities and challenges in the application of large artificial intelligence models in radiology | Mar 24, 2024 | Video Generation | Unverified | 0 |
| Spectral Motion Alignment for Video Motion Transfer using Diffusion Models | Mar 22, 2024 | Computational Efficiency, Video Generation | Unverified | 0 |
| Explorative Inbetweening of Time and Space | Mar 21, 2024 | Denoising, Video Generation | Unverified | 0 |
| Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition | Mar 21, 2024 | Video Generation | Unverified | 0 |
| Enabling Visual Composition and Animation in Unsupervised Video Generation | Mar 21, 2024 | Video Generation | Unverified | 0 |
| S2DM: Sector-Shaped Diffusion Models for Video Generation | Mar 20, 2024 | Image Generation, Optical Flow Estimation | Unverified | 0 |
| AnimateDiff-Lightning: Cross-Model Diffusion Distillation | Mar 19, 2024 | model, Video Generation | Unverified | 0 |
| Endora: Video Generation Models as Endoscopy Simulators | Mar 17, 2024 | Data Augmentation, Video Generation | Unverified | 0 |
| Animate Your Motion: Turning Still Images into Dynamic Videos | Mar 15, 2024 | Specificity, Text-to-Video Generation | Unverified | 0 |
| Video Editing via Factorized Diffusion Distillation | Mar 14, 2024 | Video Editing, Video Generation | Unverified | 0 |
| Intention-driven Ego-to-Exo Video Generation | Mar 14, 2024 | Optical Flow Estimation, Stereo Matching | Unverified | 0 |
| VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Mar 13, 2024 | Face Detection, Video Editing | Unverified | 0 |
| AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production | Mar 12, 2024 | Image Generation, RAG | Unverified | 0 |
| Video Generation with Consistency Tuning | Mar 11, 2024 | Video Generation | Unverified | 0 |
| BlazeBVD: Make Scale-Time Equalization Great Again for Blind Video Deflickering | Mar 10, 2024 | Video Generation, Video Temporal Consistency | Unverified | 0 |
| WorldGPT: A Sora-Inspired Video AI Agent as Rich World Models from Text and Image Inputs | Mar 10, 2024 | AI Agent, Video Generation | Unverified | 0 |
| FastVideoEdit: Leveraging Consistency Models for Efficient Text-to-Video Editing | Mar 10, 2024 | Image Generation, Text-to-Video Editing | Unverified | 0 |
| Sora as an AGI World Model? A Complete Survey on Text-to-Video Generation | Mar 8, 2024 | Articles, Hallucination | Unverified | 0 |
| A spatiotemporal style transfer algorithm for dynamic visual stimulus generation | Mar 7, 2024 | Image Generation, Object Recognition | Unverified | 0 |
| Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation | Mar 5, 2024 | Denoising, Image Animation | Unverified | 0 |
| AtomoVideo: High Fidelity Image-to-Video Generation | Mar 4, 2024 | Image Generation, Image to Video Generation | Unverified | 0 |
| Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Mar 1, 2024 | Object, object-detection | Unverified | 0 |
| Context-aware Talking Face Video Generation | Feb 28, 2024 | Video Generation, Video Synchronization | Unverified | 0 |
| Video as the New Language for Real-World Decision Making | Feb 27, 2024 | Decision Making, In-Context Learning | Unverified | 0 |
| EMO: Emote Portrait Alive -- Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions | Feb 27, 2024 | Video Generation | Unverified | 0 |