| TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation | Mar 14, 2025 | Imitation Learning, Object | —Unverified | 0 |
| TC4D: Trajectory-Conditioned Text-to-4D Generation | Mar 26, 2024 | Scene Generation, Video Generation | —Unverified | 0 |
| TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models | Dec 21, 2024 | Quantization, Video Generation | —Unverified | 0 |
| Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge | Nov 18, 2024 | Video Generation | —Unverified | 0 |
| Technical Report: Competition Solution For Modelscope-Sora | Sep 24, 2024 | Text-to-Video Generation, Video Description | —Unverified | 0 |
| Teller: Real-Time Streaming Audio-Driven Portrait Animation with Autoregressive Motion Generation | Mar 24, 2025 | Motion Generation, Portrait Animation | —Unverified | 0 |
| Temporal Regularization Makes Your Video Generator Stronger | Mar 19, 2025 | Diversity, Video Generation | —Unverified | 0 |
| TesserAct: Learning 4D Embodied World Models | Apr 29, 2025 | Novel View Synthesis, Video Generation | —Unverified | 0 |
| Text2Sign: Towards Sign Language Production Using Neural Machine Translation and Generative Adversarial Networks | Jan 2, 2020 | Generative Adversarial Network, Machine Translation | —Unverified | 0 |
| Text2Story: Advancing Video Storytelling with Text Guidance | Mar 8, 2025 | Form, Image Generation | —Unverified | 0 |
| Text-Animator: Controllable Visual Text Video Generation | Jun 25, 2024 | Text Generation, Video Generation | —Unverified | 0 |
| Text-driven Video Prediction | Oct 6, 2022 | Causal Inference, Prediction | —Unverified | 0 |
| The Art of Storytelling: Multi-Agent Generative AI for Dynamic Multimodal Narratives | Sep 17, 2024 | text-to-speech, Text to Speech | —Unverified | 0 |
| The Dawn of Video Generation: Preliminary Explorations with SORA-like Models | Oct 7, 2024 | Video Generation | —Unverified | 0 |
| The Devil is in the Prompts: Retrieval-Augmented Prompt Optimization for Text-to-Video Generation | Apr 16, 2025 | Sentence, Text-to-Video Generation | —Unverified | 0 |
| The Lost Melody: Empirical Observations on Text-to-Video Generation From A Storytelling Perspective | May 13, 2024 | Text-to-Video Generation, Video Generation | —Unverified | 0 |
| The Power of Sound (TPoS): Audio Reactive Video Generation with Stable Diffusion | Sep 8, 2023 | Video Generation | —Unverified | 0 |
| The Role of Video Generation in Enhancing Data-Limited Action Understanding | May 26, 2025 | Action Recognition, Action Understanding | —Unverified | 0 |
| The Tug-of-War Between Deepfake Generation and Detection | Jul 8, 2024 | Face Swapping, Misinformation | —Unverified | 0 |
| Think Before You Diffuse: LLMs-Guided Physics-Aware Video Generation | May 27, 2025 | Large Language Model, Multimodal Large Language Model | —Unverified | 0 |
| This&That: Language-Gesture Controlled Video Generation for Robot Planning | Jul 8, 2024 | Task Planning, Video Generation | —Unverified | 0 |
| Through-The-Mask: Mask-based Motion Trajectories for Image-to-Video Generation | Jan 6, 2025 | Image to Video Generation, Object | —Unverified | 0 |
| Tiger200K: Manually Curated High Visual Quality Video Dataset from UGC Platform | Apr 21, 2025 | Boundary Detection, Optical Character Recognition (OCR) | —Unverified | 0 |
| TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation | Nov 5, 2024 | Image to Video Generation, Misinformation | —Unverified | 0 |
| TIV-Diffusion: Towards Object-Centric Movement for Text-driven Image to Video Generation | Dec 13, 2024 | Image to Video Generation, Object | —Unverified | 0 |
| TiVGAN: Text to Image to Video Generation with Step-by-Step Evolutionary Generator | Sep 4, 2020 | Generative Adversarial Network, Image Generation | —Unverified | 0 |
| TokenMotion: Decoupled Motion Control via Token Disentanglement for Human-centric Video Generation | Apr 11, 2025 | Disentanglement, Video Generation | —Unverified | 0 |
| ToonifyGB: StyleGAN-based Gaussian Blendshapes for 3D Stylized Head Avatars | May 15, 2025 | Image Stylization, Video Generation | —Unverified | 0 |
| Tora2: Motion and Appearance Customized Diffusion Transformer for Multi-Entity Video Generation | Jul 8, 2025 | Video Generation | —Unverified | 0 |
| Zeroth-order Informed Fine-Tuning for Diffusion Model: A Recursive Likelihood Ratio Optimizer | Feb 2, 2025 | Reinforcement Learning (RL), Video Generation | —Unverified | 0 |
| Reenact Anything: Semantic Video Motion Transfer Using Motion-Textual Inversion | Aug 1, 2024 | Face Reenactment, Video Generation | —Unverified | 0 |
| VidGen-1M: A Large-Scale Dataset for Text-to-video Generation | Aug 5, 2024 | Text-to-Video Generation, Video Generation | —Unverified | 0 |
| Aquarius: A Family of Industry-Level Video Generation Models for Marketing Scenarios | May 14, 2025 | Marketing, Video Generation | —Unverified | 0 |
| Face Consistency Benchmark for GenAI Video | May 16, 2025 | Video Generation | —Unverified | 0 |
| 360DVD: Controllable Panorama Video Generation with 360-Degree Video Diffusion Model | Jan 12, 2024 | Video Generation | —Unverified | 0 |
| 3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models | Nov 25, 2022 | Denoising, NeRF | —Unverified | 0 |
| 3D Gaussian Splatting with Normal Information for Mesh Extraction and Improved Rendering | Jan 14, 2025 | Novel View Synthesis, Video Generation | —Unverified | 0 |
| 3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors | Oct 21, 2024 | 3DGS, Decoder | —Unverified | 0 |
| 3DTrajMaster: Mastering 3D Trajectory for Multi-Entity Motion in Video Generation | Dec 10, 2024 | Video Generation | —Unverified | 0 |
| 4Diffusion: Multi-view Video Diffusion Model for 4D Generation | May 31, 2024 | NeRF, Video Generation | —Unverified | 0 |
| 4Real: Towards Photorealistic 4D Scene Generation via Video Diffusion Models | Jun 11, 2024 | Scene Generation, Video Generation | —Unverified | 0 |
| Abductive Ego-View Accident Video Understanding for Safe Driving Perception | Mar 1, 2024 | Object, object-detection | —Unverified | 0 |
| AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers | Nov 27, 2024 | Camera Pose Estimation, Pose Estimation | —Unverified | 0 |
| Accelerating Diffusion Sampling via Exploiting Local Transition Coherence | Mar 12, 2025 | Denoising, Video Generation | —Unverified | 0 |
| Accelerating Image Generation with Sub-path Linear Approximation Model | Apr 22, 2024 | Denoising, GPU | —Unverified | 0 |
| Accelerating Video Diffusion Models via Distribution Matching | Dec 8, 2024 | Denoising, Video Generation | —Unverified | 0 |
| AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports | Mar 26, 2025 | Autonomous Driving, NeRF | —Unverified | 0 |
| ACDC: Autoregressive Coherent Multimodal Generation using Diffusion Correction | Oct 7, 2024 | multimodal generation, Story Generation | —Unverified | 0 |
| Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts | May 22, 2025 | Dialogue Generation, Large Language Model | —Unverified | 0 |
| Action Concept Grounding Network for Semantically-Consistent Video Generation | Sep 28, 2020 | Action Recognition, object-detection | —Unverified | 0 |