| GenDDS: Generating Diverse Driving Video Scenarios with Prompt-to-Video Generative Model | Aug 28, 2024 | Autonomous DrivingData Augmentation | —Unverified | 0 | 0 |
| GenDeF: Learning Generative Deformation Field for Video Generation | Dec 7, 2023 | DisentanglementVideo Editing | —Unverified | 0 | 0 |
| Gender Bias in Text-to-Video Generation Models: A case study of Sora | Dec 30, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| Generating, Fast and Slow: Scalable Parallel Video Generation with Video Interface Networks | Mar 21, 2025 | DenoisingOptical Flow Estimation | —Unverified | 0 | 0 |
| Generating Human Action Videos by Coupling 3D Game Engines and Probabilistic Graphical Models | Oct 12, 2019 | Action RecognitionOptical Flow Estimation | —Unverified | 0 | 0 |
| Generating Persuasive Visual Storylines for Promotional Videos | Aug 30, 2019 | ClusteringPersuasiveness | —Unverified | 0 | 0 |
| Generating time-consistent dynamics with discriminator-guided image diffusion models | May 14, 2025 | Video Generation | —Unverified | 0 | 0 |
| Generating Videos with Scene Dynamics | Sep 8, 2016 | Action ClassificationFuture prediction | —Unverified | 0 | 0 |
| Generative AI for Autonomous Driving: A Review | May 21, 2025 | Autonomous DrivingImage Generation | —Unverified | 0 | 0 |
| Generative Ghost: Investigating Ranking Bias Hidden in AI-Generated Videos | Feb 11, 2025 | Contrastive LearningImage Retrieval | —Unverified | 0 | 0 |
| Generative Pre-trained Autoregressive Diffusion Transformer | May 12, 2025 | Few-Shot LearningVideo Generation | —Unverified | 0 | 0 |
| Generative Rendering: Controllable 4D-Guided Video Generation with 2D Diffusion Models | Dec 3, 2023 | Image GenerationText to Image Generation | —Unverified | 0 | 0 |
| Generative Video Propagation | Dec 27, 2024 | Image to Video GenerationVideo Generation | —Unverified | 0 | 0 |
| Generative Video Transformer: Can Objects be the Words? | Jul 20, 2021 | GPUScene Understanding | —Unverified | 0 | 0 |
| GenLit: Reformulating Single-Image Relighting as Video Generation | Dec 15, 2024 | Image GenerationImage Relighting | —Unverified | 0 | 0 |
| GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration | Dec 5, 2024 | AttributeHallucination | —Unverified | 0 | 0 |
| GenTron: Diffusion Transformers for Image and Video Generation | Dec 7, 2023 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| GenVidBench: A Challenging Benchmark for Detecting AI-Generated Video | Jan 20, 2025 | Video ClassificationVideo Generation | —Unverified | 0 | 0 |
| GenWorld: Towards Detecting AI-generated Real-world Simulation Videos | Jun 12, 2025 | Video Generation | —Unverified | 0 | 0 |
| GeoMan: Temporally Consistent Human Geometry Estimation using Image-to-Video Diffusion | May 29, 2025 | Depth EstimationImage to Video Generation | —Unverified | 0 | 0 |
| Geometry-aware 4D Video Generation for Robot Manipulation | Jul 1, 2025 | Robot ManipulationVideo Generation | —Unverified | 0 | 0 |
| GEVRM: Goal-Expressive Video Generation Model For Robust Visual Manipulation | Feb 13, 2025 | Contrastive LearningVideo Generation | —Unverified | 0 | 0 |
| GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning | Jun 12, 2025 | GPUVideo Generation | —Unverified | 0 | 0 |
| GiVE: Guiding Visual Encoder to Perceive Overlooked Information | Oct 26, 2024 | ObjectQuestion Answering | —Unverified | 0 | 0 |
| GlocalNet: Class-aware Long-term Human Motion Synthesis | Dec 19, 2020 | Motion SynthesisPedestrian Trajectory Prediction | —Unverified | 0 | 0 |
| Goal-Conditioned Video Prediction | Sep 25, 2019 | Imitation LearningPrediction | —Unverified | 0 | 0 |
| GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning | Nov 21, 2023 | Image GenerationText-to-Video Generation | —Unverified | 0 | 0 |
| GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation | Nov 25, 2023 | Instruction FollowingLanguage Modeling | —Unverified | 0 | 0 |
| GR-2: A Generative Video-Language-Action Model with Web-Scale Knowledge for Robot Manipulation | Oct 8, 2024 | Multi-Task LearningRobot Manipulation | —Unverified | 0 | 0 |
| Grid Diffusion Models for Text-to-Video Generation | Mar 30, 2024 | GPUImage Generation | —Unverified | 0 | 0 |
| GS-DiT: Advancing Video Generation with Dynamic 3D Gaussian Fields through Efficient Dense 3D Point Tracking | Jan 1, 2025 | Novel View SynthesisPoint Tracking | —Unverified | 0 | 0 |
| GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking | Jan 5, 2025 | Novel View SynthesisPoint Tracking | —Unverified | 0 | 0 |
| GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation | Mar 8, 2025 | 3D GenerationDecoder | —Unverified | 0 | 0 |
| GVDIFF: Grounded Text-to-Video Generation with Diffusion Models | Jul 2, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models | Apr 14, 2025 | DenoisingText-to-Video Generation | —Unverified | 0 | 0 |
| HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models | Feb 28, 2025 | Action UnderstandingText-to-Video Generation | —Unverified | 0 | 0 |
| Hamiltonian GAN | Aug 22, 2023 | Inductive BiasVideo Generation | —Unverified | 0 | 0 |
| Hardware-Friendly Static Quantization Method for Video Diffusion Transformers | Feb 20, 2025 | QuantizationVideo Generation | —Unverified | 0 | 0 |
| HARIVO: Harnessing Text-to-Image Models for Video Generation | Oct 10, 2024 | DiversityVideo Generation | —Unverified | 0 | 0 |
| Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models | Feb 4, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |
| HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators | Jan 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Hierarchical Patch Diffusion Models for High-Resolution Video Generation | Jun 12, 2024 | Video Generation | —Unverified | 0 | 0 |
| Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline | Jul 19, 2023 | DecoderTalking Head Generation | —Unverified | 0 | 0 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Dec 7, 2023 | Spatial ReasoningText-to-Video Generation | —Unverified | 0 | 0 |
| Hierarchical Video Generation for Complex Data | Jun 4, 2021 | Video Generation | —Unverified | 0 | 0 |
| Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions | Jun 19, 2021 | Human-Object Interaction DetectionObject | —Unverified | 0 | 0 |
| HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation | Jun 26, 2025 | Panoptic SegmentationSegmentation | —Unverified | 0 | 0 |
| High-Fidelity and Freely Controllable Talking Head Video Generation | Apr 20, 2023 | Face ModelTalking Head Generation | —Unverified | 0 | 0 |
| High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model | Aug 10, 2024 | Face GenerationTalking Face Generation | —Unverified | 0 | 0 |
| HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | Mar 14, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 | 0 |