| GSV3D: Gaussian Splatting-based Geometric Distillation with Stable Video Diffusion for Single-Image 3D Object Generation | Mar 8, 2025 | 3D GenerationDecoder | —Unverified | 0 |
| GVDIFF: Grounded Text-to-Video Generation with Diffusion Models | Jul 2, 2024 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models | Apr 14, 2025 | DenoisingText-to-Video Generation | —Unverified | 0 |
| HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models | Feb 28, 2025 | Action UnderstandingText-to-Video Generation | —Unverified | 0 |
| Hamiltonian GAN | Aug 22, 2023 | Inductive BiasVideo Generation | —Unverified | 0 |
| Hardware-Friendly Static Quantization Method for Video Diffusion Transformers | Feb 20, 2025 | QuantizationVideo Generation | —Unverified | 0 |
| HARIVO: Harnessing Text-to-Image Models for Video Generation | Oct 10, 2024 | DiversityVideo Generation | —Unverified | 0 |
| Harness Local Rewards for Global Benefits: Effective Text-to-Video Generation Alignment with Patch-level Reward Models | Feb 4, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| HeteroLLM: Accelerating Large Language Model Inference on Mobile SoCs platform with Heterogeneous AI Accelerators | Jan 11, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hierarchical Patch Diffusion Models for High-Resolution Video Generation | Jun 12, 2024 | Video Generation | —Unverified | 0 |
| Hierarchical Semantic Perceptual Listener Head Video Generation: A High-performance Pipeline | Jul 19, 2023 | DecoderTalking Head Generation | —Unverified | 0 |
| Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Dec 7, 2023 | Spatial ReasoningText-to-Video Generation | —Unverified | 0 |
| Hierarchical Video Generation for Complex Data | Jun 4, 2021 | Video Generation | —Unverified | 0 |
| Hierarchical Video Prediction Using Relational Layouts for Human-Object Interactions | Jun 19, 2021 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| HieraSurg: Hierarchy-Aware Diffusion Model for Surgical Video Generation | Jun 26, 2025 | Panoptic SegmentationSegmentation | —Unverified | 0 |
| High-Fidelity and Freely Controllable Talking Head Video Generation | Apr 20, 2023 | Face ModelTalking Head Generation | —Unverified | 0 |
| High-fidelity and Lip-synced Talking Face Synthesis via Landmark-based Diffusion Model | Aug 10, 2024 | Face GenerationTalking Face Generation | —Unverified | 0 |
| HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models | Mar 14, 2025 | Text-to-Video GenerationVideo Generation | —Unverified | 0 |
| HOIGen-1M: A Large-scale Dataset for Human-Object Interaction Video Generation | Mar 31, 2025 | HallucinationHuman-Object Interaction Detection | —Unverified | 0 |
| HOI-Swap: Swapping Objects in Videos with Hand-Object Interaction Awareness | Jun 11, 2024 | ObjectVideo Editing | —Unverified | 0 |
| How Do the Hearts of Deep Fakes Beat? Deep Fake Source Detection via Interpreting Residuals with Biological Signals | Aug 26, 2020 | Video Generation | —Unverified | 0 |
| How Far is Video Generation from World Model: A Physical Law Perspective | Nov 4, 2024 | Video Generation | —Unverified | 0 |
| How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models | Apr 3, 2025 | Video EditingVideo Generation | —Unverified | 0 |
| How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models | Jun 10, 2025 | DenoisingVideo Generation | —Unverified | 0 |
| HRVGAN: High Resolution Video Generation using Spatio-Temporal GAN | Aug 17, 2020 | Video GenerationVocal Bursts Intensity Prediction | —Unverified | 0 |
| Human4DiT: 360-degree Human Video Generation with 4D Diffusion Transformer | May 27, 2024 | Video Generation | —Unverified | 0 |
| Human Action CLIPs: Detecting AI-generated Human Motion | Nov 30, 2024 | Video Generation | —Unverified | 0 |
| Human-Activity AGV Quality Assessment: A Benchmark Dataset and an Objective Evaluation Metric | Nov 25, 2024 | Video GenerationVideo Quality Assessment | —Unverified | 0 |
| HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation | Feb 7, 2025 | FormPose Transfer | —Unverified | 0 |
| HumanDreamer: Generating Controllable Human-Motion Videos via Decoupled Generation | Mar 31, 2025 | Video Generation | —Unverified | 0 |
| Hunyuan-GameCraft: High-dynamic Interactive Game Video Generation with Hybrid History Condition | Jun 20, 2025 | Temporal SequencesVideo Generation | —Unverified | 0 |
| Hunyuan-Game: Industrial-grade Intelligent Game Creation Model | May 20, 2025 | Image GenerationImage to Video Generation | —Unverified | 0 |
| HunyuanVideo-HOMA: Generic Human-Object Interaction in Multimodal Driven Human Animation | Jun 10, 2025 | Human AnimationHuman-Object Interaction Detection | —Unverified | 0 |
| HuViDPO:Enhancing Video Generation through Direct Preference Optimization for Human-Centric Alignment | Feb 2, 2025 | Video Generation | —Unverified | 0 |
| Hybrid Video Diffusion Models with 2D Triplane and 3D Wavelet Representation | Feb 21, 2024 | Video GenerationVideo Reconstruction | —Unverified | 0 |
| I2V3D: Controllable image-to-video generation with 3D guidance | Mar 12, 2025 | 3D geometryImage to Video Generation | —Unverified | 0 |
| I2VControl-Camera: Precise Video Camera Control with Adjustable Motion Strength | Nov 10, 2024 | Video Generation | —Unverified | 0 |
| I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models | Jan 1, 2025 | Adversarial AttackImage to Video Generation | —Unverified | 0 |
| I4VGen: Image as Free Stepping Stone for Text-to-Video Generation | Jun 4, 2024 | DiversityImage Generation | —Unverified | 0 |
| iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering | Aug 12, 2021 | NeRFVideo Generation | —Unverified | 0 |
| Identifying and Solving Conditional Image Leakage in Image-to-Video Diffusion Model | Jun 22, 2024 | AttributeImage to Video Generation | —Unverified | 0 |
| iDiT-HOI: Inpainting-based Hand Object Interaction Reenactment via Video Diffusion Transformer | Jun 15, 2025 | ObjectVideo Generation | —Unverified | 0 |
| IF-MDM: Implicit Face Motion Diffusion Model for High-Fidelity Realtime Talking Head Generation | Dec 5, 2024 | DisentanglementTalking Head Generation | —Unverified | 0 |
| ILDiff: Generate Transparent Animated Stickers by Implicit Layout Distillation | Dec 30, 2024 | Image MattingVideo Generation | —Unverified | 0 |
| IllumiCraft: Unified Geometry and Illumination Diffusion for Controllable Video Generation | Jun 3, 2025 | 3D geometryVideo Generation | —Unverified | 0 |
| Imagen Video: High Definition Video Generation with Diffusion Models | Oct 5, 2022 | Image GenerationSuper-Resolution | —Unverified | 0 |
| Image-to-Video Generation via 3D Facial Dynamics | May 31, 2021 | Image to Video GenerationVideo Generation | —Unverified | 0 |
| Imagine360: Immersive 360 Video Generation from Perspective Anchor | Dec 4, 2024 | DenoisingVideo Denoising | —Unverified | 0 |
| Importance-Based Token Merging for Efficient Image and Video Generation | Nov 23, 2024 | Image GenerationVideo Generation | —Unverified | 0 |
| Impossible Videos | Mar 18, 2025 | counterfactualVideo Generation | —Unverified | 0 |