| Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model | Feb 14, 2025 | Video GenerationVideo Reconstruction | CodeCode Available | 7 |
| Image and Video Tokenization with Binary Spherical Quantization | Jun 11, 2024 | DecoderImage Generation | CodeCode Available | 3 |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 |
| Motion Representations for Articulated Animation | Apr 22, 2021 | ObjectVideo Reconstruction | CodeCode Available | 3 |
| First Order Motion Model for Image Animation | Feb 29, 2020 | Image Animationmodel | CodeCode Available | 3 |
| LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models | Mar 18, 2025 | compressed sensingVideo Generation | CodeCode Available | 2 |
| Seeing World Dynamics in a Nutshell | Feb 5, 2025 | Video Reconstruction | CodeCode Available | 2 |
| LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior | Oct 28, 2024 | Video GenerationVideo Reconstruction | CodeCode Available | 2 |
| NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Oct 25, 2024 | SSIMVideo Reconstruction | CodeCode Available | 2 |
| Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space | May 22, 2025 | Video Reconstruction | CodeCode Available | 1 |
| V2V: Scaling Event-Based Vision through Efficient Video-to-Voxel Simulation | May 22, 2025 | Event-based visionOptical Flow Estimation | CodeCode Available | 1 |
| Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction | Mar 14, 2025 | Semantic SegmentationVideo Reconstruction | CodeCode Available | 1 |
| Rethinking Video Tokenization: A Conditioned Diffusion-based Approach | Mar 5, 2025 | DecoderVideo Compression | CodeCode Available | 1 |
| VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models | Nov 29, 2024 | DeblurringGPU | CodeCode Available | 1 |
| bit2bit: 1-bit quanta video reconstruction via self-supervised photon prediction | Oct 30, 2024 | DenoisingVideo Reconstruction | CodeCode Available | 1 |
| Cascaded Temporal Updating Network for Efficient Video Super-Resolution | Aug 26, 2024 | Super-ResolutionVideo Reconstruction | CodeCode Available | 1 |
| Bilateral Event Mining and Complementary for Event Stream Super-Resolution | May 16, 2024 | Object RecognitionSuper-Resolution | CodeCode Available | 1 |
| Towards Real-World HDR Video Reconstruction: A Large-Scale Benchmark Dataset and A Two-Stage Alignment Network | Apr 30, 2024 | Video Reconstruction | CodeCode Available | 1 |
| Collaborative Feedback Discriminative Propagation for Video Super-Resolution | Apr 6, 2024 | Super-ResolutionVideo Reconstruction | CodeCode Available | 1 |
| An Asynchronous Linear Filter Architecture for Hybrid Event-Frame Cameras | Sep 3, 2023 | Video Reconstruction | CodeCode Available | 1 |
| LAN-HDR: Luminance-based Alignment Network for High Dynamic Range Video Reconstruction | Aug 22, 2023 | HallucinationMotion Compensation | CodeCode Available | 1 |
| HyperE2VID: Improving Event-Based Video Reconstruction via Hypernetworks | May 10, 2023 | Event-Based Video ReconstructionVideo Reconstruction | CodeCode Available | 1 |
| EVREAL: Towards a Comprehensive Benchmark and Analysis Suite for Event-based Video Reconstruction | Apr 30, 2023 | Event-Based Video ReconstructionEvent-based vision | CodeCode Available | 1 |
| HNeRV: A Hybrid Neural Representation for Videos | Apr 5, 2023 | DecoderDenoising | CodeCode Available | 1 |
| Joint Video Multi-Frame Interpolation and Deblurring under Unknown Exposure Time | Mar 27, 2023 | Contrastive LearningDeblurring | CodeCode Available | 1 |