| HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Oct 14, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 9 | 5 |
| Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Apr 3, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 9 | 5 |
| MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Sep 19, 2022 | DecoderImage Generation | CodeCode Available | 5 | 5 |
| Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Jun 10, 2024 | Conditional Image GenerationImage Generation | CodeCode Available | 5 | 5 |
| Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | Sep 6, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 4 | 5 |
| Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image | Jul 20, 2023 | Depth EstimationImage Reconstruction | CodeCode Available | 4 | 5 |
| End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave Model | Jun 2, 2024 | Image Reconstruction | CodeCode Available | 4 | 5 |
| High-Resolution Image Synthesis with Latent Diffusion Models | Dec 20, 2021 | DenoisingGPU | CodeCode Available | 4 | 5 |
| DeepInverse: A Python package for solving imaging inverse problems with deep learning | May 26, 2025 | Image Reconstruction | CodeCode Available | 4 | 5 |
| Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Dec 3, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 4 | 5 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Dec 19, 2024 | Contrastive LearningImage Reconstruction | CodeCode Available | 3 | 5 |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Jan 1, 2023 | DenoisingImage Reconstruction | CodeCode Available | 3 | 5 |
| Image Quality Assessment for Magnetic Resonance Imaging | Mar 15, 2022 | DenoisingImage Enhancement | CodeCode Available | 3 | 5 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 3 | 5 |
| An Image is Worth 32 Tokens for Reconstruction and Generation | Jun 11, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 3 | 5 |
| Unifying Vision, Text, and Layout for Universal Document Processing | Dec 5, 2022 | Document AIdocument understanding | CodeCode Available | 3 | 5 |
| VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Aug 30, 2024 | Image ReconstructionTime Series | CodeCode Available | 3 | 5 |
| SwinIR: Image Restoration Using Swin Transformer | Aug 23, 2021 | Color Image DenoisingDenoising | CodeCode Available | 3 | 5 |
| Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Apr 2, 2024 | Image ReconstructionRain Removal | CodeCode Available | 3 | 5 |
| MaskGIT: Masked Generative Image Transformer | Feb 8, 2022 | DecoderImage Generation | CodeCode Available | 3 | 5 |
| GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation | Apr 11, 2025 | DecoderImage Generation | CodeCode Available | 3 | 5 |
| Autoregressive Image Generation using Residual Quantization | Mar 3, 2022 | Conditional Image GenerationImage Generation | CodeCode Available | 3 | 5 |
| Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs | Jun 20, 2023 | Deep LearningImage Reconstruction | CodeCode Available | 3 | 5 |
| Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction | Nov 15, 2021 | Compressive SensingImage Reconstruction | CodeCode Available | 3 | 5 |
| TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation | Dec 4, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 3 | 5 |