| HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Oct 14, 2024 | Image Generation, Image Reconstruction | Code Available | 9 | 5 |
| Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Apr 3, 2024 | Image Generation, Image Reconstruction | Code Available | 9 | 5 |
| Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Jun 10, 2024 | Conditional Image Generation, Image Generation | Code Available | 5 | 5 |
| MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Sep 19, 2022 | Decoder, Image Generation | Code Available | 5 | 5 |
| Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Dec 3, 2024 | Image Generation, Image Reconstruction | Code Available | 4 | 5 |
| Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image | Jul 20, 2023 | Depth Estimation, Image Reconstruction | Code Available | 4 | 5 |
| End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave Model | Jun 2, 2024 | Image Reconstruction | Code Available | 4 | 5 |
| Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | Sep 6, 2024 | Image Generation, Image Reconstruction | Code Available | 4 | 5 |
| High-Resolution Image Synthesis with Latent Diffusion Models | Dec 20, 2021 | Denoising, GPU | Code Available | 4 | 5 |
| DeepInverse: A Python package for solving imaging inverse problems with deep learning | May 26, 2025 | Image Reconstruction | Code Available | 4 | 5 |
| An Image is Worth 32 Tokens for Reconstruction and Generation | Jun 11, 2024 | Image Generation, Image Reconstruction | Code Available | 3 | 5 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Dec 19, 2024 | Contrastive Learning, Image Reconstruction | Code Available | 3 | 5 |
| TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation | Dec 4, 2024 | Image Generation, Image Reconstruction | Code Available | 3 | 5 |
| SwinIR: Image Restoration Using Swin Transformer | Aug 23, 2021 | Color Image Denoising, Denoising | Code Available | 3 | 5 |
| Autoregressive Image Generation using Residual Quantization | Mar 3, 2022 | Conditional Image Generation, Image Generation | Code Available | 3 | 5 |
| Image Quality Assessment for Magnetic Resonance Imaging | Mar 15, 2022 | Denoising, Image Enhancement | Code Available | 3 | 5 |
| Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs | Jun 20, 2023 | Deep Learning, Image Reconstruction | Code Available | 3 | 5 |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Jan 1, 2023 | Denoising, Image Reconstruction | Code Available | 3 | 5 |
| GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation | Apr 11, 2025 | Decoder, Image Generation | Code Available | 3 | 5 |
| Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Apr 2, 2024 | Image Reconstruction, Rain Removal | Code Available | 3 | 5 |
| Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction | Nov 15, 2021 | Compressive Sensing, Image Reconstruction | Code Available | 3 | 5 |
| XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation | Dec 2, 2024 | Image Reconstruction, Quantization | Code Available | 3 | 5 |
| Unifying Vision, Text, and Layout for Universal Document Processing | Dec 5, 2022 | Document AI, document understanding | Code Available | 3 | 5 |
| VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Aug 30, 2024 | Image Reconstruction, Time Series | Code Available | 3 | 5 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image Generation, Image Reconstruction | Code Available | 3 | 5 |
| MaskGIT: Masked Generative Image Transformer | Feb 8, 2022 | Decoder, Image Generation | Code Available | 3 | 5 |
| Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Jun 17, 2024 | image-classification, Image Classification | Code Available | 2 | 5 |
| Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | May 29, 2023 | Contrastive Learning, Image Reconstruction | Code Available | 2 | 5 |
| Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-Transformer | Feb 18, 2022 | Image Reconstruction, Point Cloud Completion | Code Available | 2 | 5 |
| Orientation-Independent Chinese Text Recognition in Scene Images | Sep 3, 2023 | Benchmarking, Image Reconstruction | Code Available | 2 | 5 |
| Preventing Local Pitfalls in Vector Quantization via Optimal Transport | Dec 19, 2024 | Image Reconstruction, Quantization | Code Available | 2 | 5 |
| MindBridge: A Cross-Subject Brain Decoding Framework | Apr 11, 2024 | Brain Decoding, Data Augmentation | Code Available | 2 | 5 |
| MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization | Jul 14, 2025 | 2k, Image Generation | Code Available | 2 | 5 |
| Navigating Image Restoration with VAR's Distribution Alignment Prior | Jan 1, 2025 | Image Reconstruction, Image Restoration | Code Available | 2 | 5 |
| Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Mar 28, 2025 | Descriptive, Image Quality Assessment | Code Available | 2 | 5 |
| Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Apr 29, 2024 | Image Compression, Image Reconstruction | Code Available | 2 | 5 |
| MaskBit: Embedding-free Image Generation via Bit Tokens | Sep 24, 2024 | Conditional Image Generation, Image Generation | Code Available | 2 | 5 |
| Advancing MRI Reconstruction: A Systematic Review of Deep Learning and Compressed Sensing Integration | Jan 24, 2025 | compressed sensing, Federated Learning | Code Available | 2 | 5 |
| Learning A Sparse Transformer Network for Effective Image Deraining | Mar 21, 2023 | Image Reconstruction, Image Restoration | Code Available | 2 | 5 |
| Implicit Neural Representation in Medical Imaging: A Comparative Survey | Jul 30, 2023 | Domain Adaptation, Image Reconstruction | Code Available | 2 | 5 |
| Invertible Diffusion Models for Compressed Sensing | Mar 25, 2024 | compressed sensing, GPU | Code Available | 2 | 5 |
| IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | May 16, 2024 | Image Enhancement, Image Reconstruction | Code Available | 2 | 5 |
| Generative Adversarial Network in Medical Imaging: A Review | Sep 19, 2018 | Data Augmentation, Domain Adaptation | Code Available | 2 | 5 |
| ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement | Apr 2, 2025 | Decoder, Image Generation | Code Available | 2 | 5 |
| Learning A Spiking Neural Network for Efficient Image Deraining | May 10, 2024 | Image Reconstruction, Rain Removal | Code Available | 2 | 5 |
| MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion | Apr 12, 2024 | Image Reconstruction, Mamba | Code Available | 2 | 5 |
| Anomaly Detection with Conditioned Denoising Diffusion Models | May 25, 2023 | Anomaly Detection, Denoising | Code Available | 2 | 5 |
| A Modular and Robust Physics-Based Approach for Lensless Image Reconstruction | Mar 1, 2024 | Image Reconstruction | Code Available | 2 | 5 |
| DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks | Sep 10, 2024 | Contrastive Learning, Image Reconstruction | Code Available | 2 | 5 |
| ASCNet: Asymmetric Sampling Correction Network for Infrared Image Destriping | Jan 28, 2024 | Feature Upsampling, Image Reconstruction | Code Available | 2 | 5 |