SOTAVerified

Image Reconstruction

Papers

Showing 1-50 of 2143 papers

Title | Status | Hype
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer | Code | 9
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Code | 9
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation | Code | 5
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Code | 5
DeepInverse: A Python package for solving imaging inverse problems with deep learning | Code | 4
Taming Scalable Visual Tokenizer for Autoregressive Image Generation | Code | 4
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation | Code | 4
End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave Model | Code | 4
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image | Code | 4
High-Resolution Image Synthesis with Latent Diffusion Models | Code | 4
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation | Code | 3
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Code | 3
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation | Code | 3
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation | Code | 3
ImageFolder: Autoregressive Image Generation with Folded Tokens | Code | 3
VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Code | 3
An Image is Worth 32 Tokens for Reconstruction and Generation | Code | 3
Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Code | 3
Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs | Code | 3
High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Code | 3
Unifying Vision, Text, and Layout for Universal Document Processing | Code | 3
Image Quality Assessment for Magnetic Resonance Imaging | Code | 3
Autoregressive Image Generation using Residual Quantization | Code | 3
MaskGIT: Masked Generative Image Transformer | Code | 3
Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction | Code | 3
SwinIR: Image Restoration Using Swin Transformer | Code | 3
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization | Code | 2
Visual Text Processing: A Comprehensive Review and Unified Evaluation | Code | 2
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement | Code | 2
Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Code | 2
Advancing MRI Reconstruction: A Systematic Review of Deep Learning and Compressed Sensing Integration | Code | 2
Navigating Image Restoration with VAR's Distribution Alignment Prior | Code | 2
Varformer: Adapting VAR's Generative Prior for Image Restoration | Code | 2
Preventing Local Pitfalls in Vector Quantization via Optimal Transport | Code | 2
MaskBit: Embedding-free Image Generation via Bit Tokens | Code | 2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained Tasks | Code | 2
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99% | Code | 2
IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | Code | 2
Learning A Spiking Neural Network for Efficient Image Deraining | Code | 2
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion Prior | Code | 2
MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion | Code | 2
MindBridge: A Cross-Subject Brain Decoding Framework | Code | 2
Invertible Diffusion Models for Compressed Sensing | Code | 2
A Modular and Robust Physics-Based Approach for Lensless Image Reconstruction | Code | 2
ASCNet: Asymmetric Sampling Correction Network for Infrared Image Destriping | Code | 2
DiAD: A Diffusion-based Framework for Multi-class Anomaly Detection | Code | 2
Controlling Vision-Language Models for Multi-Task Image Restoration | Code | 2
Orientation-Independent Chinese Text Recognition in Scene Images | Code | 2
Implicit Neural Representation in Medical Imaging: A Comparative Survey | Code | 2
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion Priors | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Taming-VQGAN (16x16) | FID | 3.64 | | Unverified
2 | VQGAN-LC (16x16) | FID | 2.62 | | Unverified
3 | MaskGIT-VQGAN (16x16) | FID | 2.28 | | Unverified
4 | RQ-VAE (8x8x16) | FID | 1.83 | | Unverified
5 | TiTok-S-128 | FID | 1.71 | | Unverified
6 | MaskBit (16x16) | FID | 1.66 | | Unverified
7 | ViT-VQGAN (16x16) | FID | 1.28 | | Unverified
8 | Open-Magvit2 (16x16) | FID | 1.17 | | Unverified
9 | Mo-VQGAN (16x16x4) | FID | 1.12 | | Unverified
10 | IBQ (16x16) | FID | 1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VAR (16x16) | rFID | 9.85 | | Unverified
2 | VQGAN (16x16) | rFID | 5.95 | | Unverified
3 | LlamaGen (16x16) | rFID | 5.59 | | Unverified
4 | Open-Magvit2 (16x16) | rFID | 4.18 | | Unverified
5 | MGVQ (16x16x4) | rFID | 1.59 | | Unverified
6 | SD-VAE (16x16) | rFID | 1.07 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | pix2pix | FID | 96.31 | | Unverified
2 | bFT | FID | 74.9 | | Unverified
3 | Xian et al._ | FID | 60.85 | | Unverified
4 | PI-REC | FID | 0.07 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | pix2pix | FID | 197.49 | | Unverified
2 | bFT | FID | 121.2 | | Unverified
3 | Xian et al._ | FID | 44.76 | | Unverified
4 | PI-REC | FID | 0.02 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Ours (STFT: magnitude, L1l=1) W-Replicate | SSIM | 0.88 | | Unverified
2 | Ours (STFT: magnitude, L1l=1) WS-Replicate | SSIM | 0.86 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AugVAE-SL | FID | 3.28 | | Unverified
2 | AugVAE-ML | FID | 1.04 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | bFT | FID | 58.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SwinSF | Average PSNR | 39.61 | | Unverified