SOTAVerified

Image Reconstruction

Papers

Showing 150 of 2143 papers

TitleStatusHype
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale PredictionCode9
HART: Efficient Visual Generation with Hybrid Autoregressive TransformerCode9
MoVQ: Modulating Quantized Vectors for High-Fidelity Image GenerationCode5
Autoregressive Model Beats Diffusion: Llama for Scalable Image GenerationCode5
End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave ModelCode4
Metric3D: Towards Zero-shot Metric 3D Prediction from A Single ImageCode4
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual GenerationCode4
DeepInverse: A Python package for solving imaging inverse problems with deep learningCode4
High-Resolution Image Synthesis with Latent Diffusion ModelsCode4
Taming Scalable Visual Tokenizer for Autoregressive Image GenerationCode4
TokenFlow: Unified Image Tokenizer for Multimodal Understanding and GenerationCode3
An Image is Worth 32 Tokens for Reconstruction and GenerationCode3
Autoregressive Image Generation using Residual QuantizationCode3
SwinIR: Image Restoration Using Swin TransformerCode3
Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputsCode3
Image Quality Assessment for Magnetic Resonance ImagingCode3
High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain ActivityCode3
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image GenerationCode3
Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image ReconstructionCode3
Bidirectional Multi-Scale Implicit Neural Representations for Image DerainingCode3
MaskGIT: Masked Generative Image TransformerCode3
XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive GenerationCode3
Video Prediction Policy: A Generalist Robot Policy with Predictive Visual RepresentationsCode3
Unifying Vision, Text, and Layout for Universal Document ProcessingCode3
VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series ForecastersCode3
ImageFolder: Autoregressive Image Generation with Folded TokensCode3
Scaling the Codebook Size of VQGAN to 100,000 with a Utilization Rate of 99%Code2
Anomaly Detection with Conditioned Denoising Diffusion ModelsCode2
Snowflake Point Deconvolution for Point Cloud Completion and Generation with Skip-TransformerCode2
Preventing Local Pitfalls in Vector Quantization via Optimal TransportCode2
Orientation-Independent Chinese Text Recognition in Scene ImagesCode2
Q-Insight: Understanding Image Quality via Visual Reinforcement LearningCode2
Navigating Image Restoration with VAR's Distribution Alignment PriorCode2
A Modular and Robust Physics-Based Approach for Lensless Image ReconstructionCode2
MindBridge: A Cross-Subject Brain Decoding FrameworkCode2
Reconstructing the Mind's Eye: fMRI-to-Image with Contrastive Learning and Diffusion PriorsCode2
Towards Extreme Image Compression with Latent Feature Guidance and Diffusion PriorCode2
MaskBit: Embedding-free Image Generation via Bit TokensCode2
Learning A Sparse Transformer Network for Effective Image DerainingCode2
Implicit Neural Representation in Medical Imaging: A Comparative SurveyCode2
Invertible Diffusion Models for Compressed SensingCode2
Advancing MRI Reconstruction: A Systematic Review of Deep Learning and Compressed Sensing IntegrationCode2
IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation ModelCode2
Generative Adversarial Network in Medical Imaging: A ReviewCode2
Learning A Spiking Neural Network for Efficient Image DerainingCode2
MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image FusionCode2
ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion RefinementCode2
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group QuantizationCode2
DetailCLIP: Detail-Oriented CLIP for Fine-Grained TasksCode2
ASCNet: Asymmetric Sampling Correction Network for Infrared Image DestripingCode2
Show:102550
← PrevPage 1 of 43Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Taming-VQGAN (16x16)FID3.64Unverified
2VQGAN-LC (16x16)FID2.62Unverified
3MaskGIT-VQGAN (16x16)FID2.28Unverified
4RQ-VAE (8x8x16)FID1.83Unverified
5TiTok-S-128FID1.71Unverified
6MaskBit (16x16)FID1.66Unverified
7ViT-VQGAN (16x16)FID1.28Unverified
8Open-Magvit2 (16x16)FID1.17Unverified
9Mo-VQGAN (16x16x4)FID1.12Unverified
10IBQ (16x16)FID1Unverified
#ModelMetricClaimedVerifiedStatus
1VAR (16x16)rFID9.85Unverified
2VQGAN (16x16)rFID5.95Unverified
3LlamaGen (16x16)rFID5.59Unverified
4Open-Magvit2 (16x16)rFID4.18Unverified
5MGVQ (16x16x4)rFID1.59Unverified
6SD-VAE (16x16)rFID1.07Unverified
#ModelMetricClaimedVerifiedStatus
1pix2pixFID96.31Unverified
2bFTFID74.9Unverified
3Xian et al._FID60.85Unverified
4PI-RECFID0.07Unverified
#ModelMetricClaimedVerifiedStatus
1pix2pixFID197.49Unverified
2bFTFID121.2Unverified
3Xian et al._FID44.76Unverified
4PI-RECFID0.02Unverified
#ModelMetricClaimedVerifiedStatus
1Ours (STFT: magnitude, L1l=1) W-ReplicateSSIM0.88Unverified
2Ours (STFT: magnitude, L1l=1) WS-ReplicateSSIM0.86Unverified
#ModelMetricClaimedVerifiedStatus
1AugVAE-SLFID3.28Unverified
2AugVAE-MLFID1.04Unverified
#ModelMetricClaimedVerifiedStatus
1bFTFID58.4Unverified
#ModelMetricClaimedVerifiedStatus
1SwinSFAverage PSNR39.61Unverified