| InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds | Mar 29, 2024 | 3D Reconstruction, Novel View Synthesis | Code Available | 5 |
| ControlNet++: Improving Conditional Controls with Efficient Consistency Feedback | Apr 11, 2024 | SSIM | Code Available | 4 |
| Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration | Oct 1, 2024 | Blind Face Restoration, Image Colorization | Code Available | 4 |
| R^2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | May 31, 2024 | 3DGS, NeRF | Code Available | 4 |
| MagCache: Fast Video Generation with Magnitude-Aware Cache | Jun 10, 2025 | SSIM, Video Generation | Code Available | 3 |
| VidTok: A Versatile and Open-Source Video Tokenizer | Dec 17, 2024 | Quantization, SSIM | Code Available | 3 |
| The Unreasonable Effectiveness of Deep Features as a Perceptual Metric | Jan 11, 2018 | Image Quality Assessment, SSIM | Code Available | 3 |
| NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Oct 25, 2024 | SSIM, Video Reconstruction | Code Available | 2 |
| Distillation-Supervised Convolutional Low-Rank Adaptation for Efficient Image Super-Resolution | Apr 15, 2025 | Image Super-Resolution, Knowledge Distillation | Code Available | 2 |
| BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis | Nov 13, 2024 | NeRF, Novel View Synthesis | Code Available | 2 |
| SwinFuSR: an image fusion-inspired model for RGB-guided thermal image super-resolution | Apr 22, 2024 | Image Super-Resolution, SSIM | Code Available | 2 |
| Generalizable Human Gaussians from Single-View Image | Jun 10, 2024 | Novel View Synthesis, SSIM | Code Available | 2 |
| Brain Tumour Removing and Missing Modality Generation using 3D WDM | Nov 7, 2024 | GPU, Prediction | Code Available | 2 |
| IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model | May 16, 2024 | Image Enhancement, Image Reconstruction | Code Available | 2 |
| From NeRFs to Gaussian Splats, and Back | May 15, 2024 | SSIM | Code Available | 2 |
| Deep Learning-based Compression Detection for explainable Face Image Quality Assessment | Jan 7, 2025 | Face Image Quality, Face Image Quality Assessment | Code Available | 2 |
| Is Attention All That NeRF Needs? | Jul 27, 2022 | All, Generalizable Novel View Synthesis | Code Available | 2 |
| UVDoc: Neural Grid-based Document Unwarping | Feb 6, 2023 | distortion correction, MS-SSIM | Code Available | 2 |
| DVMSR: Distillated Vision Mamba for Efficient Super-Resolution | May 5, 2024 | Image Super-Resolution, Long-range modeling | Code Available | 2 |
| Volumetrically Consistent 3D Gaussian Rasterization | Dec 4, 2024 | 3DGS, SSIM | Code Available | 2 |
| InvisMark: Invisible and Robust Watermarking for AI-generated Image Provenance | Nov 10, 2024 | SSIM | Code Available | 2 |
| Cross-view Masked Diffusion Transformers for Person Image Synthesis | Feb 2, 2024 | Denoising, Image Generation | Code Available | 2 |
| SDCNet: Video Prediction Using Spatially-Displaced Convolution | Nov 2, 2018 | Optical Flow Estimation, Prediction | Code Available | 2 |
| DeblurDiNAT: A Compact Model with Exceptional Generalization and Visual Fidelity on Unseen Domains | Mar 19, 2024 | Deblurring, Decoder | Code Available | 1 |
| ADC-Net: An Open-Source Deep Learning Network for Automated Dispersion Compensation in Optical Coherence Tomography | Jan 29, 2022 | Decoder, MS-SSIM | Code Available | 1 |
| D'ARTAGNAN: Counterfactual Video Generation | Jun 3, 2022 | Anatomy, counterfactual | Code Available | 1 |
| Deep Convolutional Dictionary Learning for Image Denoising | Jun 19, 2021 | Denoising, Dictionary Learning | Code Available | 1 |
| DarkVisionNet: Low-Light Imaging via RGB-NIR Fusion with Deep Inconsistency Prior | Mar 13, 2023 | SSIM | Code Available | 1 |
| DARTS: Double Attention Reference-based Transformer for Super-resolution | Jul 17, 2023 | Image Super-Resolution, Knowledge Distillation | Code Available | 1 |
| DC-cycleGAN: Bidirectional CT-to-MR Synthesis from Unpaired Data | Nov 2, 2022 | Image Generation, SSIM | Code Available | 1 |
| D2C-SR: A Divergence to Convergence Approach for Real-World Image Super-Resolution | Mar 26, 2021 | Image Super-Resolution, SSIM | Code Available | 1 |
| CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment | Oct 2, 2024 | Astronomy, Image Quality Assessment | Code Available | 1 |
| DAOT: Domain-Agnostically Aligned Optimal Transport for Domain-Adaptive Crowd Counting | Aug 10, 2023 | Crowd Counting, Domain Adaptation | Code Available | 1 |
| Cross-Resolution Flow Propagation for Foveated Video Super-Resolution | Dec 27, 2022 | SSIM, Super-Resolution | Code Available | 1 |
| Context-adaptive Entropy Model for End-to-end Optimized Image Compression | Sep 27, 2018 | Image Compression, MS-SSIM | Code Available | 1 |
| ConsistentNeRF: Enhancing Neural Radiance Fields with 3D Consistency for Sparse View Synthesis | May 18, 2023 | 3D Reconstruction, NeRF | Code Available | 1 |
| DARES: Depth Anything in Robotic Endoscopic Surgery with Self-supervised Vector-LoRA of the Foundation Model | Aug 30, 2024 | 3D Reconstruction, Depth Estimation | Code Available | 1 |
| Deeper into Self-Supervised Monocular Indoor Depth Estimation | Dec 3, 2023 | Depth Estimation, Monocular Depth Estimation | Code Available | 1 |
| Cloud Removal in Satellite Images Using Spatiotemporal Generative Networks | Dec 14, 2019 | Cloud Removal, Earth Observation | Code Available | 1 |
| BF-STVSR: B-Splines and Fourier---Best Friends for High Fidelity Spatial-Temporal Video Super-Resolution | Jan 1, 2025 | Optical Flow Estimation, SSIM | Code Available | 1 |
| Combining Attention Module and Pixel Shuffle for License Plate Super-Resolution | Oct 30, 2022 | Image Super-Resolution, License Plate Recognition | Code Available | 1 |
| BemaGANv2: A Tutorial and Comparative Survey of GAN-based Vocoders for Long-Term Audio Generation | Jun 11, 2025 | Audio Generation, FAD | Code Available | 1 |
| BASNet: Boundary-Aware Salient Object Detection | Jun 1, 2019 | Camouflaged Object Segmentation, Decoder | Code Available | 1 |
| Bidirectional Stereo Image Compression with Cross-Dimensional Entropy Model | Jul 15, 2024 | Image Compression, MS-SSIM | Code Available | 1 |
| Better "CMOS" Produces Clearer Images: Learning Space-Variant Blur Estimation for Blind Image Super-Resolution | Apr 7, 2023 | Image Super-Resolution, SSIM | Code Available | 1 |
| Cloud removal in remote sensing images using generative adversarial networks and SAR-to-optical image translation | Dec 22, 2020 | Cloud Removal, SSIM | Code Available | 1 |
| A Text Attention Network for Spatial Deformation Robust Scene Text Image Super-resolution | Mar 17, 2022 | Image Super-Resolution, SSIM | Code Available | 1 |
| Assessing the (Un)Trustworthiness of Saliency Maps for Localizing Abnormalities in Medical Imaging | Aug 6, 2020 | SSIM | Code Available | 1 |
| ABLE-NeRF: Attention-Based Rendering with Learnable Embeddings for Neural Radiance Field | Mar 24, 2023 | NeRF, SSIM | Code Available | 1 |
| Attention-Guided Hierarchical Structure Aggregation for Image Matting | Jun 1, 2020 | Image Matting, SSIM | Code Available | 1 |