| Unifying Vision, Text, and Layout for Universal Document Processing | Dec 5, 2022 | Document AIdocument understanding | CodeCode Available | 3 |
| Image Quality Assessment for Magnetic Resonance Imaging | Mar 15, 2022 | DenoisingImage Enhancement | CodeCode Available | 3 |
| Autoregressive Image Generation using Residual Quantization | Mar 3, 2022 | Conditional Image GenerationImage Generation | CodeCode Available | 3 |
| MaskGIT: Masked Generative Image Transformer | Feb 8, 2022 | DecoderImage Generation | CodeCode Available | 3 |
| Mask-guided Spectral-wise Transformer for Efficient Hyperspectral Image Reconstruction | Nov 15, 2021 | Compressive SensingImage Reconstruction | CodeCode Available | 3 |
| SwinIR: Image Restoration Using Swin Transformer | Aug 23, 2021 | Color Image DenoisingDenoising | CodeCode Available | 3 |
| MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization | Jul 14, 2025 | 2kImage Generation | CodeCode Available | 2 |
| Visual Text Processing: A Comprehensive Review and Unified Evaluation | Apr 30, 2025 | Image ManipulationImage Reconstruction | CodeCode Available | 2 |
| ILLUME+: Illuminating Unified MLLM with Dual Visual Tokenization and Diffusion Refinement | Apr 2, 2025 | DecoderImage Generation | CodeCode Available | 2 |
| Q-Insight: Understanding Image Quality via Visual Reinforcement Learning | Mar 28, 2025 | DescriptiveImage Quality Assessment | CodeCode Available | 2 |