| GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation | Apr 11, 2025 | Decoder, Image Generation | Code Available | 3 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Dec 19, 2024 | Contrastive Learning, Image Reconstruction | Code Available | 3 |
| TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation | Dec 4, 2024 | Image Generation, Image Reconstruction | Code Available | 3 |
| XQ-GAN: An Open-source Image Tokenization Framework for Autoregressive Generation | Dec 2, 2024 | Image Reconstruction, Quantization | Code Available | 3 |
| ImageFolder: Autoregressive Image Generation with Folded Tokens | Oct 2, 2024 | Image Generation, Image Reconstruction | Code Available | 3 |
| VisionTS: Visual Masked Autoencoders Are Free-Lunch Zero-Shot Time Series Forecasters | Aug 30, 2024 | Image Reconstruction, Time Series | Code Available | 3 |
| An Image is Worth 32 Tokens for Reconstruction and Generation | Jun 11, 2024 | Image Generation, Image Reconstruction | Code Available | 3 |
| Bidirectional Multi-Scale Implicit Neural Representations for Image Deraining | Apr 2, 2024 | Image Reconstruction, Rain Removal | Code Available | 3 |
| Improving visual image reconstruction from human brain activity using latent diffusion models via multiple decoded inputs | Jun 20, 2023 | Deep Learning, Image Reconstruction | Code Available | 3 |
| High-Resolution Image Reconstruction With Latent Diffusion Models From Human Brain Activity | Jan 1, 2023 | Denoising, Image Reconstruction | Code Available | 3 |