| NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 5 |
| PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Mar 7, 2024 | 4kImage Captioning | CodeCode Available | 5 |
| Scaling Granite Code Models to 128K Context | Jul 18, 2024 | 2k4k | CodeCode Available | 4 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4kLanguage Modeling | CodeCode Available | 4 |
| Highly Accurate Dichotomous Image Segmentation | Mar 6, 2022 | 2k3D Reconstruction | CodeCode Available | 4 |
| Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Jun 2, 2025 | 4kDescriptive | CodeCode Available | 3 |
| Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models | Mar 24, 2025 | 4kImage Generation | CodeCode Available | 3 |
| Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Feb 7, 2025 | 4kGeneral Knowledge | CodeCode Available | 3 |
| LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Jan 10, 2025 | 4kVisual Reasoning | CodeCode Available | 3 |
| PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Dec 16, 2024 | 3D Reconstruction4k | CodeCode Available | 3 |