| NeedleBench: Can LLMs Do Retrieval and Reasoning in Information-Dense Context? | Jul 16, 2024 | 4k8k | CodeCode Available | 9 |
| InternLM2 Technical Report | Mar 26, 2024 | 4kLong-Context Understanding | CodeCode Available | 9 |
| World Model on Million-Length Video And Language With Blockwise RingAttention | Feb 13, 2024 | 4kVideo Understanding | CodeCode Available | 9 |
| ComfyUI-R1: Exploring Reasoning Models for Workflow Generation | Jun 11, 2025 | 4k | CodeCode Available | 7 |
| Scaling Vision Pre-Training to 4K Resolution | Mar 25, 2025 | 4kContrastive Learning | CodeCode Available | 7 |
| Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation | Oct 10, 2024 | 4kImage Animation | CodeCode Available | 7 |
| LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models | Sep 21, 2023 | 4kGPU | CodeCode Available | 6 |
| FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness | May 27, 2022 | 16k4k | CodeCode Available | 6 |
| Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Dec 18, 2024 | 3D Reconstruction4k | CodeCode Available | 5 |
| Make Your LLM Fully Utilize the Context | Apr 25, 2024 | 4kInformation Retrieval | CodeCode Available | 5 |
| NTIRE 2024 Challenge on Low Light Image Enhancement: Methods and Results | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 5 |
| PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation | Mar 7, 2024 | 4kImage Captioning | CodeCode Available | 5 |
| Scaling Granite Code Models to 128K Context | Jul 18, 2024 | 2k4k | CodeCode Available | 4 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4kLanguage Modeling | CodeCode Available | 4 |
| Highly Accurate Dichotomous Image Segmentation | Mar 6, 2022 | 2k3D Reconstruction | CodeCode Available | 4 |
| Ultra-High-Resolution Image Synthesis: Data, Method and Evaluation | Jun 2, 2025 | 4kDescriptive | CodeCode Available | 3 |
| Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models | Mar 24, 2025 | 4kImage Generation | CodeCode Available | 3 |
| Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray | Feb 7, 2025 | 4kGeneral Knowledge | CodeCode Available | 3 |
| LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs | Jan 10, 2025 | 4kVisual Reasoning | CodeCode Available | 3 |
| PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting | Dec 16, 2024 | 3D Reconstruction4k | CodeCode Available | 3 |
| 360Zhinao Technical Report | May 22, 2024 | 4k | CodeCode Available | 3 |
| Real-Time 4K Super-Resolution of Compressed AVIF Images. AIS 2024 Challenge Survey | Apr 25, 2024 | 4kImage Super-Resolution | CodeCode Available | 3 |
| Data Engineering for Scaling Language Models to 128K Context | Feb 15, 2024 | 4kContinual Pretraining | CodeCode Available | 3 |
| VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training | Mar 23, 2022 | 4kAction Classification | CodeCode Available | 3 |
| Robust High-Resolution Video Matting with Temporal Guidance | Aug 25, 2021 | 4kGPU | CodeCode Available | 3 |
| Real-Time High-Resolution Background Matting | Dec 14, 2020 | 4kGPU | CodeCode Available | 3 |
| SeerAttention-R: Sparse Attention Adaptation for Long Reasoning | Jun 10, 2025 | 4kGPU | CodeCode Available | 2 |
| Learning Adaptive Parallel Reasoning with Language Models | Apr 21, 2025 | 4k | CodeCode Available | 2 |
| Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings | Mar 25, 2025 | 4kAction Recognition | CodeCode Available | 2 |
| MaSS13K: A Matting-level Semantic Segmentation Benchmark | Mar 24, 2025 | 4kImage Matting | CodeCode Available | 2 |
| Ultra-Resolution Adaptation with Ease | Mar 20, 2025 | 2k4k | CodeCode Available | 2 |
| DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario Understanding | Mar 13, 2025 | 4kAutonomous Driving | CodeCode Available | 2 |
| GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing | Jan 23, 2025 | 4k | CodeCode Available | 2 |
| CaPa: Carve-n-Paint Synthesis for Efficient 4K Textured Mesh Generation | Jan 16, 2025 | 3D Generation4k | CodeCode Available | 2 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Aug 30, 2024 | 4kDecoder | CodeCode Available | 2 |
| VFIMamba: Video Frame Interpolation with State Space Models | Jul 2, 2024 | 2k4k | CodeCode Available | 2 |
| UVEB: A Large-scale Benchmark and Baseline Towards Real-World Underwater Video Enhancement | Apr 22, 2024 | 4kImage Enhancement | CodeCode Available | 2 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 |
| LLoCO: Learning Long Contexts Offline | Apr 11, 2024 | 4kIn-Context Learning | CodeCode Available | 2 |
| Counting-Stars: A Multi-evidence, Position-aware, and Scalable Benchmark for Evaluating Long-Context Large Language Models | Mar 18, 2024 | 4kPosition | CodeCode Available | 2 |
| Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Oct 18, 2023 | 4kimage-classification | CodeCode Available | 2 |
| Giraffe: Adventures in Expanding Context Lengths in LLMs | Aug 21, 2023 | 16k4k | CodeCode Available | 2 |
| HyenaDNA: Long-Range Genomic Sequence Modeling at Single Nucleotide Resolution | Jun 27, 2023 | 4kIn-Context Learning | CodeCode Available | 2 |
| Bicubic++: Slim, Slimmer, Slimmest -- Designing an Industry-Grade Super-Resolution Network | May 3, 2023 | 4kImage Super-Resolution | CodeCode Available | 2 |
| Neural Preset for Color Style Transfer | Mar 23, 2023 | 4kColor Normalization | CodeCode Available | 2 |
| Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method | Dec 22, 2022 | 4k8k | CodeCode Available | 2 |
| 4K-NeRF: High Fidelity Neural Radiance Fields at Ultra High Resolutions | Dec 9, 2022 | 4kDecoder | CodeCode Available | 2 |
| Text2Light: Zero-Shot Text-Driven HDR Panorama Generation | Sep 20, 2022 | 4kinverse tone mapping | CodeCode Available | 2 |
| VEViD: Vision Enhancement via Virtual diffraction and coherent Detection | Aug 25, 2022 | 4kImage Enhancement | CodeCode Available | 2 |
| BoW3D: Bag of Words for Real-Time Loop Closing in 3D LiDAR SLAM | Aug 15, 2022 | 4kSimultaneous Localization and Mapping | CodeCode Available | 2 |