| Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing | Jul 20, 2022 | 4kImage Enhancement | CodeCode Available | 2 |
| SoccerTrack: A Dataset and Tracking Algorithm for Soccer With Fish-Eye and Drone Videos | Jun 20, 2022 | 4k8k | CodeCode Available | 2 |
| Matryoshka Representation Learning | May 26, 2022 | 4kImage Classification | CodeCode Available | 2 |
| Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations | Jun 5, 2025 | 4kSpatial Reasoning | CodeCode Available | 1 |
| Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | May 29, 2025 | 2k4k | CodeCode Available | 1 |
| MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention | May 24, 2025 | 16k4k | CodeCode Available | 1 |
| Analog Foundation Models | May 14, 2025 | 4kQuantization | CodeCode Available | 1 |
| Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images | Mar 10, 2025 | 4kBenchmarking | CodeCode Available | 1 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Dec 12, 2024 | 4k8k | CodeCode Available | 1 |
| Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Dec 12, 2024 | 4kGSM8K | CodeCode Available | 1 |
| Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Nov 4, 2024 | 4kgeo-localization | CodeCode Available | 1 |
| AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Sep 24, 2024 | 4kComputational Efficiency | CodeCode Available | 1 |
| Hybrid Cost Volume for Memory-Efficient Optical Flow | Sep 6, 2024 | 4kOptical Flow Estimation | CodeCode Available | 1 |
| HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Sep 4, 2024 | 4kDenoising | CodeCode Available | 1 |
| Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency | Sep 1, 2024 | 4kImage Quality Assessment | CodeCode Available | 1 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k4k | CodeCode Available | 1 |
| MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Aug 15, 2024 | 4kComputational Efficiency | CodeCode Available | 1 |
| MedOdyssey: A Medical Domain Benchmark for Long Context Evaluation Up to 200K Tokens | Jun 21, 2024 | 4kFairness | CodeCode Available | 1 |
| Ultra-High-Definition Image Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution | Jun 19, 2024 | 4kDeblurring | CodeCode Available | 1 |
| An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding | Jun 11, 2024 | 4k | CodeCode Available | 1 |
| LoCoCo: Dropping In Convolutions for Long Context Compression | Jun 8, 2024 | 4k | CodeCode Available | 1 |
| Towards Ultra-High-Definition Image Deraining: A Benchmark and An Efficient Method | May 27, 2024 | 4kImage Reconstruction | CodeCode Available | 1 |
| m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Mar 17, 2024 | 4k | CodeCode Available | 1 |
| Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search | Feb 12, 2024 | 4kConversational Search | CodeCode Available | 1 |
| Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost Volume | Dec 6, 2023 | 4kOptical Flow Estimation | CodeCode Available | 1 |