| Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing | Jul 20, 2022 | 4kImage Enhancement | CodeCode Available | 2 |
| SoccerTrack: A Dataset and Tracking Algorithm for Soccer With Fish-Eye and Drone Videos | Jun 20, 2022 | 4k8k | CodeCode Available | 2 |
| Matryoshka Representation Learning | May 26, 2022 | 4kImage Classification | CodeCode Available | 2 |
| Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual Simulations | Jun 5, 2025 | 4kSpatial Reasoning | CodeCode Available | 1 |
| Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | May 29, 2025 | 2k4k | CodeCode Available | 1 |
| MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention | May 24, 2025 | 16k4k | CodeCode Available | 1 |
| Analog Foundation Models | May 14, 2025 | 4kQuantization | CodeCode Available | 1 |
| Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images | Mar 10, 2025 | 4kBenchmarking | CodeCode Available | 1 |
| Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Dec 12, 2024 | 4k8k | CodeCode Available | 1 |
| Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Dec 12, 2024 | 4kGSM8K | CodeCode Available | 1 |
| Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Nov 4, 2024 | 4kgeo-localization | CodeCode Available | 1 |
| AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Sep 24, 2024 | 4kComputational Efficiency | CodeCode Available | 1 |
| Hybrid Cost Volume for Memory-Efficient Optical Flow | Sep 6, 2024 | 4kOptical Flow Estimation | CodeCode Available | 1 |
| HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM Prompts | Sep 4, 2024 | 4kDenoising | CodeCode Available | 1 |
| Assessing UHD Image Quality from Aesthetics, Distortions, and Saliency | Sep 1, 2024 | 4kImage Quality Assessment | CodeCode Available | 1 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k4k | CodeCode Available | 1 |
| MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Aug 15, 2024 | 4kComputational Efficiency | CodeCode Available | 1 |
| MedOdyssey: A Medical Domain Benchmark for Long Context Evaluation Up to 200K Tokens | Jun 21, 2024 | 4kFairness | CodeCode Available | 1 |
| Ultra-High-Definition Image Restoration: New Benchmarks and A Dual Interaction Prior-Driven Solution | Jun 19, 2024 | 4kDeblurring | CodeCode Available | 1 |
| An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding | Jun 11, 2024 | 4k | CodeCode Available | 1 |
| LoCoCo: Dropping In Convolutions for Long Context Compression | Jun 8, 2024 | 4k | CodeCode Available | 1 |
| Towards Ultra-High-Definition Image Deraining: A Benchmark and An Efficient Method | May 27, 2024 | 4kImage Reconstruction | CodeCode Available | 1 |
| m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Mar 17, 2024 | 4k | CodeCode Available | 1 |
| Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational Search | Feb 12, 2024 | 4kConversational Search | CodeCode Available | 1 |
| Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost Volume | Dec 6, 2023 | 4kOptical Flow Estimation | CodeCode Available | 1 |
| 4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters | Nov 15, 2023 | 4k8k | CodeCode Available | 1 |
| CLEX: Continuous Length Extrapolation for Large Language Models | Oct 25, 2023 | 4kPosition | CodeCode Available | 1 |
| PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection | Oct 11, 2023 | 4kAnomaly Detection | CodeCode Available | 1 |
| MEFLUT: Unsupervised 1D Lookup Tables for Multi-exposure Image Fusion | Sep 21, 2023 | 4kGPU | CodeCode Available | 1 |
| Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance | Sep 15, 2023 | 2k4k | CodeCode Available | 1 |
| Towards Efficient SDRTV-to-HDRTV by Learning from Image Formation | Sep 8, 2023 | 4k | CodeCode Available | 1 |
| LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models | Aug 30, 2023 | 2k4k | CodeCode Available | 1 |
| StarSRGAN: Improving Real-World Blind Super-Resolution | Jul 30, 2023 | 4kBlind Super-Resolution | CodeCode Available | 1 |
| Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 Benchmark and Report | Jun 1, 2023 | 4kImage Super-Resolution | CodeCode Available | 1 |
| Towards Real-Time 4K Image Super-Resolution | Jun 1, 2023 | 4kImage Super-Resolution | CodeCode Available | 1 |
| MAILEX: Email Event and Argument Extraction | May 22, 2023 | 4k8k | CodeCode Available | 1 |
| SFD2: Semantic-guided Feature Detection and Description | Apr 28, 2023 | 2k4k | CodeCode Available | 1 |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Apr 13, 2023 | 2k4k | CodeCode Available | 1 |
| BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Apr 5, 2023 | 4kMotion Estimation | CodeCode Available | 1 |
| Form-NLU: Dataset for the Form Natural Language Understanding | Apr 4, 2023 | 4kForm | CodeCode Available | 1 |
| 4K-HAZE: A Dehazing Benchmark with 4K Resolution Hazy and Haze-Free Images | Mar 28, 2023 | 4kDepth Estimation | CodeCode Available | 1 |
| TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering | Mar 21, 2023 | 4kImage Generation | CodeCode Available | 1 |
| Simulating analogue film damage to analyse and improve artefact restoration on high-resolution scans | Feb 20, 2023 | 4kDenoising | CodeCode Available | 1 |
| Fewer is More: Efficient Object Detection in Large Aerial Images | Dec 26, 2022 | 4kObject | CodeCode Available | 1 |
| MicroAST: Towards Super-Fast Ultra-Resolution Arbitrary Style Transfer | Nov 28, 2022 | 4kDecoder | CodeCode Available | 1 |
| Efficient Feature Extraction for High-resolution Video Frame Interpolation | Nov 25, 2022 | 4kDimensionality Reduction | CodeCode Available | 1 |
| Capturing and Inferring Dense Full-Body Human-Scene Contact | Jun 20, 2022 | 4kContact Detection | CodeCode Available | 1 |
| ParkPredict+: Multimodal Intent and Motion Prediction for Vehicles in Parking Lots with CNN and Transformer | Apr 17, 2022 | 4kmotion prediction | CodeCode Available | 1 |
| Pyramid Grafting Network for One-Stage High Resolution Saliency Detection | Apr 11, 2022 | 4k8k | CodeCode Available | 1 |
| STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video Prediction | Mar 30, 2022 | 4kVideo Prediction | CodeCode Available | 1 |