SOTAVerified

4k

Papers

Showing 51100 of 367 papers

TitleStatusHype
Towards Efficient and Scale-Robust Ultra-High-Definition Image DemoireingCode2
SoccerTrack: A Dataset and Tracking Algorithm for Soccer With Fish-Eye and Drone VideosCode2
Matryoshka Representation LearningCode2
Unfolding Spatial Cognition: Evaluating Multimodal Models on Visual SimulationsCode1
Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language ModelsCode1
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured AttentionCode1
Analog Foundation ModelsCode1
Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone ImagesCode1
Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec CompressionCode1
Lexico: Extreme KV Cache Compression via Sparse Coding over Universal DictionariesCode1
Advanced computer vision for extracting georeferenced vehicle trajectories from drone imageryCode1
AIM 2024 Challenge on UHD Blind Photo Quality AssessmentCode1
Hybrid Cost Volume for Memory-Efficient Optical FlowCode1
HiPrompt: Tuning-free Higher-Resolution Generation with Hierarchical MLLM PromptsCode1
Assessing UHD Image Quality from Aesthetics, Distortions, and SaliencyCode1
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language ModelsCode1
MobileMEF: Fast and Efficient Method for Multi-Exposure FusionCode1
MedOdyssey: A Medical Domain Benchmark for Long Context Evaluation Up to 200K TokensCode1
Ultra-High-Definition Image Restoration: New Benchmarks and A Dual Interaction Prior-Driven SolutionCode1
An Efficient Recipe for Long Context Extension via Middle-Focused Positional EncodingCode1
LoCoCo: Dropping In Convolutions for Long Context CompressionCode1
Towards Ultra-High-Definition Image Deraining: A Benchmark and An Efficient MethodCode1
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal TasksCode1
Asking Multimodal Clarifying Questions in Mixed-Initiative Conversational SearchCode1
Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost VolumeCode1
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K ParametersCode1
CLEX: Continuous Length Extrapolation for Large Language ModelsCode1
PAD: A Dataset and Benchmark for Pose-agnostic Anomaly DetectionCode1
MEFLUT: Unsupervised 1D Lookup Tables for Multi-exposure Image FusionCode1
Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation SurveillanceCode1
Towards Efficient SDRTV-to-HDRTV by Learning from Image FormationCode1
LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language ModelsCode1
StarSRGAN: Improving Real-World Blind Super-ResolutionCode1
Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 Benchmark and ReportCode1
Towards Real-Time 4K Image Super-ResolutionCode1
MAILEX: Email Event and Argument ExtractionCode1
SFD2: Semantic-guided Feature Detection and DescriptionCode1
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large InputCode1
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame InterpolationCode1
Form-NLU: Dataset for the Form Natural Language UnderstandingCode1
4K-HAZE: A Dehazing Benchmark with 4K Resolution Hazy and Haze-Free ImagesCode1
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question AnsweringCode1
Simulating analogue film damage to analyse and improve artefact restoration on high-resolution scansCode1
Fewer is More: Efficient Object Detection in Large Aerial ImagesCode1
MicroAST: Towards Super-Fast Ultra-Resolution Arbitrary Style TransferCode1
Efficient Feature Extraction for High-resolution Video Frame InterpolationCode1
Capturing and Inferring Dense Full-Body Human-Scene ContactCode1
ParkPredict+: Multimodal Intent and Motion Prediction for Vehicles in Parking Lots with CNN and TransformerCode1
Pyramid Grafting Network for One-Stage High Resolution Saliency DetectionCode1
STRPM: A Spatiotemporal Residual Predictive Model for High-Resolution Video PredictionCode1
Show:102550
← PrevPage 2 of 8Next →

No leaderboard results yet.