SOTAVerified

4k

Papers

Showing 51-100 of 367 papers

Title | Status | Hype
GeoPixel: Pixel Grounding Large Multimodal Model in Remote Sensing | Code | 2
Giraffe: Adventures in Expanding Context Lengths in LLMs | Code | 2
Surg-3M: A Dataset and Foundation Model for Perception in Surgical Settings | Code | 2
Pyramid Grafting Network for One-Stage High Resolution Saliency Detection | Code | 1
PhotoWCT^2: Compact Autoencoder for Photorealistic Style Transfer Resulting from Blockwise Training and Skip Connections of High-Frequency Residuals | Code | 1
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement | Code | 1
Capturing and Inferring Dense Full-Body Human-Scene Contact | Code | 1
CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Code | 1
End-to-End Speech Recognition from Federated Acoustic Models | Code | 1
ParkPredict+: Multimodal Intent and Motion Prediction for Vehicles in Parking Lots with CNN and Transformer | Code | 1
Real-Time Super-Resolution System of 4K-Video Based on Deep Learning | Code | 1
An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding | Code | 1
Bitstream-based Model Standard for 4K/UHD: ITU-T P.1204.3 - Model Details, Evaluation, Analysis and Open Source Implementation | Code | 1
BiFormer: Learning Bilateral Motion Estimation via Bilateral Transformer for 4K Video Frame Interpolation | Code | 1
Analog Foundation Models | Code | 1
One-Shot Affordance Detection | Code | 1
Multi-Scale Separable Network for Ultra-High-Definition Video Deblurring | Code | 1
NAS-HPO-Bench-II: A Benchmark Dataset on Joint Optimization of Convolutional Neural Network Architecture and Training Hyperparameters | Code | 1
4K-HAZE: A Dehazing Benchmark with 4K Resolution Hazy and Haze-Free Images | Code | 1
Best-Buddy GANs for Highly Detailed Image Super-Resolution | Code | 1
MobileMEF: Fast and Efficient Method for Multi-Exposure Fusion | Code | 1
Multi-Curve Translator for High-Resolution Photorealistic Image Translation | Code | 1
PAD: A Dataset and Benchmark for Pose-agnostic Anomaly Detection | Code | 1
Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Code | 1
MEFLUT: Unsupervised 1D Lookup Tables for Multi-exposure Image Fusion | Code | 1
BEATS: An Open-Source, High-Precision, Multi-Channel EEG Acquisition Tool System | Code | 1
MedOdyssey: A Medical Domain Benchmark for Long Context Evaluation Up to 200K Tokens | Code | 1
XResolution Correspondence Networks | Code | 1
AvatarMe: Realistically Renderable 3D Facial Reconstruction "in-the-wild" | Code | 1
Memory-Efficient Optical Flow via Radius-Distribution Orthogonal Cost Volume | Code | 1
LoCoCo: Dropping In Convolutions for Long Context Compression | Code | 1
Abg-CoQA: Clarifying Ambiguity in Conversational Question Answering | Code | 1
MACER: A Modular Framework for Accelerated Compilation Error Repair | Code | 1
Lexico: Extreme KV Cache Compression via Sparse Coding over Universal Dictionaries | Code | 1
Advanced computer vision for extracting georeferenced vehicle trajectories from drone imagery | Code | 1
Learning Image-adaptive 3D Lookup Tables for High Performance Photo Enhancement in Real-time | Code | 1
LM-Infinite: Zero-Shot Extreme Length Generalization for Large Language Models | Code | 1
MAILEX: Email Event and Argument Extraction | Code | 1
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Code | 1
ABPN: Adaptive Blend Pyramid Network for Real-Time Local Retouching of Ultra High-Resolution Photo | Code | 1
Meticulous Object Segmentation | Code | 1
Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance | Code | 1
AIM 2024 Challenge on UHD Blind Photo Quality Assessment | Code | 1
Illuminating Darkness: Enhancing Real-world Low-light Scenes with Smartphone Images | Code | 1
A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift | Code | 1
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks | Code | 1
Internal Video Inpainting by Implicit Long-range Propagation | Code | 1
MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention | Code | 1
Efficient Deep Models for Real-Time 4K Image Super-Resolution. NTIRE 2023 Benchmark and Report | Code | 1
Hybrid Cost Volume for Memory-Efficient Optical Flow | Code | 1

No leaderboard results yet.