SOTAVerified

8k

Papers

Showing 26–50 of 202 papers

Title | Status | Hype
Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method | Code | 2
SoccerTrack: A Dataset and Tracking Algorithm for Soccer With Fish-Eye and Drone Videos | Code | 2
CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model | Code | 2
GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution | Code | 1
SkyLadder: Better and Faster Pretraining via Context Window Scheduling | Code | 1
One ruler to measure them all: Benchmarking multilingual long-context language models | Code | 1
Reversing the Damage: A QP-Aware Transformer-Diffusion Approach for 8K Video Restoration under Codec Compression | Code | 1
Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding | Code | 1
C^2: Scalable Auto-Feedback for LLM-based Chart Generation | Code | 1
Facilitating Multi-turn Function Calling for LLMs via Compositional Instruction Tuning | Code | 1
AuthFace: Towards Authentic Blind Face Restoration with Face-oriented Generative Diffusion Prior | Code | 1
L-CiteEval: Do Long-Context Models Truly Leverage Context for Responding? | Code | 1
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization | Code | 1
Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Code | 1
FocusLLM: Precise Understanding of Long Context by Dynamic Condensing | Code | 1
SORSA: Singular Values and Orthonormal Regularized Singular Vectors Adaptation of Large Language Models | Code | 1
Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | Code | 1
Fast Kernel Scene Flow | Code | 1
Referring Expression Counting | Code | 1
4K-Resolution Photo Exposure Correction at 125 FPS with ~8K Parameters | Code | 1
A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture | Code | 1
M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models | Code | 1
Spatial LibriSpeech: An Augmented Dataset for Spatial Audio Learning | Code | 1
Recurrent Multi-scale Transformer for High-Resolution Salient Object Detection | Code | 1
VPP: Efficient Conditional 3D Generation via Voxel-Point Progressive Representation | Code | 1

No leaderboard results yet.