SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 78767900 of 177340 papers

TitleStatusHype
MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU LanguagesCode2
ColorMNet: A Memory-based Deep Spatial-Temporal Feature Propagation Network for Video ColorizationCode2
KVQ: Kwai Video Quality Assessment for Short-form VideosCode2
MedPromptX: Grounded Multimodal Prompting for Chest X-ray DiagnosisCode2
On Embeddings for Numerical Features in Tabular Deep LearningCode2
3D Vision with Transformers: A SurveyCode2
DSVT: Dynamic Sparse Voxel Transformer with Rotated SetsCode2
Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphsCode2
How to Merge Your Multimodal Models Over Time?Code2
PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for FinanceCode2
DS-1000: A Natural and Reliable Benchmark for Data Science Code GenerationCode2
Luminance-GS: Adapting 3D Gaussian Splatting to Challenging Lighting Conditions with View-Adaptive Curve AdjustmentCode2
MM-IFEngine: Towards Multimodal Instruction FollowingCode2
MoFE-Time: Mixture of Frequency Domain Experts for Time-Series Forecasting ModelsCode2
Animal Avatars: Reconstructing Animatable 3D Animals from Casual VideosCode2
CFBench: A Comprehensive Constraints-Following Benchmark for LLMsCode2
Maintaining Plasticity in Deep Continual LearningCode2
Text-Only Training for Image Captioning using Noise-Injected CLIPCode2
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading SystemsCode2
Leveraging Temporal Contextualization for Video Action RecognitionCode2
Towards Building Text-To-Speech Systems for the Next Billion UsersCode2
FlashSloth : Lightning Multimodal Large Language Models via Embedded Visual CompressionCode2
u-HuBERT: Unified Mixed-Modal Speech Pretraining And Zero-Shot Transfer to Unlabeled ModalityCode2
Monocular 3D Object Detection with Depth from MotionCode2
SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language ModelsCode2
Show:102550
← PrevPage 316 of 7094Next →