SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 776-800 of 659,983 papers

Title | Status | Hype
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives | Code | 5
Cosmos World Foundation Model Platform for Physical AI | Code | 5
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models | Code | 5
Exploring GLU Expansion Ratios: A Study of Structured Pruning in LLaMA-3.2 Models | Code | 5
HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs | Code | 5
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search | Code | 5
Don't Do RAG: When Cache-Augmented Generation is All You Need for Knowledge Tasks | Code | 5
LongBench v2: Towards Deeper Understanding and Reasoning on Realistic Long-context Multitasks | Code | 5
Noisereduce: Domain General Noise Reduction for Time Series Signals | Code | 5
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference | Code | 5
Prompting Depth Anything for 4K Resolution Accurate Metric Depth Estimation | Code | 5
DUET: Dual Clustering Enhanced Multivariate Time Series Forecasting | Code | 5
SCBench: A KV Cache-Centric Analysis of Long-Context Methods | Code | 5
Representing Long Volumetric Video with Temporal Gaussian Hierarchy | Code | 5
SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos | Code | 5
Arbitrary-steps Image Super-resolution via Diffusion Inversion | Code | 5
Learning Flow Fields in Attention for Controllable Person Image Generation | Code | 5
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations | Code | 5
EDiT: A Local-SGD-Based Efficient Distributed Training Method for Large Language Models | Code | 5
Training Large Language Models to Reason in a Continuous Latent Space | Code | 5
The BrowserGym Ecosystem for Web Agent Research | Code | 5
DEIM: DETR with Improved Matching for Fast Convergence | Code | 5
MegaSaM: Accurate, Fast, and Robust Structure and Motion from Casual Dynamic Videos | Code | 5
MV-Adapter: Multi-view Consistent Image Generation Made Easy | Code | 5
Free Process Rewards without Process Labels | Code | 5