SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 36813690 of 177340 papers

TitleStatusHype
Vision-and-Language Navigation Today and Tomorrow: A Survey in the Era of Foundation ModelsCode3
Decoding-based RegressionCode3
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task SynthesisCode3
Demystifying Long Chain-of-Thought Reasoning in LLMsCode3
MAXIM: Multi-Axis MLP for Image ProcessingCode3
MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding BenchmarkCode3
Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and BaselineCode3
TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative DecodingCode3
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian LanguagesCode3
Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object DetectionCode3
Show:102550
← PrevPage 369 of 17734Next →