SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 34013410 of 474278 papers

TitleStatusHype
Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of CzechCode3
An Imitative Reinforcement Learning Framework for Autonomous DogfightCode3
DocGenome: An Open Large-scale Scientific Document Benchmark for Training and Testing Multi-modal Large Language ModelsCode3
AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive ReasoningCode3
HyperSIGMA: Hyperspectral Intelligence Comprehension Foundation ModelCode3
Refusal in Language Models Is Mediated by a Single DirectionCode3
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and RefinementCode3
Unveiling Encoder-Free Vision-Language ModelsCode3
AutoHallusion: Automatic Generation of Hallucination Benchmarks for Vision-Language ModelsCode3
AgileCoder: Dynamic Collaborative Agents for Software Development based on Agile MethodologyCode3
Show:102550
← PrevPage 341 of 47428Next →