SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 32413250 of 474278 papers

TitleStatusHype
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation AgentsCode3
Mambular: A Sequential Model for Tabular Deep LearningCode3
Music2Latent: Consistency Autoencoders for Latent Audio CompressionCode3
LLMServingSim: A HW/SW Co-Simulation Infrastructure for LLM Inference Serving at ScaleCode3
Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2Code3
MooER: LLM-based Speech Recognition and Translation Models from Moore ThreadsCode3
UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond ScalingCode3
DeepInteraction++: Multi-Modality Interaction for Autonomous DrivingCode3
Hyper-YOLO: When Visual Object Detection Meets Hypergraph ComputationCode3
BoFire: Bayesian Optimization Framework Intended for Real ExperimentsCode3
Show:102550
← PrevPage 325 of 47428Next →