SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 891-900 of 661,570 papers

Title | Status | Hype
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond | Code | 5
MuSR: Testing the Limits of Chain-of-thought with Multistep Soft Reasoning | Code | 5
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models | Code | 5
Assessing Language Model Deployment with Risk Cards | Code | 5
UniVLA: Learning to Act Anywhere with Task-centric Latent Actions | Code | 5
SantaCoder: don't reach for the stars! | Code | 5
Uni-MoE: Scaling Unified Multimodal LLMs with Mixture of Experts | Code | 5
Evolutionary Optimization of Model Merging Recipes | Code | 5
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation | Code | 5
Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator | Code | 5