SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 12611270 of 661570 papers

TitleStatusHype
HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and ManipulationCode4
The Era of 1-bit LLMs: All Large Language Models are in 1.58 BitsCode4
SAT: Dynamic Spatial Aptitude Training for Multimodal Language ModelsCode4
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RLCode4
UniWorld-V1: High-Resolution Semantic Encoders for Unified Visual Understanding and GenerationCode4
Unified Reward Model for Multimodal Understanding and GenerationCode4
TorchRL: A data-driven decision-making library for PyTorchCode4
What Makes Good In-Context Examples for GPT-3?Code4
LLM Comparator: Visual Analytics for Side-by-Side Evaluation of Large Language ModelsCode4
AGS-Mesh: Adaptive Gaussian Splatting and Meshing with Geometric Priors for Indoor Room Reconstruction Using SmartphonesCode4
Show:102550
← PrevPage 127 of 66157Next →