SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 37513760 of 177340 papers

TitleStatusHype
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
VideoGPT+: Integrating Image and Video Encoders for Enhanced Video UnderstandingCode3
AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language ModelsCode3
BiGym: A Demo-Driven Mobile Bi-Manual Manipulation BenchmarkCode3
MP-SfM: Monocular Surface Priors for Robust Structure-from-MotionCode3
Recurrent Drafter for Fast Speculative Decoding in Large Language ModelsCode3
MAD-ICP: It Is All About Matching Data -- Robust and Informed LiDAR OdometryCode3
MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare CopilotCode3
AER: Auto-Encoder with Regression for Time Series Anomaly DetectionCode3
Various Lengths, Constant Speed: Efficient Language Modeling with Lightning AttentionCode3
Show:102550
← PrevPage 376 of 17734Next →