SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 201225 of 659983 papers

TitleStatusHype
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language ModelsCode9
ORPO: Monolithic Preference Optimization without Reference ModelCode9
LLM4Decompile: Decompiling Binary Code with Large Language ModelsCode9
Divide and Conquer: High-Resolution Industrial Anomaly Detection via Memory Efficient Tiled EnsembleCode9
Yi: Open Foundation Models by 01.AICode9
OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-onCode9
TripoSR: Fast 3D Object Reconstruction from a Single ImageCode9
World Model on Million-Length Video And Language With Blockwise RingAttentionCode9
UFO: A UI-Focused Agent for Windows OS InteractionCode9
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language ModelsCode9
Natural language guidance of high-fidelity text-to-speech with synthetic annotationsCode9
OLMo: Accelerating the Science of Language ModelsCode9
YOLO-World: Real-Time Open-Vocabulary Object DetectionCode9
Grounded SAM: Assembling Open-World Models for Diverse Visual TasksCode9
Steering Language Models with Game-Theoretic SolversCode9
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding BenchmarkCode9
Depth Anything: Unleashing the Power of Large-Scale Unlabeled DataCode9
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion ModelsCode9
DeepSeek LLM: Scaling Open-Source Language Models with LongtermismCode9
Perception Encoder: The best visual embeddings are not at the output of the networkCode8
GPT4All: An Ecosystem of Open Source Compressed Language ModelsCode8
Llama 2: Open Foundation and Fine-Tuned Chat ModelsCode8
Adapting Large Language Model with Speech for Fully Formatted End-to-End Speech RecognitionCode8
DETRs Beat YOLOs on Real-time Object DetectionCode8
Robust Speech Recognition via Large-Scale Weak SupervisionCode8
Show:102550
← PrevPage 9 of 26400Next →