SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 33913400 of 474278 papers

TitleStatusHype
VisualRWKV: Exploring Recurrent Neural Networks for Visual Language ModelsCode3
APPL: A Prompt Programming Language for Harmonious Integration of Programs and Large Language Model PromptsCode3
GenAI-Bench: Evaluating and Improving Compositional Text-to-Visual GenerationCode3
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?Code3
SpatialBot: Precise Spatial Understanding with Vision Language ModelsCode3
DF40: Toward Next-Generation Deepfake DetectionCode3
Detecting hallucinations in large language models using semantic entropyCode3
VoCo-LLaMA: Towards Vision Compression with Large Language ModelsCode3
TSI-Bench: Benchmarking Time Series ImputationCode3
Open-Source Web Service with Morphological Dictionary-Supplemented Deep Learning for Morphosyntactic Analysis of CzechCode3
Show:102550
← PrevPage 340 of 47428Next →