SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 1391-1400 of 661,570 papers

Title | Status | Hype
R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning | Code | 4
LettuceDetect: A Hallucination Detection Framework for RAG Applications | Code | 4
Recent Advances in Large Language Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation | Code | 4
REFINE: Inversion-Free Backdoor Defense via Model Reprogramming | Code | 4
Natural Language Generation | Code | 4
SurveyX: Academic Survey Automation via Large Language Models | Code | 4
Building reliable sim driving agents by scaling self-play | Code | 4
LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention | Code | 4
Craw4LLM: Efficient Web Crawling for LLM Pretraining | Code | 4
A deep learning framework for efficient pathology image analysis | Code | 4