SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers257,941 code links4,818 tasks

Papers

Showing 1120 of 658356 papers

TitleStatusHype
LightRAG: Simple and Fast Retrieval-Augmented GenerationCode14
Optimizing Instructions and Demonstrations for Multi-Stage Language Model ProgramsCode14
Assisting in Writing Wikipedia-like Articles From Scratch with Large Language ModelsCode14
TradingAgents: Multi-Agents LLM Financial Trading FrameworkCode14
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria RerankingCode13
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All ToolsCode13
UI-TARS: Pioneering Automated GUI Interaction with Native AgentsCode13
Qwen2 Technical ReportCode13
R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint OptimizationCode13
1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on CPUsCode13
Show:102550
← PrevPage 2 of 65836Next →