SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 2650 of 177340 papers

TitleStatusHype
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria RerankingCode14
Autonomous Agents for Collaborative Task under Information AsymmetryCode14
Qwen3 Technical ReportCode14
Qwen2.5 Technical ReportCode13
Qwen2 Technical ReportCode13
R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint OptimizationCode13
Open-Sora: Democratizing Efficient Video Production for AllCode13
Bitnet.cpp: Efficient Edge Inference for Ternary LLMsCode13
MiniCPM-V: A GPT-4V Level MLLM on Your PhoneCode12
Zep: A Temporal Knowledge Graph Architecture for Agent MemoryCode12
OmniParser for Pure Vision Based GUI AgentCode12
SAM 2: Segment Anything in Images and VideosCode12
FlashAttention-3: Fast and Accurate Attention with Asynchrony and Low-precisionCode12
SmolVLA: A Vision-Language-Action Model for Affordable and Efficient RoboticsCode12
DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code IntelligenceCode11
Qwen2.5-Coder Technical ReportCode11
EAP4EMSIG -- Experiment Automation Pipeline for Event-Driven Microscopy to Smart Microfluidic Single-Cells AnalysisCode11
AgentScope: A Flexible yet Robust Multi-Agent PlatformCode11
NYU CTF Bench: A Scalable Open-Source Benchmark Dataset for Evaluating LLMs in Offensive SecurityCode11
WebWalker: Benchmarking LLMs in Web TraversalCode11
Gymnasium: A Standard Interface for Reinforcement Learning EnvironmentsCode11
KAN: Kolmogorov-Arnold NetworksCode11
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow MatchingCode11
HunyuanVideo: A Systematic Framework For Large Video Generative ModelsCode11
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any ResolutionCode11
Show:102550
← PrevPage 2 of 7094Next →