SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers | 258,108 code links | 4,818 tasks

Papers

Showing 26-50 of 658,356 papers

Title | Status | Hype
Relevance Isn't All You Need: Scaling RAG Systems With Inference-Time Compute Via Multi-Criteria Reranking | Code | 13
UI-TARS: Pioneering Automated GUI Interaction with Native Agents | Code | 13
Qwen3 Technical Report | Code | 13
MiniCPM-V: A GPT-4V Level MLLM on Your Phone | Code | 12
Zep: A Temporal Knowledge Graph Architecture for Agent Memory | Code | 12
OmniParser for Pure Vision Based GUI Agent | Code | 12
DeepPlanning: Benchmarking Long-Horizon Agentic Planning with Verifiable Constraints | - | 11
Qwen3-Coder-Next Technical Report | - | 11
HunyuanVideo: A Systematic Framework For Large Video Generative Models | Code | 11
F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching | Code | 11
InstantID: Zero-shot Identity-Preserving Generation in Seconds | Code | 11
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution | Code | 11
Absolute Zero: Reinforced Self-play Reasoning with Zero Data | Code | 11
Janus: Decoupling Visual Encoding for Unified Multimodal Understanding and Generation | Code | 11
USP: A Unified Sequence Parallelism Approach for Long Context Generative AI | Code | 11
Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence | Code | 11
KAN 2.0: Kolmogorov-Arnold Networks Meet Science | Code | 11
CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | Code | 11
CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer | Code | 11
Eliza: A Web3 friendly AI Agent Operating System | Code | 11
KAN: Kolmogorov-Arnold Networks | Code | 11
SWIFT:A Scalable lightWeight Infrastructure for Fine-Tuning | Code | 11
LangGPT: Rethinking Structured Reusable Prompt Design Framework for LLMs from the Programming Language | Code | 11
Pixtral 12B | Code | 11
Introduction to Reinforcement Learning | Code | 11