SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 251275 of 659983 papers

TitleStatusHype
Pre^3: Enabling Deterministic Pushdown Automata for Faster Structured LLM GenerationCode7
OpenThoughts: Data Recipes for Reasoning ModelsCode7
AReaL: A Large-Scale Asynchronous Reinforcement Learning System for Language ReasoningCode7
HiDream-I1: A High-Efficient Image Generative Foundation Model with Sparse Diffusion TransformerCode7
Let Them Talk: Audio-Driven Multi-Person Conversational Video GenerationCode7
Paper2Poster: Towards Multimodal Poster Automation from Scientific PapersCode7
SageAttention2++: A More Efficient Implementation of SageAttention2Code7
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple CharactersCode7
SEW: Self-Evolving Agentic Workflows for Automated Code GenerationCode7
AI-Researcher: Autonomous Scientific InnovationCode7
Speechless: Speech Instruction Training Without Speech for Low Resource LanguagesCode7
ViDoRe Benchmark V2: Raising the Bar for Visual RetrievalCode7
An Empirical Study on Reinforcement Learning for Reasoning-Search Interleaved LLM AgentsCode7
Visual Agentic Reinforcement Fine-TuningCode7
MAGI-1: Autoregressive Video Generation at ScaleCode7
Faster Video Diffusion with Trainable Sparse AttentionCode7
Logo-LLM: Local and Global Modeling with Large Language Models for Time Series ForecastingCode7
SageAttention3: Microscaling FP4 Attention for Inference and An Exploration of 8-Bit TrainingCode7
Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image AnalysisCode7
Fast Text-to-Audio Generation with Adversarial Post-TrainingCode7
HealthBench: Evaluating Large Language Models Towards Improved Human HealthCode7
Embedding Atlas: Low-Friction, Interactive Embedding VisualizationCode7
Flow-GRPO: Training Flow Matching Models via Online RLCode7
Practical Efficiency of Muon for PretrainingCode7
Kimi-Audio Technical ReportCode7
Show:102550
← PrevPage 11 of 26400Next →