SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

658,356 papers247,172 code links4,818 tasks

Papers

Showing 226250 of 658356 papers

TitleStatusHype
DocLayNet: A Large Human-Annotated Dataset for Document-Layout AnalysisCode8
Fine-mixing: Mitigating Backdoors in Fine-tuned Language ModelsCode8
Qwen3-ASR Technical Report7
SAM 3D Body: Robust Full-Body Human Mesh Recovery7
Attention Residuals7
WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning7
dLLM: Simple Diffusion Language Modeling7
Pretraining Large Language Models with NVFP47
GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning7
Advancing Open-source World Models7
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem7
Transparent Image Layer Diffusion using Latent TransparencyCode7
One-Step Image Translation with Text-to-Image ModelsCode7
MiniGPT-4: Enhancing Vision-Language Understanding with Advanced Large Language ModelsCode7
SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference AccelerationCode7
From Bytes to Ideas: Language Modeling with Autoregressive U-NetsCode7
Robust Inverse Graphics via Probabilistic InferenceCode7
InternVideo2: Scaling Foundation Models for Multimodal Video UnderstandingCode7
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented GenerationCode7
HealthBench: Evaluating Large Language Models Towards Improved Human HealthCode7
Prometheus: Inducing Fine-grained Evaluation Capability in Language ModelsCode7
Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion TransformersCode7
OmniGen: Unified Image GenerationCode7
LHM: Large Animatable Human Reconstruction Model from a Single Image in SecondsCode7
FourierKAN outperforms MLP on Text Classification Head Fine-tuningCode7
Show:102550
← PrevPage 10 of 26335Next →