SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13011325 of 659983 papers

TitleStatusHype
s3: You Don't Need That Much Data to Train a Search Agent via RLCode4
Scaling Law for Quantization-Aware TrainingCode4
VideoEval-Pro: Robust and Realistic Long Video Understanding EvaluationCode4
Multi-head Temporal Latent AttentionCode4
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level SupervisionCode4
Mean Flows for One-step Generative ModelingCode4
DreamGen: Unlocking Generalization in Robot Learning through Video World ModelsCode4
CPGD: Toward Stable Rule-based Reinforcement Learning for Language ModelsCode4
Kornia-rs: A Low-Level 3D Computer Vision Library In RustCode4
VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement LearningCode4
Attention on the SphereCode4
Accelerating Visual-Policy Learning through Parallel Differentiable SimulationCode4
OnPrem.LLM: A Privacy-Conscious Document Intelligence ToolkitCode4
Gated Attention for Large Language Models: Non-linearity, Sparsity, and Attention-Sink-FreeCode4
Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning ModelsCode4
FG-CLIP: Fine-Grained Visual and Textual AlignmentCode4
3D Scene Generation: A SurveyCode4
VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language ModelCode4
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-TuningCode4
Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal InteractionCode4
Towards One-shot Federated Learning: Advances, Challenges, and Future DirectionsCode4
Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and ModalityCode4
T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoTCode4
Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of LightCode4
AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning datasetCode4
Show:102550
← PrevPage 53 of 26400Next →