SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13511375 of 659983 papers

TitleStatusHype
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement LearningCode4
UniK3D: Universal Camera Monocular 3D EstimationCode4
Sonata: Self-Supervised Learning of Reliable Point RepresentationsCode4
Stop Overthinking: A Survey on Efficient Reasoning for Large Language ModelsCode4
Cube: A Roblox View of 3D IntelligenceCode4
DPFlow: Adaptive Optical Flow Estimation with a Dual-Pyramid FrameworkCode4
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal ControlCode4
Cosmos-Reason1: From Physical Common Sense To Embodied ReasoningCode4
Multimodal Chain-of-Thought Reasoning: A Comprehensive SurveyCode4
R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal FormalizationCode4
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and BeyondCode4
Retrieval-Augmented Generation with Hierarchical KnowledgeCode4
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language ModelsCode4
PharMolixFM: All-Atom Foundation Models for Molecular Modeling and GenerationCode4
LocAgent: Graph-Guided LLM Agents for Code LocalizationCode4
VLog: Video-Language Models by Generative Retrieval of Narration VocabularyCode4
Beyond Outlining: Heterogeneous Recursive Planning for Adaptive Long-form Writing with Language ModelsCode4
Towards All-in-One Medical Image Re-IdentificationCode4
LMM-R1: Empowering 3B LMMs with Strong Reasoning Abilities Through Two-Stage Rule-Based RLCode4
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement LearningCode4
PointVLA: Injecting the 3D World into Vision-Language-Action ModelsCode4
Ideas in Inference-time Scaling can Benefit Generative Pre-training AlgorithmsCode4
WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image GenerationCode4
LBM: Latent Bridge Matching for Fast Image-to-Image TranslationCode4
Inductive Moment MatchingCode4
Show:102550
← PrevPage 55 of 26400Next →