SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 826850 of 659983 papers

TitleStatusHype
ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical AgentsCode5
R-CoT: Reverse Chain-of-Thought Problem Generation for Geometric Reasoning in Large Multimodal ModelsCode5
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive AnalysisCode5
Allegro: Open the Black Box of Commercial-Level Video Generation ModelCode5
YOLO-RD: Introducing Relevant and Compact Explicit Knowledge to YOLO by Retriever-DictionaryCode5
DepthSplat: Connecting Gaussian Splatting and DepthCode5
FlashAudio: Rectified Flows for Fast and High-Fidelity Text-to-Audio GenerationCode5
Mini-Omni2: Towards Open-source GPT-4o with Vision, Speech and Duplex CapabilitiesCode5
KBLaM: Knowledge Base augmented Language ModelCode5
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture ModificationCode5
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of ExpertsCode5
OpenR: An Open Source Framework for Advanced Reasoning with Large Language ModelsCode5
Conditional Generative Models for Contrast-Enhanced Synthesis of T1w and T1 Maps in Brain MRICode5
Low Bitrate High-Quality RVQGAN-based Discrete Speech TokenizerCode5
RDT-1B: a Diffusion Foundation Model for Bimanual ManipulationCode5
Scaling Up Your Kernels: Large Kernel Design in ConvNets towards Universal RepresentationsCode5
IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image GenerationCode5
Enabling Novel Mission Operations and Interactions with ROSA: The Robot Operating System AgentCode5
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You ThinkCode5
MLE-bench: Evaluating Machine Learning Agents on Machine Learning EngineeringCode5
Aria: An Open Multimodal Native Mixture-of-Experts ModelCode5
MonST3R: A Simple Approach for Estimating Geometry in the Presence of MotionCode5
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical ReasoningCode5
Loki: An Open-Source Tool for Fact VerificationCode5
Maia-2: A Unified Model for Human-AI Alignment in ChessCode5
Show:102550
← PrevPage 34 of 26400Next →