SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 63516375 of 474278 papers

TitleStatusHype
ChemAgent: Self-updating Library in Large Language Models Improves Chemical ReasoningCode2
Test-time Alignment of Diffusion Models without Reward Over-optimizationCode2
VideoRAG: Retrieval-Augmented Generation over Video CorpusCode2
AI-powered virtual tissues from spatial proteomics for clinical diagnostics and biomedical discoveryCode2
xLSTM-SENet: xLSTM for Single-Channel Speech EnhancementCode2
TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response ScenariosCode2
Russian Financial Statements Database: A firm-level collection of the universe of financial statementsCode2
Do we actually understand the impact of renewables on electricity prices? A causal inference approachCode2
ReFocus: Visual Editing as a Chain of Thought for Structured Image UnderstandingCode2
Mechanistic understanding and validation of large AI models with SemanticLensCode2
FOCUS: Towards Universal Foreground SegmentationCode2
UAV-VLA: Vision-Language-Action System for Large Scale Aerial Mission GenerationCode2
CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation ModelsCode2
MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image ClassificationCode2
V2C-CBM: Building Concept Bottlenecks with Vision-to-Concept TokenizerCode2
FLowHigh: Towards Efficient and High-Quality Audio Super-Resolution with Single-Step Flow MatchingCode2
OVO-Bench: How Far is Your Video-LLMs from Real-World Online Video Understanding?Code2
Generative AI for Cel-Animation: A SurveyCode2
LLM4SR: A Survey on Large Language Models for Scientific ResearchCode2
InfiGUIAgent: A Multimodal Generalist GUI Agent with Native Reasoning and ReflectionCode2
Stable Derivative Free Gaussian Mixture Variational Inference for Bayesian Inverse ProblemsCode2
FatesGS: Fast and Accurate Sparse-View Surface Reconstruction using Gaussian Splatting with Depth-Feature ConsistencyCode2
OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech SynthesisCode2
URSA: Understanding and Verifying Chain-of-thought Reasoning in Multimodal MathematicsCode2
A Plug-and-Play Bregman ADMM Module for Inferring Event Branches in Temporal Point ProcessesCode2
Show:102550
← PrevPage 255 of 18972Next →