SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 32763300 of 661570 papers

TitleStatusHype
MFil-Mamba: Multi-Filter Scanning for Spatial Redundancy-Aware Visual State Space ModelsCode0
Agentic Harness for Real-World CompilersCode0
The Y-Combinator for LLMs: Solving Long-Context Rot with λ-CalculusCode0
ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination1
Continual Learning for Food Category Classification Dataset: Enhancing Model Adaptability and Performance0
AIGQ: An End-to-End Hybrid Generative Architecture for E-commerce Query Recommendation0
RAM: Recover Any 3D Human Motion in-the-Wild0
NEC-Diff: Noise-Robust Event-RAW Complementary Diffusion for Seeing Motion in Extreme DarknessCode0
ODySSeI: An Open-Source End-to-End Framework for Automated Detection, Segmentation, and Severity Estimation of Lesions in Invasive Coronary Angiography Images0
Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation0
SutureAgent: Learning Surgical Trajectories via Goal-conditioned Offline RL in Pixel Space0
Stress Classification from ECG Signals Using Vision Transformer0
Brain-inspired AI for Edge Intelligence: a systematic review0
Interpretable liquid crystal phase classification via two-by-two ordinal patterns0
UniFluids: Unified Neural Operator Learning with Conditional Flow-matching0
Ca2+ transient detection and segmentation with the Astronomically motivated algorithm for Background Estimation And Transient Segmentation (Astro-BEATS)0
The Efficiency Attenuation Phenomenon: A Computational Challenge to the Language of Thought Hypothesis0
LLM-Enhanced Energy Contrastive Learning for Out-of-Distribution Detection in Text-Attributed Graphs0
MARLIN: Multi-Agent Reinforcement Learning for Incremental DAG Discovery0
InjectFlow: Weak Guides Strong via Orthogonal Injection for Flow Matching0
Transferable Multi-Bit Watermarking Across Frozen Diffusion Models via Latent Consistency Bridges0
kRAIG: A Natural Language-Driven Agent for Automated DataOps Pipeline Generation0
Semantic Tool Discovery for Large Language Models: A Vector-Based Approach to MCP Tool Selection0
VGS-Decoding: Visual Grounding Score Guided Decoding for Hallucination Mitigation in Medical VLMs0
Rolling-Origin Validation Reverses Model Rankings in Multi-Step PM10 Forecasting: XGBoost, SARIMA, and Persistence0
Show:102550
← PrevPage 132 of 26463Next →