SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 93519375 of 177340 papers

TitleStatusHype
ExpeL: LLM Agents Are Experiential LearnersCode2
SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond WordsCode2
MuMA-ToM: Multi-modal Multi-Agent Theory of MindCode2
Retrieval-Augmented Diffusion Models for Time Series ForecastingCode2
Collaborative Decoding Makes Visual Auto-Regressive Modeling EfficientCode2
Time Will Tell: New Outlooks and A Baseline for Temporal Multi-View 3D Object DetectionCode2
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body SimulationCode2
SynRS3D: A Synthetic Dataset for Global 3D Semantic Understanding from Monocular Remote Sensing ImageryCode2
Machine learning interatomic potential can infer electrical responseCode2
HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial OptimizationCode2
Fully Sparse 3D Occupancy PredictionCode2
SensorLLM: Human-Intuitive Alignment of Multivariate Sensor Data with LLMs for Activity RecognitionCode2
MambaMorph: a Mamba-based Framework for Medical MR-CT Deformable RegistrationCode2
Common Objects in 3D: Large-Scale Learning and Evaluation of Real-life 3D Category ReconstructionCode2
Human Pose as Compositional TokensCode2
Dense Distinct Query for End-to-End Object DetectionCode2
Deduplicating Training Data Makes Language Models BetterCode2
Approximate Convex Decomposition for 3D Meshes with Collision-Aware Concavity and Tree SearchCode2
Autonomous GIS: the next-generation AI-powered GISCode2
The Surprising Effectiveness of Negative Reinforcement in LLM ReasoningCode2
DIRECT-3D: Learning Direct Text-to-3D Generation on Massive Noisy 3D DataCode2
TinyVLA: Towards Fast, Data-Efficient Vision-Language-Action Models for Robotic ManipulationCode2
Graph Neural Network Surrogates to leverage Mechanistic Expert Knowledge towards Reliable and Immediate Pandemic ResponseCode2
UniHDSA: A Unified Relation Prediction Approach for Hierarchical Document Structure AnalysisCode2
LVM-Med: Learning Large-Scale Self-Supervised Vision Models for Medical Imaging via Second-order Graph MatchingCode2
Show:102550
← PrevPage 375 of 7094Next →