SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 926950 of 659983 papers

TitleStatusHype
Unified Spatiotemporal Token Compression for Video-LLMs at Ultra-Low Retention0
BHDD: A Burmese Handwritten Digit Dataset0
Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe0
SecureBreak -- A dataset towards safe and secure models0
BOOST-RPF: Boosted Sequential Trees for Radial Power Flow0
λ-GELU: Learning Gating Hardness for Controlled ReLU-ization in Deep Networks0
STENet: Superpixel Token Enhancing Network for RGB-D Salient Object Detection0
CRPS-Optimal Binning for Conformal Regression0
SegMaFormer: A Hybrid State-Space and Transformer Model for Efficient Segmentation0
A plug-and-play approach with fast uncertainty quantification for weak lensing mass mapping0
On the Challenges and Opportunities of Learned Sparse Retrieval for Code0
6D Robotic OCT Scanning of Curved Tissue Surfaces0
Retrieving Climate Change Disinformation by Narrative0
ROM: Real-time Overthinking Mitigation via Streaming Detection and Intervention0
AdditiveLLM2: A Multi-modal Large Language Model for Additive Manufacturing0
Do Papers Match Code? A Benchmark and Framework for Paper-Code Consistency Detection in Bioinformatics Software0
Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models0
On the Interplay of Priors and Overparametrization in Bayesian Neural Network Posteriors0
Future-Interactions-Aware Trajectory Prediction via Braid Theory0
GTSR: Subsurface Scattering Awared 3D Gaussians for Translucent Surface Reconstruction0
RAFL: Generalizable Sim-to-Real of Soft Robots with Residual Acceleration Field Learning0
Sharper Generalization Bounds for Transformer0
Toward a Theory of Hierarchical Memory for Language Agents0
Optimal Memory Encoding Through Fluctuation-Response Structure0
Rethinking Token Reduction for Large Vision-Language Models0
Show:102550
← PrevPage 38 of 26400Next →