SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 92269250 of 474278 papers

TitleStatusHype
Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning0
Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization0
TimeSeriesScientist: A General-Purpose AI Agent for Time Series Analysis0
TAG:Tangential Amplifying Guidance for Hallucination-Resistant Diffusion Sampling0
Unifying Autoregressive and Diffusion-Based Sequence Generation0
MedPAO: A Protocol-Driven Agent for Structuring Medical ReportsCode0
A Spatial-Spectral-Frequency Interactive Network for Multimodal Remote Sensing ClassificationCode0
Multi-Agent Tool-Integrated Policy Optimization0
MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image ClassificationCode0
Federated Computation of ROC and PR Curves0
The Telephone Game: Evaluating Semantic Drift in Unified ModelsCode0
ML2B: Multi-Lingual ML Benchmark For AutoMLCode0
Semantic Similarity in Radiology Reports via LLMs and NERCode0
How Different from the Past? Spatio-Temporal Time Series Forecasting with Self-Supervised Deviation LearningCode0
Modeling Student Learning with 3.8 Million Program TracesCode0
First Hallucination Tokens Are Different from Conditional OnesCode0
Fast Witness Persistence for MRI Volumes via Hybrid LandmarkingCode0
GRACE: Generative Representation Learning via Contrastive Policy OptimizationCode0
FocusMed: A Large Language Model-based Framework for Enhancing Medical Question Summarization with Focus IdentificationCode0
ID-Consistent, Precise Expression Generation with Blendshape-Guided DiffusionCode0
JSON Whisperer: Efficient JSON Editing with LLMsCode0
LightCache: Memory-Efficient, Training-Free Acceleration for Video GenerationCode0
Explaining Human Preferences via Metrics for Structured 3D ReconstructionCode0
RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt InjectionCode0
HyperVLA: Efficient Inference in Vision-Language-Action Models via HypernetworksCode0
Show:102550
← PrevPage 370 of 18972Next →