SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 50765100 of 661570 papers

TitleStatusHype
The PokeAgent Challenge: Competitive and Long-Context Learning at Scale0
χ_0: Resource-Aware Robust Manipulation via Taming Distributional Inconsistencies3
A Novel Evolutionary Method for Automated Skull-Face Overlay in Computer-Aided Craniofacial Superimposition0
Answer Bubbles: Information Exposure in AI-Mediated Search0
Artificial intelligence-enabled single-lead ECG for non-invasive hyperkalemia detection: development, multicenter validation, and proof-of-concept deployment0
GNNVerifier: Graph-based Verifier for LLM Task PlanningCode0
Molecular Identifier Visual Prompt and Verifiable Reinforcement Learning for Chemical Reaction Diagram Parsing0
Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation0
HYDRA: Unifying Multi-modal Generation and Understanding via Representation-Harmonized Tokenization0
RepoReviewer: A Local-First Multi-Agent Architecture for Repository-Level Code Review0
Functional Stochastic Localization0
SineProject: Machine Unlearning for Stable Vision Language Alignment0
Traj2Action: A Co-Denoising Framework for Trajectory-Guided Human-to-Robot Skill Transfer0
When Silence Matters: The Impact of Irrelevant Audio on Text Reasoning in Large Audio-Language Models0
Exploring the Underwater World Segmentation without Extra Training0
Zero-Shot Time Series Foundation Models for Annual Institutional Forecasting Under Data Sparsity0
Structured Semantic Cloaking for Jailbreak Attacks on Large Language Models0
Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models0
ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control0
An Interpretable Machine Learning Framework for Non-Small Cell Lung Cancer Drug Response Analysis0
Robust Generative Audio Quality Assessment: Disentangling Quality from Spurious Correlations0
Fast-FoundationStereo: Real-Time Zero-Shot Stereo Matching4
VIGIL: Towards Edge-Extended Agentic AI for Enterprise IT Support0
Coded Robust Aggregation for Distributed Learning under Byzantine Attacks0
BridgeShape: Latent Diffusion Schrödinger Bridge for 3D Shape Completion0
Show:102550
← PrevPage 204 of 26463Next →