SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3501-3525 of 661570 papers

Title | Status | Hype
AS2 -- Attention-Based Soft Answer Sets: An End-to-End Differentiable Neuro-Soft-Symbolic Reasoning Architecture | - | 0
SODIUM: From Open Web Data to Queryable Databases | - | 0
Seeking Universal Shot Language Understanding Solutions | - | 0
MedQ-UNI: Toward Unified Medical Image Quality Assessment and Restoration via Vision-Language Modeling | - | 0
Recolour What Matters: Region-Aware Colour Editing via Token-Level Diffusion | - | 0
GAIN: A Benchmark for Goal-Aligned Decision-Making of Large Language Models under Imperfect Norms | - | 0
Cognitive Mismatch in Multimodal Large Language Models for Discrete Symbol Understanding | - | 0
Do Vision Language Models Understand Human Engagement in Games? | - | 0
T-QPM: Enabling Temporal Out-Of-Distribution Detection and Domain Generalization for Vision-Language Models in Open-World | - | 0
The Truncation Blind Spot: How Decoding Strategies Systematically Exclude Human-Like Token Choices | - | 0
Precise Performance of Linear Denoisers in the Proportional Regime | - | 0
TexEditor: Structure-Preserving Text-Driven Texture Editing | Code | 0
Cross-Domain Demo-to-Code via Neurosymbolic Counterfactual Reasoning | - | 0
NymeriaPlus: Enriching Nymeria Dataset with Additional Annotations and Data | - | 0
OnlinePG: Online Open-Vocabulary Panoptic Mapping with 3D Gaussian Splatting | - | 0
From Snapshots to Symphonies: The Evolution of Protein Prediction from Static Structures to Generative Dynamics and Multimodal Interactions | - | 0
Expert Personas Improve LLM Alignment but Damage Accuracy: Bootstrapping Intent-Based Persona Routing with PRISM | - | 0
CAFlow: Adaptive-Depth Single-Step Flow Matching for Efficient Histopathology Super-Resolution | - | 0
Counting Circuits: Mechanistic Interpretability of Visual Reasoning in Large Vision-Language Models | - | 0
Correlation-Weighted Multi-Reward Optimization for Compositional Generation | - | 0
Data-efficient pre-training by scaling synthetic megadocs | - | 0
Remedying Target-Domain Astigmatism for Cross-Domain Few-Shot Object Detection | - | 0
HEP Statistical Inference for UAV Fault Detection: CLs, LRT, and SBI Applied to Blade Damage | - | 0
SINDy-KANs: Sparse identification of non-linear dynamics through Kolmogorov-Arnold networks | - | 0
CausalVAD: De-confounding End-to-End Autonomous Driving via Causal Intervention | - | 0
Page 141 of 26463