SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 64266450 of 474278 papers

TitleStatusHype
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR0
HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks0
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction0
ShadowDraw: From Any Object to Shadow-Drawing Compositional Art0
Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement LearningCode0
TTRV: Test-Time Reinforcement Learning for Vision Language Models0
EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor?Code0
ViDiC: Video Difference Captioning0
In-Context Representation Hijacking0
FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring0
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale0
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates0
LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging0
Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models0
Reflection Removal through Efficient Adaptation of Diffusion Transformers0
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression0
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning0
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation0
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model0
Inferring Compositional 4D Scenes without Ever Seeing One0
The SAM2-to-SAM3 Gap in the Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation0
Flowing Backwards: Improving Normalizing Flows via Reverse Representation AlignmentCode0
Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language ModelsCode0
Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs0
A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)Code0
Show:102550
← PrevPage 258 of 18972Next →