SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 92019225 of 474278 papers

TitleStatusHype
LLM Unlearning Without an Expert Curated DatasetCode0
Bridging Semantic Logic Gaps: A Cognition Inspired Multimodal Boundary Preserving Network for Image Manipulation LocalizationCode0
Benchmarking the Robustness of Agentic Systems to Adversarially-Induced HarmsCode0
Sparse Representations Improve Adversarial Robustness of Neural Network ClassifiersCode0
vAttention: Verified Sparse AttentionCode0
Redefining Generalization in Visual Domains: A Two-Axis Framework for Fake Image Detection with FusionDetectCode0
Are Heterogeneous Graph Neural Networks Truly Effective? A Causal PerspectiveCode0
CalibCLIP: Contextual Calibration of Dominant Semantics for Text-Driven Image RetrievalCode0
Towards Unified Image Deblurring using a Mixture-of-Experts DecoderCode0
AUREXA-SE: Audio-Visual Unified Representation Exchange Architecture with Cross-Attention and Squeezeformer for Speech EnhancementCode0
Reproducibility Study of "XRec: Large Language Models for Explainable Recommendation"Code0
Time Is a Feature: Exploiting Temporal Dynamics in Diffusion Language Models0
One Patch to Caption Them All: A Unified Zero-Shot Captioning Framework0
ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs0
Learning on the Job: Test-Time Curricula for Targeted Reinforcement Learning0
When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA0
Less is More: Recursive Reasoning with Tiny Networks0
Test-Time Scaling in Diffusion LLMs via Hidden Semi-Autoregressive Experts0
StaMo: Unsupervised Learning of Generalizable Robot Motion from Compact State Representation0
Character Mixing for Video Generation0
VChain: Chain-of-Visual-Thought for Reasoning in Video Generation0
Pulp Motion: Framing-aware multimodal camera and human motion generation0
A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility0
DynaGuard: A Dynamic Guardian Model With User-Defined Policies0
VER: Vision Expert Transformer for Robot Learning via Foundation Distillation and Dynamic Routing0
Show:102550
← PrevPage 369 of 18972Next →