SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 41764200 of 661570 papers

TitleStatusHype
S-VAM: Shortcut Video-Action Model by Self-Distilling Geometric and Semantic Foresight0
VIEW2SPACE: Studying Multi-View Visual Reasoning from Sparse Observations0
Wasserstein-type Gaussian Process Regressions for Input Measurement Uncertainty0
The Causal Uncertainty Principle: Manifold Tearing and the Topological Limits of Counterfactual Interventions0
Gesture-Aware Pretraining and Token Fusion for 3D Hand Pose Estimation0
Adaptive Anchor Policies for Efficient 4D Gaussian Streaming0
From Drop-off to Recovery: A Mechanistic Analysis of Segmentation in MLLMs0
Visual SLAM with DEM Anchoring for Lunar Surface Navigation0
KANtize: Exploring Low-bit Quantization of Kolmogorov-Arnold Networks for Efficient Inference0
Neuron-Level Emotion Control in Speech-Generative Large Audio-Language Models0
Deployment and Evaluation of an EHR-integrated, Large Language Model-Powered Tool to Triage Surgical Patients0
Neural Radiance Maps for Extraterrestrial Navigation and Path Planning0
On the Cone Effect and Modality Gap in Medical Vision-Language Embeddings0
Variational Rectification Inference for Learning with Noisy Labels0
GigaWorld-Policy: An Efficient Action-Centered World--Action Model2
LED: A Benchmark for Evaluating Layout Error Detection in Document Analysis0
DANCE: Dynamic 3D CNN Pruning: Joint Frame, Channel, and Feature Adaptation for Energy Efficiency on the Edge0
WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation0
From Words to Worlds: Benchmarking Cross-Cultural Cultural Understanding in Machine Translation0
Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations0
Towards Safer Large Reasoning Models by Promoting Safety Decision-Making before Chain-of-Thought Generation0
ReLMXEL: Adaptive RL-Based Memory Controller with Explainable Energy and Latency Optimization0
InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning0
Deploying Semantic ID-based Generative Retrieval for Large-Scale Podcast Discovery at Spotify0
Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress0
Show:102550
← PrevPage 168 of 26463Next →