SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 86518700 of 661570 papers

TitleStatusHype
Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety0
DEO: Training-Free Direct Embedding Optimization for Negation-Aware Retrieval0
RubiCap: Rubric-Guided Reinforcement Learning for Dense Image Captioning0
Wrong Code, Right Structure: Learning Netlist Representations from Imperfect LLM-Generated RTL0
POLISH'ing the Sky: Wide-Field and High-Dynamic Range Interferometric Image Reconstruction with Application to Strong Lens Discovery0
ZeroWBC: Learning Natural Visuomotor Humanoid Control Directly from Human Egocentric Video0
Progressive Split Mamba: Effective State Space Modelling for Image Restoration0
Differentiable Stochastic Traffic Dynamics: Physics-Informed Generative Modelling in Transportation0
The Costs of Reproducibility in Music Separation Research: a Replication of Band-Split RNN0
P^2GNN: Two Prototype Sets to boost GNN Performance0
The Radio-Frequency Transformer for Signal Separation0
LooComp: Leverage Leave-One-Out Strategy to Encoder-only Transformer for Efficient Query-aware Context Compression0
Strategically Robust Multi-Agent Reinforcement Learning with Linear Function Approximation0
Abundant Intelligence and Deficient Demand: A Macro-Financial Stress Test of Rapid AI Adoption0
Geometry-Aware Metric Learning for Cross-Lingual Few-Shot Sign Language Recognition on Static Hand Keypoints0
PrivPRISM: Automatically Detecting Discrepancies Between Google Play Data Safety Declarations and Developer Privacy Policies0
SPAR-K: Scheduled Periodic Alternating Early Exit for Spoken Language Models0
Embodied Human Simulation for Quantitative Design and Analysis of Interactive Robotics0
Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control0
HelixTrack: Event-Based Tracking and RPM Estimation of Propeller-like Objects0
BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off0
RAE-NWM: Navigation World Model in Dense Visual Representation Space0
When Detectors Forget Forensics: Blocking Semantic Shortcuts for Generalizable AI-Generated Image Detection0
Towards Instance Segmentation with Polygon Detection Transformers0
Social-R1: Towards Human-like Social Reasoning in LLMs0
A Generative Sampler for distributions with possible discrete parameter based on Reversibility0
Efficient Reasoning at Fixed Test-Time Cost via Length-Aware Attention Priors and Gain-Aware Training0
Multi-model approach for autonomous driving: A comprehensive study on traffic sign-, vehicle- and lane detection and behavioral cloning0
Multimodal Graph Representation Learning with Dynamic Information Pathways0
Transductive Generalization via Optimal Transport and Its Application to Graph Node ClassificationCode0
Implicit Geometry Representations for Vision-and-Language Navigation from Web Videos0
Logos: An evolvable reasoning engine for rational molecular design0
DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data0
On Regret Bounds of Thompson Sampling for Bayesian Optimization0
Speeding Up the Learning of 3D Gaussians with Much Shorter Gaussian Lists0
From Ideal to Real: Stable Video Object Removal under Imperfect Conditions0
CogBlender: Towards Continuous Cognitive Intervention in Text-to-Image Generation0
Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking0
CLoE: Expert Consistency Learning for Missing Modality Segmentation0
See, Plan, Rewind: Progress-Aware Vision-Language-Action Models for Robust Robotic Manipulation0
Diagnosing and Repairing Citation Failures in Generative Engine Optimization0
TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA0
Rescaling Confidence: What Scale Design Reveals About LLM Metacognition0
A Gaussian Comparison Theorem for Training Dynamics in Machine Learning0
NLiPsCalib: An Efficient Calibration Framework for High-Fidelity 3D Reconstruction of Curved Visuotactile Sensors0
OddGridBench: Exposing the Lack of Fine-Grained Visual Discrepancy Sensitivity in Multimodal Large Language Models0
Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning0
TimberAgent: Gram-Guided Retrieval for Executable Music Effect Control0
Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments0
TaSR-RAG: Taxonomy-guided Structured Reasoning for Retrieval-Augmented Generation0
Show:102550
← PrevPage 174 of 13232Next →