SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 91019150 of 661570 papers

TitleStatusHype
Impact of Connectivity on Laplacian Representations in Reinforcement Learning0
BioGait-VLM: A Tri-Modal Vision-Language-Biomechanics Framework for Interpretable Clinical Gait Assessment0
MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation0
Drift-to-Action Controllers: Budgeted Interventions with Online Risk Certificates0
Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control0
StreamReady: Learning What to Answer and When in Long Streaming Videos0
UNBOX: Unveiling Black-box visual models with Natural-language0
Retrieval-Augmented Gaussian Avatars: Improving Expression Generalization0
Grow, Don't Overwrite: Fine-tuning Without Forgetting0
Group Entropies and Mirror Duality: A Class of Flexible Mirror Descent Updates for Machine Learning0
Automated Tensor-Relational Decomposition for Large-Scale Sparse Tensor Computation0
CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning0
Characterization and upgrade of a quantum graph neural network for charged particle tracking0
Agentic Critical Training0
Talking Together: Synthesizing Co-Located 3D Conversations from Audio0
A Multi-Objective Optimization Approach for Sustainable AI-Driven Entrepreneurship in Resilient Economies0
A New Lower Bound for the Random Offerer Mechanism in Bilateral Trade using AI-Guided Evolutionary Search0
Split Federated Learning Architectures for High-Accuracy and Low-Delay Model Training0
Scalable Message Passing Neural Networks: No Need for Attention in Large Graph Representation Learning0
ThinkQE: Query Expansion via an Evolving Thinking Process0
When Thinking Backfires: Mechanistic Insights Into Reasoning-Induced Misalignment0
Quantifying Genuine Awareness in Hallucination Prediction Beyond Question-Side Shortcuts0
Reinforced Generation of Combinatorial Structures: Hardness of Approximation0
VLCE: A Knowledge-Enhanced Framework for Image Description in Disaster Assessment0
Mapping Historic Urban Footprints in France: Balancing Quality, Scalability and AI Techniques0
Bradley-Terry Policy Optimization for Generative Preference Modeling0
Personalized Collaborative Learning with Affinity-Based Variance Reduction0
SynthWorlds: Controlled Parallel Worlds for Disentangling Reasoning and Knowledge in Language Models0
Provable Acceleration of Distributed Optimization with Local Updates0
Automating Forecasting Question Generation and Resolution for AI Evaluation0
RegionReasoner: Region-Grounded Multi-Round Visual Reasoning0
MolCrystalFlow: Molecular Crystal Structure Prediction via Flow Matching0
Exploiting Completeness Perception with Diffusion Transformer for Unified 3D MRI Synthesis0
AuditBench: Evaluating Alignment Auditing Techniques on Models with Hidden Behaviors0
SPREAD: Subspace Representation Distillation for Lifelong Imitation Learning0
X-AVDT: Audio-Visual Cross-Attention for Robust Deepfake Detection0
Quantifying the Accuracy and Cost Impact of Design Decisions in Budget-Constrained Agentic LLM Search0
Multi-level meta-reinforcement learning with skill-based curriculum0
Granulon: Awakening Pixel-Level Visual Encoders with Adaptive Multi-Granularity Semantics for MLLM0
A Lightweight Multi-Cancer Tumor Localization Framework for Deployable Digital PathologyCode0
The Temporal Markov Transition Field0
Where, What, Why: Toward Explainable 3D-GS Watermarking0
VisionCreator-R1: A Reflection-Enhanced Native Visual-Generation Agentic Model0
Scale-Plan: Scalable Language-Enabled Task Planning for Heterogeneous Multi-Robot Teams0
Are Expressive Encoders Necessary for Discrete Graph Generation?0
Computer Vision-Based Vehicle Allotment System using Perspective Mapping0
MASEval: Extending Multi-Agent Evaluation from Models to SystemsCode0
LDP: An Identity-Aware Protocol for Multi-Agent LLM Systems0
Unpacking Interpretability: Human-Centered Criteria for Optimal Combinatorial Solutions0
Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models0
Show:102550
← PrevPage 183 of 13232Next →