SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1140111450 of 661570 papers

TitleStatusHype
EvoPrune: Early-Stage Visual Token Pruning for Efficient MLLMs0
When to restart? Exploring escalating restarts on convergence0
CONCUR: Benchmarking LLMs for Concurrent Code Generation0
MOOSE-Star: Unlocking Tractable Training for Scientific Discovery by Breaking the Complexity Barrier1
UrbanHuRo: A Two-Layer Human-Robot Collaboration Framework for the Joint Optimization of Heterogeneous Urban Services0
MPFlow: Multi-modal Posterior-Guided Flow Matching for Zero-Shot MRI Reconstruction0
Why Do Unlearnable Examples Work: A Novel Perspective of Mutual Information0
PROSPECT: Unified Streaming Vision-Language Navigation via Semantic--Spatial Fusion and Latent Predictive Representation0
HALyPO: Heterogeneous-Agent Lyapunov Policy Optimization for Human-Robot Collaboration0
ErrorLLM: Modeling SQL Errors for Text-to-SQL Refinement0
RAGNav: A Retrieval-Augmented Topological Reasoning Framework for Multi-Goal Visual-Language Navigation0
JANUS: Structured Bidirectional Generation for Guaranteed Constraints and Analytical Uncertainty0
Assessing the Effectiveness of LLMs in Delivering Cognitive Behavioral Therapy0
WSI-INR: Implicit Neural Representations for Lesion Segmentation in Whole-Slide Images0
Interaction-Aware Whole-Body Control for Compliant Object Transport0
Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning0
Agentic Peer-to-Peer Networks: From Content Distribution to Capability and Action Sharing0
Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding0
LiDAR Prompted Spatio-Temporal Multi-View Stereo for Autonomous Driving0
Cognition to Control - Multi-Agent Learning for Human-Humanoid Collaborative Transport0
Not All Candidates are Created Equal: A Heterogeneity-Aware Approach to Pre-ranking in Recommender Systems0
Towards Effective Orchestration of AI x DB Workloads0
Inverse Contextual Bandits without Rewards: Learning from a Non-Stationary Learner via Suffix Imitation0
MACC: Multi-Agent Collaborative Competition for Scientific Exploration0
DisenReason: Behavior Disentanglement and Latent Reasoning for Shared-Account Sequential Recommendation0
Specification-Driven Generation and Evaluation of Discrete-Event World Models via the DEVS Formalism0
Observationally Informed Adaptive Causal Experimental Design0
Small Object Detection in Complex Backgrounds with Multi-Scale Attention and Global Relation Modeling0
T2S-Bench & Structure-of-Thought: Benchmarking and Prompting Comprehensive Text-to-Structure Reasoning0
TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration0
A Rubric-Supervised Critic from Sparse Real-World Outcomes0
Unsupervised Surrogate-Assisted Synthesis of Free-Form Planar Antenna Topologies for IoT Applications0
Separators in Enhancing Autoregressive Pretraining for Vision Mamba0
Universal Pansharpening Foundation Model0
Adaptive Enhancement and Dual-Pooling Sequential Attention for Lightweight Underwater Object Detection with YOLOv100
In-Context Environments Induce Evaluation-Awareness in Language Models0
PatchDecomp: Interpretable Patch-Based Time Series Forecasting0
Semantic Bridging Domains: Pseudo-Source as Test-Time Connector0
Non-Invasive Reconstruction of Cardiac Activation Dynamics Using Physics-Informed Neural Networks0
Structure-Aware Distributed Backdoor Attacks in Federated Learning0
All-in-One Image Restoration via Causal-Deconfounding Wavelet-Disentangled Prompt Network0
On the Suitability of LLM-Driven Agents for Dark Pattern Audits0
Benchmarking Motivational Interviewing Competence of Large Language Models0
Coupling Local Context and Global Semantic Prototypes via a Hierarchical Architecture for Rhetorical Roles Labeling0
k-hop Fairness: Addressing Disparities in Graph Link Prediction Beyond First-Order Neighborhoods0
Believe Your Model: Distribution-Guided Confidence Calibration0
How Predicted Links Influence Network Evolution: Disentangling Choice and Algorithmic Feedback in Dynamic Graphs0
UniRain: Unified Image Deraining with RAG-based Dataset Distillation and Multi-objective Reweighted Optimization0
UniSync: Towards Generalizable and High-Fidelity Lip Synchronization for Challenging Scenarios0
A novel network for classification of cuneiform tablet metadata0
Show:102550
← PrevPage 229 of 13232Next →