SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 53265350 of 661570 papers

TitleStatusHype
Mitigating the Multiplicity Burden: The Role of Calibration in Reducing Predictive Multiplicity of Classifiers0
Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models0
The DIME Architecture: A Unified Operational Algorithm for Neural Representation, Dynamics, Control and Integration0
Optimizing Task Completion Time Updates Using POMDPs0
MetaKE: Meta-learning Aligned Knowledge Editing via Bi-level Optimization0
The COTe score: A decomposable framework for evaluating Document Layout Analysis models0
A Fractional Fox H-Function Kernel for Support Vector Machines: Robust Classification via Weighted Transmutation Operators0
Exact Federated Continual Unlearning for Ridge Heads on Frozen Foundation Models0
BadLLM-TG: A Backdoor Defender powered by LLM Trigger Generator0
Beyond Final Answers: CRYSTAL Benchmark for Transparent Multimodal Reasoning Evaluation0
Semantic Invariance in Agentic AI0
MVHOI: Bridge Multi-view Condition to Complex Human-Object Interaction Video Reenactment via 3D Foundation Model0
Robust Building Damage Detection in Cross-Disaster Settings Using Domain Adaptation0
Scaling Autoregressive Models for Lattice Thermodynamics0
AURORA-KITTI: Any-Weather Depth Completion and Denoising in the Wild0
Beyond Local Code Optimization: Multi-Agent Reasoning for Software System Optimization0
Towards Next-Generation LLM Training: From the Data-Centric Perspective0
Training-Free Generation of Protein Sequences from Small Family Alignments via Stochastic Attention0
Multimodal Deep Learning for Early Prediction of Patient Deterioration in the ICU: Integrating Time-Series EHR Data with Clinical Notes0
GameUIAgent: An LLM-Powered Framework for Automated Game UI Design with Structured Intermediate Representation0
Enhancing Hands in 3D Whole-Body Pose Estimation with Conditional Hands Modulator0
Automated Diabetic Screening via Anterior Segment Ocular Imaging: A Deep Learning and Explainable AI Approach0
A Skill-augmented Agentic Framework and Benchmark for Multi-Video Understanding0
Gauge-Equivariant Intrinsic Neural Operators for Geometry-Consistent Learning of Elliptic PDE Maps0
Efficient Event Camera Volume System0
Show:102550
← PrevPage 214 of 26463Next →