SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 1195112000 of 661570 papers

TitleStatusHype
Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping0
ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation0
How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference0
MIBURI: Towards Expressive Interactive Gesture Synthesis0
Utonia: Toward One Encoder for All Point Clouds4
Q-Guided Stein Variational Model Predictive Control via RL-informed Policy Prior0
Classification of Histopathology Slides with Persistent Homology Convolutions0
HAMLET: A Hierarchical and Adaptive Multi-Agent Framework for Live Embodied Theatrics0
Effective Sample Size and Generalization Bounds for Temporal Networks0
CAD-Tokenizer: Towards Text-based CAD Prototyping via Modality-Specific Tokenization0
Uni-NTFM: A Unified Foundation Model for EEG Signal Representation Learning0
FLOWR.root: A flow matching based foundation model for joint multi-purpose structure-aware 3D ligand generation and affinity prediction0
The Geometry of Reasoning: Flowing Logics in Representation Space1
A Geometry-Based View of Mahalanobis OOD Detection0
UniLight: A Unified Representation for Lighting0
CNFP: Optimizing Cloud-Native Network Function Placement with Diffusion Models on the Cloud Continuum0
Implicit Bias of the JKO Scheme0
BumpNet: A Sparse MLP Framework for Learning PDE Solutions0
The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge0
LeanTutor: Towards a Verified AI Mathematical Proof Tutor0
Causal Identification from Counterfactual Data: Completeness and Bounding Results0
GENAI WORKBENCH: AI-Assisted Analysis and Synthesis of Engineering Systems from Multimodal Engineering Data0
Surprisal-Rényi Free Energy0
LiteVLA-Edge: Quantized On-Device Multimodal Control for Embedded Robotics0
Learning Order Forest for Qualitative-Attribute Data Clustering0
Multi-Agent-Based Simulation of Archaeological Mobility in Uneven Landscapes0
Zero-Knowledge Federated Learning with Lattice-Based Hybrid Encryption for Quantum-Resilient Medical AI0
Beyond Cross-Validation: Adaptive Parameter Selection for Kernel-Based Gradient Descents0
Heterogeneous Time Constants Improve Stability in Equilibrium Propagation0
Tracing Pharmacological Knowledge In Large Language Models0
Scalable Contrastive Causal Discovery under Unknown Soft Interventions0
Parallel Test-Time Scaling with Multi-Sequence Verifiers0
Beyond Accuracy: Evaluating Visual Grounding In Multimodal Medical Reasoning0
Asymmetric Goal Drift in Coding Agents Under Value Conflict0
Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory0
Biased Generalization in Diffusion Models0
When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning0
Beyond Pixel Histories: World Models with Persistent 3D State0
Optimal trajectory-guided stochastic co-optimization for e-fuel system design and real-time operation0
Quantifying Ranking Instability Across Evaluation Protocol Axes in Gene Regulatory Network Benchmarking0
Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer for 200m Resolution Pan-Arctic Sea Ice Concentration Mapping and Uncertainty Estimation using Sentinel-1, RCM, and AMSR2 Data0
Raising Bars, Not Parameters: LilMoo Compact Language Model for Hindi0
Orbital Transformers for Predicting Wavefunctions in Time-Dependent Density Functional Theory0
The Controllability Trap: A Governance Framework for Military AI Agents0
MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery0
Q-Measure-Learning for Continuous State RL: Efficient Implementation and Convergence0
Molt Dynamics: Emergent Social Phenomena in Autonomous AI Agent Populations0
Multi-Agent Influence Diagrams to Hybrid Threat Modeling0
Logit-Level Uncertainty Quantification in Vision-Language Models for Histopathology Image Analysis0
Directional Neural Collapse Explains Few-Shot Transfer in Self-Supervised Learning0
Show:102550
← PrevPage 240 of 13232Next →