SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 39013925 of 661570 papers

TitleStatusHype
Action Draft and Verify: A Self-Verifying Framework for Vision-Language-Action Model0
Tula: Optimizing Time, Cost, and Generalization in Distributed Large-Batch Training0
Modeling Overlapped Speech with Shuffles0
S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition0
LRConv-NeRV: Low Rank Convolution for Efficient Neural Video Compression0
On Additive Gaussian Processes for Wind Farm Power Prediction0
Don't Vibe Code, Do Skele-Code: Interactive No-Code Notebooks for Subject Matter Experts to Build Lower-Cost Agentic Workflows0
GMT: Goal-Conditioned Multimodal Transformer for 6-DOF Object Trajectory Synthesis in 3D Scenes0
The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering0
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs0
Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning0
OT-MeanFlow3D: Bridging Optimal Transport and Meanflow for Efficient 3D Point Cloud Generation0
Large Language Models Hallucination: A Comprehensive Survey0
DUAL-Bench: Measuring Over-Refusal and Robustness in Vision-Language Models0
AI Pose Analysis and Kinematic Profiling of Range-of-Motion Variations in Resistance Training0
MCP-38: A Comprehensive Threat Taxonomy for Model Context Protocol Systems (v1.0)0
Evolved Sample Weights for Bias Mitigation: Effectiveness Depends on the Fairness Objective0
Cast and Attached Shadow Detection via Iterative Light and Geometry Reasoning0
Memory Bear AI A Breakthrough from Memory to Cognition Toward Artificial General Intelligence0
Vulnerability of LLMs' Stated Beliefs? LLMs Belief Resistance Check Through Strategic Persuasive Conversation Interventions0
Age-Aware Edge-Blind Federated Learning via Over-the-Air Aggregation0
Gender Dynamics and Homophily in a Social Network of LLM Agents0
Krause Synchronization Transformers0
Theory and interpretability of Quantum Extreme Learning Machines: a Pauli-transfer matrix approach0
CIRCLE: A Framework for Evaluating AI from a Real-World Lens0
Show:102550
← PrevPage 157 of 26463Next →