SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 2351–2400 of 659,983 papers

Title | Status | Hype
Transformer-Based Rate Prediction for Multi-Band Cellular Handsets | | 0
Action Draft and Verify: A Self-Verifying Framework for Vision-Language-Action Model | | 0
Tula: Optimizing Time, Cost, and Generalization in Distributed Large-Batch Training | | 0
Modeling Overlapped Speech with Shuffles | | 0
S3T-Former: A Purely Spike-Driven State-Space Topology Transformer for Skeleton Action Recognition | | 0
LRConv-NeRV: Low Rank Convolution for Efficient Neural Video Compression | | 0
On Additive Gaussian Processes for Wind Farm Power Prediction | | 0
Don't Vibe Code, Do Skele-Code: Interactive No-Code Notebooks for Subject Matter Experts to Build Lower-Cost Agentic Workflows | | 0
GMT: Goal-Conditioned Multimodal Transformer for 6-DOF Object Trajectory Synthesis in 3D Scenes | | 0
The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering | | 0
Unified Spatio-Temporal Token Scoring for Efficient Video VLMs | | 0
Fourier Learning Machines: Nonharmonic Fourier-Based Neural Networks for Scientific Machine Learning | | 0
OT-MeanFlow3D: Bridging Optimal Transport and Meanflow for Efficient 3D Point Cloud Generation | | 0
Large Language Models Hallucination: A Comprehensive Survey | | 0
DUAL-Bench: Measuring Over-Refusal and Robustness in Vision-Language Models | | 0
AI Pose Analysis and Kinematic Profiling of Range-of-Motion Variations in Resistance Training | | 0
MCP-38: A Comprehensive Threat Taxonomy for Model Context Protocol Systems (v1.0) | | 0
Evolved Sample Weights for Bias Mitigation: Effectiveness Depends on the Fairness Objective | | 0
Cast and Attached Shadow Detection via Iterative Light and Geometry Reasoning | | 0
Memory Bear AI A Breakthrough from Memory to Cognition Toward Artificial General Intelligence | | 0
Vulnerability of LLMs' Stated Beliefs? LLMs Belief Resistance Check Through Strategic Persuasive Conversation Interventions | | 0
Age-Aware Edge-Blind Federated Learning via Over-the-Air Aggregation | | 0
Gender Dynamics and Homophily in a Social Network of LLM Agents | | 0
Krause Synchronization Transformers | | 0
Theory and interpretability of Quantum Extreme Learning Machines: a Pauli-transfer matrix approach | | 0
CIRCLE: A Framework for Evaluating AI from a Real-World Lens | | 0
ClinConsensus: A Consensus-Based Benchmark for Evaluating Chinese Medical LLMs across Difficulty Levels | | 0
Adversarial Latent-State Training for Robust Policies in Partially Observable Domains | | 0
Structure from rank: Rank-order coding as a bridge from sequence to structure | | 0
AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents | | 0
Listening to the Echo: User-Reaction Aware Policy Optimization via Scalar-Verbal Hybrid Reinforcement Learning | | 0
Variational Phasor Circuits for Phase-Native Brain-Computer Interface Classification | | 0
Discovering What You Can Control: Interventional Boundary Discovery for Reinforcement Learning | | 0
A Synthesizable RTL Implementation of Predictive Coding Networks | | 0
CWoMP: Morpheme Representation Learning for Interlinear Glossing | | 0
Lightweight Adaptation for LLM-based Technical Service Agent: Latent Logic Augmentation and Robust Noise Reduction | | 0
SLEA-RL: Step-Level Experience Augmented Reinforcement Learning for Multi-Turn Agentic Training | | 0
EgoAdapt: Enhancing Robustness in Egocentric Interactive Speaker Detection Under Missing Modalities | | 0
Probabilistic Federated Learning on Uncertain and Heterogeneous Data with Model Personalization | | 0
Uncovering Latent Phase Structures and Branching Logic in Locomotion Policies: A Case Study on HalfCheetah | | 0
One-to-More: High-Fidelity Training-Free Anomaly Generation with Attention Control | | 0
A Trace-Based Assurance Framework for Agentic AI Orchestration: Contracts, Testing, and Governance | | 0
ARTEMIS: A Neuro Symbolic Framework for Economically Constrained Market Dynamics | | 0
From Concepts to Judgments: Interpretable Image Aesthetic Assessment | | 0
Discovery of Bimodal Drift Rate Structure in FRB 20240114A: Evidence for Dual Emission Regions | | 0
BoundAD: Boundary-Aware Negative Generation for Time Series Anomaly Detection | | 0
VC-Soup: Value-Consistency Guided Multi-Value Alignment for Large Language Models | | 0
MAED: Mathematical Activation Error Detection for Mitigating Physical Fault Attacks in DNN Inference | | 0
Evaluating FrameNet-Based Semantic Modeling for Gender-Based Violence Detection in Clinical Records | | 0
Towards sample-optimal learning of bosonic Gaussian quantum states | | 0
Page 48 of 13,200