SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 21262150 of 661570 papers

TitleStatusHype
A One-Inclusion Graph Approach to Multi-Group Learning0
A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling0
Is AI Catching Up to Human Expression? Exploring Emotion, Personality, Authorship, and Linguistic Style in English and Arabic with Six Large Language Models0
AI Lifecycle-Aware Feasibility Framework for Split-RIC Orchestration in NTN O-RAN0
Permutation-Symmetrized Diffusion for Unconditional Molecular Generation0
Revisiting Real-Time Digging-In Effects: No Evidence from NP/Z Garden-Paths0
Harnessing Lightweight Transformer with Contextual Synergic Enhancement for Efficient 3D Medical Image Segmentation0
Kinetic Langevin Splitting Schemes for Constrained Sampling0
Graph Energy Matching: Transport-Aligned Energy-Based Modeling for Graph Generation0
Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning0
Evaluating a Multi-Agent Voice-Enabled Smart Speaker for Care Homes: A Safety-Focused Framework0
SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling0
Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback0
Bilevel Autoresearch: Meta-Autoresearching Itself0
Mecha-nudges for Machines0
Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning0
Targeted Adversarial Traffic Generation : Black-box Approach to Evade Intrusion Detection Systems in IoT Networks0
SIGMA: A Physics-Based Benchmark for Gas Chimney Understanding in Seismic Images0
Evaluating LLM-Based Test Generation Under Software Evolution0
VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions0
Estimating Flow Velocity and Vehicle Angle-of-Attack from Non-invasive Piezoelectric Structural Measurements Using Deep Learning0
WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG0
DA-Flow: Degradation-Aware Optical Flow Estimation with Diffusion Models0
UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation0
MedObvious: Exposing the Medical Moravec's Paradox in VLMs via Clinical Triage0
Show:102550
← PrevPage 86 of 26463Next →