SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 29012925 of 661570 papers

TitleStatusHype
Elite Lanes: Evolutionary Generation of Realistic Small-Scale Road Networks0
Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification0
Hard labels sampled from sparse targets mislead rotation invariant algorithms0
From Causal Discovery to Dynamic Causal Inference in Neural Time Series0
Cyber Deception for Mission Surveillance via Hypergame-Theoretic Deep Reinforcement Learning0
The Nonverbal Gap: Toward Affective Computer Vision for Safer and More Equitable Online Dating0
SEAR: Schema-Based Evaluation and Routing for LLM Gateways0
Multi-view Graph Convolutional Network with Fully Leveraging Consistency via Granular-ball-based Topology Construction, Feature Enhancement and Interactive Fusion0
Capability Safety as Datalog: A Foundational Equivalence0
An Annotation-to-Detection Framework for Autonomous and Robust Vine Trunk Localization in the Field by Mobile Agricultural Robots0
A Multimodal Deep Learning Framework for Edema Classification Using HCT and Clinical Data0
Contextual inference from single objects in Vision-Language models0
Mixture of Experts with Soft Nearest Neighbor Loss: Resolving Expert Collapse via Representation Disentanglement0
AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations0
A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life0
DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression0
Trained Persistent Memory for Frozen Decoder-Only LLMs0
Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-structure prediction0
Bridging the Gap Between Climate Science and Machine Learning in Climate Model Emulation0
From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs0
A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection0
Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction0
Emergency Preemption Without Online Exploration: A Decision Transformer Approach0
ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography0
Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning0
Show:102550
← PrevPage 117 of 26463Next →