SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13761400 of 659983 papers

TitleStatusHype
Trained Persistent Memory for Frozen Decoder-Only LLMs0
Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-structure prediction0
Bridging the Gap Between Climate Science and Machine Learning in Climate Model Emulation0
From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs0
A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection0
Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction0
Emergency Preemption Without Online Exploration: A Decision Transformer Approach0
ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography0
Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning0
A graph neural network based chemical mechanism reduction method for combustion applications0
Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge0
Hybrid Associative Memories0
A Direct Classification Approach for Reliable Wind Ramp Event Forecasting under Severe Class Imbalance0
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI0
Beyond the Mean: Distribution-Aware Loss Functions for Bimodal Regression0
Conformal Risk Control for Safety-Critical Wildfire Evacuation Mapping: A Comparative Study of Tabular, Spatial, and Graph-Based Models0
Large Language Models for Missing Data Imputation: Understanding Behavior, Hallucination Effects, and Control Mechanisms0
Graph Signal Processing Meets Mamba2: Adaptive Filter Bank via Delta Modulation0
PDGMM-VAE: A Variational Autoencoder with Adaptive Per-Dimension Gaussian Mixture Model Priors for Nonlinear ICA0
Coding Agents are Effective Long-Context Processors0
ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning Ability0
Verifiable Error Bounds for Physics-Informed Neural KKL Observers0
Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health0
A Training-Free Regeneration Paradigm: Contrastive Reflection Memory Guided Self-Verification and Self-Improvement0
Detecting Neurovascular Instability from Multimodal Physiological Signals Using Wearable-Compatible Edge AI: A Responsible Computational Framework0
Show:102550
← PrevPage 56 of 26400Next →