SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13511400 of 659983 papers

TitleStatusHype
Mamba Learns in Context: Structure-Aware Domain Generalization for Multi-Task Point Cloud Understanding0
Less is More in Semantic Space: Intrinsic Decoupling via Clifford-M for Fundus Image Classification0
BenchBench: Benchmarking Automated Benchmark Generation0
Predictive Regularization Against Visual Representation Degradation in Multimodal Large Language Models0
Lean Learning Beyond Clouds: Efficient Discrepancy-Conditioned Optical-SAR Fusion for Semantic Segmentation0
GMPilot: An Expert AI Agent For FDA cGMP Compliance0
Fast and Robust Deformable 3D Gaussian Splatting0
Restoring Neural Network Plasticity for Faster Transfer Learning0
Universal Coefficients and Mayer-Vietoris for Moore Homology of Ample Groupoids0
Semantic Sections: An Atlas-Native Feature Ontology for Obstructed Representation Spaces0
Deep Adaptive Rate Allocation in Volatile Heterogeneous Wireless Networks0
Active Inference for Physical AI Agents -- An Engineering Perspective0
Stability of Sequential and Parallel Coordinate Ascent Variational Inference0
Causally-Guided Diffusion for Stable Feature Selection0
AC4A: Access Control for Agents0
Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents0
Beyond Expression Similarity: Contrastive Learning Recovers Functional Gene Associations from Protein Interaction Structure0
Elite Lanes: Evolutionary Generation of Realistic Small-Scale Road Networks0
Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification0
Hard labels sampled from sparse targets mislead rotation invariant algorithms0
From Causal Discovery to Dynamic Causal Inference in Neural Time Series0
Cyber Deception for Mission Surveillance via Hypergame-Theoretic Deep Reinforcement Learning0
AEGIS: An Operational Infrastructure for Post-Market Governance of Adaptive Medical AI Under US and EU Regulations0
A Multi-Task Targeted Learning Framework for Lithium-Ion Battery State-of-Health and Remaining Useful Life0
DAQ: Delta-Aware Quantization for Post-Training LLM Weight Compression0
Trained Persistent Memory for Frozen Decoder-Only LLMs0
Fair splits flip the leaderboard: CHANRG reveals limited generalization in RNA secondary-structure prediction0
Bridging the Gap Between Climate Science and Machine Learning in Climate Model Emulation0
From Instructions to Assistance: a Dataset Aligning Instruction Manuals with Assembly Videos for Evaluating Multimodal LLMs0
A Multi-Modal CNN-LSTM Framework with Multi-Head Attention and Focal Loss for Real-Time Elderly Fall Detection0
Enhancing AI-Based Tropical Cyclone Track and Intensity Forecasting via Systematic Bias Correction0
Emergency Preemption Without Online Exploration: A Decision Transformer Approach0
ST-GDance++: A Scalable Spatial-Temporal Diffusion for Long-Duration Group Choreography0
Geometric Mixture-of-Experts with Curvature-Guided Adaptive Routing for Graph Representation Learning0
A graph neural network based chemical mechanism reduction method for combustion applications0
Sparsely-Supervised Data Assimilation via Physics-Informed Schrödinger Bridge0
Hybrid Associative Memories0
A Direct Classification Approach for Reliable Wind Ramp Event Forecasting under Severe Class Imbalance0
AgentSLR: Automating Systematic Literature Reviews in Epidemiology with Agentic AI0
Beyond the Mean: Distribution-Aware Loss Functions for Bimodal Regression0
Conformal Risk Control for Safety-Critical Wildfire Evacuation Mapping: A Comparative Study of Tabular, Spatial, and Graph-Based Models0
Large Language Models for Missing Data Imputation: Understanding Behavior, Hallucination Effects, and Control Mechanisms0
Graph Signal Processing Meets Mamba2: Adaptive Filter Bank via Delta Modulation0
PDGMM-VAE: A Variational Autoencoder with Adaptive Per-Dimension Gaussian Mixture Model Priors for Nonlinear ICA0
Coding Agents are Effective Long-Context Processors0
ALICE: A Multifaceted Evaluation Framework of Large Audio-Language Models' In-Context Learning Ability0
Verifiable Error Bounds for Physics-Informed Neural KKL Observers0
Deep reflective reasoning in interdependence constrained structured data extraction from clinical notes for digital health0
A Training-Free Regeneration Paradigm: Contrastive Reflection Memory Guided Self-Verification and Self-Improvement0
Detecting Neurovascular Instability from Multimodal Physiological Signals Using Wearable-Compatible Edge AI: A Responsible Computational Framework0
Show:102550
← PrevPage 28 of 13200Next →