SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5426-5450 of 661570 papers

Title | Status | Hype
Sequential Transport for Causal Mediation Analysis | | 0
Token Coherence: Adapting MESI Cache Protocols to Minimize Synchronization Overhead in Multi-Agent LLM Systems | | 0
CATFormer: When Continual Learning Meets Spiking Transformers With Dynamic Thresholds | | 0
What Matters for Scalable and Robust Learning in End-to-End Driving Planners? | | 0
Joint Routing and Model Pruning for Decentralized Federated Learning in Bandwidth-Constrained Multi-Hop Wireless Networks | | 0
The Sampling Complexity of Condorcet Winner Identification in Dueling Bandits | | 0
ADV-0: Closed-Loop Min-Max Adversarial Training for Long-Tail Robustness in Autonomous Driving | | 0
Bidirectional Chinese and English Passive Sentences Dataset for Machine Translation | | 0
In-Context Symbolic Regression for Robustness-Improved Kolmogorov-Arnold Networks | | 0
HalDec-Bench: Benchmarking Hallucination Detector in Image Captioning | | 0
Probe-then-Plan: Environment-Aware Planning for Industrial E-commerce Search | | 0
IConE: Batch Independent Collapse Prevention for Self-Supervised Representation Learning | | 0
From Documents to Spans: Code-Centric Learning for LLM-based ICD Coding | | 0
Faster Inference of Flow-Based Generative Models via Improved Data-Noise Coupling | | 0
Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory | | 0
Pointing-Based Object Recognition | | 0
Evaluating the Robustness of Reinforcement Learning based Adaptive Traffic Signal Control | | 0
Generative Video Compression with One-Dimensional Latent Representation | | 0
Oscillating Dispersion for Maximal Light-throughput Spectral Imaging | | 0
xplainfi: Feature Importance and Statistical Inference for Machine Learning in R | | 0
A Kolmogorov-Arnold Surrogate Model for Chemical Equilibria: Application to Solid Solutions | | 0
CCTU: A Benchmark for Tool Use under Complex Constraints | | 0
CASHomon Sets: Efficient Rashomon Sets Across Multiple Model Classes and their Hyperparameters | | 0
A scaled TW-PINN: A physics-informed neural network for traveling wave solutions of reaction-diffusion equations with general coefficients | | 0
DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models | | 0
Page 218 of 26463