SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9801–9850 of 661570 papers

| Title | Status | Hype |
|---|---|---|
| Self-Supervised Evolutionary Learning of Neurodynamic Progression and Identity Manifolds from EEG During Safety-Critical Decision Making | | 0 |
| Training for Trustworthy Saliency Maps: Adversarial Training Meets Feature-Map Smoothing | | 0 |
| VisualScratchpad: Inference-time Visual Concepts Analysis in Vision Language Models | | 0 |
| Norm-Hierarchy Transitions in Representation Learning: When and Why Neural Networks Abandon Shortcuts | | 0 |
| A Lightweight Digital-Twin-Based Framework for Edge-Assisted Vehicle Tracking and Collision Prediction | | 0 |
| The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations | | 0 |
| Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes | | 0 |
| RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts | | 0 |
| Position: LLMs Must Use Functor-Based and RAG-Driven Bias Mitigation for Fairness | | 0 |
| Scheduling Parallel Optical Circuit Switches for AI Training | | 0 |
| SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions | | 0 |
| Video-EM: Event-Centric Episodic Memory for Long-Form Video Understanding | | 0 |
| A Miniature Brain Transformer: Thalamic Gating, Hippocampal Lateralization, Amygdaloid Salience, and Prefrontal Working Memory in Attention-Coupled Latent Memory | | 0 |
| See It, Say It, Sorted: An Iterative Training-Free Framework for Visually-Grounded Multimodal Reasoning in LVLMs | | 0 |
| ScenePilot-Bench: A Large-Scale Dataset and Benchmark for Evaluation of Vision-Language Models in Autonomous Driving | | 0 |
| Lap2: Revisiting Laplace DP-SGD for High Dimensions via Majorization Theory | | 0 |
| Enhancing Web Agents with a Hierarchical Memory Tree | | 0 |
| AdaGen: Learning Adaptive Policy for Image Synthesis | | 0 |
| Efficient Vision Mamba for MRI Super-Resolution via Hybrid Selective Scanning | | 0 |
| MrBERT: Modern Multilingual Encoders via Vocabulary, Domain, and Dimensional Adaptation | | 0 |
| Tight Robustness Certification Through the Convex Hull of ℓ_0 Attacks | | 0 |
| Efficient Algorithms for Logistic Contextual Slate Bandits with Bandit Feedback | | 0 |
| PASS: Certified Subset Repair for Classical and Quantum Pairwise Constrained Clustering | | 0 |
| Extended Empirical Validation of the Explainability Solution Space | | 0 |
| Topology-Aware Reinforcement Learning over Graphs for Resilient Power Distribution Networks | | 0 |
| Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information | | 0 |
| Learning When to Cooperate Under Heterogeneous Goals | | 0 |
| Shaping Parameter Contribution Patterns for Out-of-Distribution Detection | | 0 |
| Taiwan Safety Benchmark and Breeze Guard: Toward Trustworthy AI for Taiwanese Mandarin | | 0 |
| StructSAM: Structure- and Spectrum-Preserving Token Merging for Segment Anything Models | | 0 |
| VINO: Video-driven Invariance for Non-contextual Objects via Structural Prior Guided De-contextualization | | 0 |
| RAmmStein: Regime Adaptation in Mean-reverting Markets with Stein Thresholds -- Optimal Impulse Control in Concentrated AMMs | | 0 |
| PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems | Code | 0 |
| Do Modern Video-LLMs Need to Listen? A Benchmark Audit and Scalable Remedy | Code | 0 |
| Efficient Diffusion-Based 3D Human Pose Estimation with Hierarchical Temporal Pruning | | 0 |
| Unified Multi-Modal Interactive & Reactive 3D Motion Generation via Rectified Flow | | 0 |
| ELHPlan: Efficient Long-Horizon Task Planning for Multi-Agent Collaboration | | 0 |
| Toward a Physical Theory of Intelligence | | 0 |
| Synthetic Augmentation in Imbalanced Learning: When It Helps, When It Hurts, and How Much to Add | | 0 |
| CGL: Advancing Continual GUI Learning via Reinforcement Fine-Tuning | | 0 |
| Masked Unfairness: Hiding Causality within Zero ATE | | 0 |
| MAviS: A Multimodal Conversational Assistant For Avian Species | | 0 |
| Learning Clinical Representations Under Systematic Distribution Shift | | 0 |
| Learning to Reflect: Hierarchical Multi-Agent Reinforcement Learning for CSI-Free mmWave Beam-Focusing | | 0 |
| Domain-Specific Quality Estimation for Machine Translation in Low-Resource Scenarios | | 0 |
| Cold-Start Active Correlation Clustering | | 0 |
| Reliable Grid Forecasting: State Space Models for Safety-Critical Energy Systems | | 0 |
| Towards Strategic Persuasion with Language Models | | 0 |
| Faster Gradient Methods for Highly-Smooth Stochastic Bilevel Optimization | | 0 |
| SAGA: Selective Adaptive Gating for Efficient and Expressive Linear Attention | | 0 |