SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 36513675 of 661570 papers

TitleStatusHype
Balanced Thinking: Improving Chain of Thought Training in Vision Language Models0
Ontology-Guided Diffusion for Zero-Shot Visual Sim2Real Transfer0
Measuring 3D Spatial Geometric Consistency in Dynamic Generated Videos0
RADIUS: Ranking, Distribution, and Significance - A Comprehensive Alignment Suite for Survey Simulation0
Robustness, Cost, and Attack-Surface Concentration in Phishing Detection0
LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs1
A Model Ensemble-Based Post-Processing Framework for Fairness-Aware Prediction0
A Comparative Empirical Study of Catastrophic Forgetting Mitigation in Sequential Task Adaptation for Continual Natural Language Processing Systems0
Multiscale Switch for Semi-Supervised and Contrastive Learning in Medical Ultrasound Image SegmentationCode0
Unmasking Algorithmic Bias in Predictive Policing: A GAN-Based Simulation Framework with Multi-City Temporal Analysis0
AlignMamba-2: Enhancing Multimodal Fusion and Sentiment Analysis with Modality-Aware Mamba0
CoDA: Exploring Chain-of-Distribution Attacks and Post-Hoc Token-Space Repair for Medical Vision-Language Models0
Model Order Reduction of Cerebrovascular Hemodynamics Using POD_Galerkin and Reservoir Computing_based Approach0
Beyond Passive Aggregation: Active Auditing and Topology-Aware Defense in Decentralized Federated Learning0
Single Agent Robust Deep Reinforcement Learning for Bus Fleet Control0
Transfer Learning for Neutrino Scattering: Domain Adaptation with GANs0
Multi-Preconditioned LBFGS for Training Finite-Basis PINNs0
Foundations and Architectures of Artificial Intelligence for Motor Insurance0
SRRM: Improving Recursive Transport Surrogates in the Small-Discrepancy Regime0
Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review0
Teleological Inference in Structural Causal Models via Intentional Interventions0
Zipper-LoRA: Dynamic Parameter Decoupling for Speech-LLM based Multilingual Speech RecognitionCode0
Evaluating Model-Free Policy Optimization in Masked-Action Environments via an Exact Blackjack Oracle0
Deep Expert Injection for Anchoring Retinal VLMs with Domain-Specific Knowledge0
HaltNav: Reactive Visual Halting over Lightweight Topological Priors for Robust Vision-Language Navigation0
Show:102550
← PrevPage 147 of 26463Next →