SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1051-1075 of 659,983 papers

Title | Status | Hype
WIST: Web-Grounded Iterative Self-Play Tree for Domain-Targeted Reasoning Improvement |  | 0
Demystifying Low-Rank Knowledge Distillation in Large Language Models: Convergence, Generalization, and Information-Theoretic Guarantees |  | 0
Bridging neuroscience and AI: adaptive, culturally sensitive technologies transforming aphasia rehabilitation |  | 0
STEM Agent: A Self-Adapting, Tool-Enabled, Extensible Architecture for Multi-Protocol AI Agent Systems |  | 0
ECI: Effective Contrastive Information to Evaluate Hard-Negatives |  | 0
Structural Sensitivity in Compressed Transformers: Error Propagation, Lyapunov Stability, and Formally Verified Bounds |  | 0
Long-Term Outlier Prediction Through Outlier Score Modeling |  | 0
The Intelligent Disobedience Game: Formulating Disobedience in Stackelberg Games and Markov Decision Processes |  | 0
When Does Content-Based Routing Work? Representation Requirements for Selective Attention in Hybrid Sequence Models |  | 0
CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs |  | 0
Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO |  | 0
SpatialFly: Geometry-Guided Representation Alignment for UAV Vision-and-Language Navigation in Urban Environments |  | 0
When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound |  | 0
NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection |  | 0
CTFS : Collaborative Teacher Framework for Forward-Looking Sonar Image Semantic Segmentation with Extremely Limited Labels |  | 0
SqueezeComposer: Temporal Speed-up is A Simple Trick for Long-form Music Composing |  | 0
CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models |  | 0
Assessing the Ability of Neural TTS Systems to Model Consonant-Induced F0 Perturbation |  | 0
Hierarchical Text-Guided Brain Tumor Segmentation via Sub-Region-Aware Prompts |  | 0
ViCLSR: A Supervised Contrastive Learning Framework with Natural Language Inference for Natural Language Understanding Tasks |  | 0
Interpreting the Synchronization Gap: The Hidden Mechanism Inside Diffusion Transformers |  | 0
Can we automatize scientific discovery in the cognitive sciences? |  | 0
Behavioural feasible set: Value alignment constraints on AI decision support |  | 0
Text-Image Conditioned 3D Generation |  | 0
Direct Interval Propagation Methods using Neural-Network Surrogates for Uncertainty Quantification in Physical Systems Surrogate Model |  | 0