SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 48014825 of 661570 papers

TitleStatusHype
From Natural Language to Executable Option Strategies via Large Language Models0
Tabular LLMs for Interpretable Few-Shot Alzheimer's Disease Prediction with Multimodal Biomedical DataCode0
Ethical Fairness without Demographics in Human-Centered AI0
The Cost of Reasoning: Chain-of-Thought Induces Overconfidence in Vision-Language Models0
Incongruent Positivity: When Miscalibrated Positivity Undermines Online Supportive Conversations0
Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models0
SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding0
LUMINA: A Multi-Vendor Mammography Benchmark with Energy Harmonization Protocol0
Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text0
When the City Teaches the Car: Label-Free 3D Perception from Infrastructure0
Automated identification of Ichneumonoidea wasps via YOLO-based deep learning: Integrating HiresCam for Explainable AI0
Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation0
A Scalable Approach to Solving Simulation-Based Network Security Games0
Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation0
Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network0
Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection0
CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning0
ProgressiveAvatars: Progressive Animatable 3D Gaussian Avatars0
Data-driven generalized perimeter control: Zürich case study0
Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models0
Transformers can do Bayesian Clustering0
Knowing What You Cannot Explain: Learning to Reject Low-Quality Explanations0
EdiVal-Agent: An Object-Centric Framework for Automated, Fine-Grained Evaluation of Multi-Turn Editing0
Accurate Shift Invariant Convolutional Neural Networks Using Gaussian-Hermite Moments0
Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video0
Show:102550
← PrevPage 193 of 26463Next →