SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 3376–3400 of 661,570 papers

Title | Status | Hype
Balancing Performance and Fairness in Explainable AI for Anomaly Detection in Distributed Power Plants Monitoring | | 0
Sheaf Neural Networks and biomedical applications | | 0
Image2Gcode: Image-to-G-code Generation for Additive Manufacturing Using Diffusion-Transformer Model | | 0
Score Reversal Is Not Free for Quantum Diffusion Models | | 0
To See or To Please: Uncovering Visual Sycophancy and Split Beliefs in VLMs | | 0
Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders | | 0
Generalization of Long-Range Machine Learning Potentials in Complex Chemical Spaces | | 0
All-in-One Slider for Attribute Manipulation in Diffusion Models | Code | 0
PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents | | 0
Language Model Maps for Prompt-Response Distributions via Log-Likelihood Vectors | | 0
WarPGNN: A Parametric Thermal Warpage Analysis Framework with Physics-aware Graph Neural Network | | 0
Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs | | 0
AU Codes, Language, and Synthesis: Translating Anatomy to Text for Facial Behavior Synthesis | | 0
Student views in AI Ethics and Social Impact | | 0
ITKIT: Feasible CT Image Analysis based on SimpleITK and MMEngine | | 0
Investigating Faithfulness in Large Audio Language Models | | 0
From Workflow Automation to Capability Closure: A Formal Framework for Safe and Revenue-Aware Customer Service AI | | 0
Redundancy-as-Masking: Formalizing the Artificial Age Score (AAS) to Model Memory Aging in Generative AI | | 0
Augmenting Rating-Scale Measures with Text-Derived Items Using the Information-Determined Scoring (IDS) Framework | | 0
REST: Receding Horizon Explorative Steiner Tree for Zero-Shot Object-Goal Navigation | | 0
Holter-to-Sleep: AI-Enabled Repurposing of Single-Lead ECG for Sleep Phenotyping | | 0
Learning Consistent Temporal Grounding between Related Tasks in Sports Coaching | | 0
Look Before You Fuse: 2D-Guided Cross-Modal Alignment for Robust 3D Detection | | 0
AgroCoT: A Chain-of-Thought Benchmark for Evaluating Reasoning in Vision-Language Models for Agriculture | | 0
Self-Tuning Sparse Attention: Multi-Fidelity Hyperparameter Optimization for Transformer Acceleration | | 0
Page 136 of 26463