SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 14351-14400 of 474278 papers

Title | Status | Hype
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis | Code | 1
Broad Validity of the First-Order Approach in Moral Hazard | - | 0
100-Day Analysis of USD/IDR Exchange Rate Dynamics Around the 2025 U.S. Presidential Inauguration | Code | 0
Temporal Neural Cellular Automata: Application to modeling of contrast enhancement in breast MRI | Code | 0
An Audio-centric Multi-task Learning Framework for Streaming Ads Targeting on Spotify | - | 0
Optimal Design of Experiment for Electrochemical Parameter Identification of Li-ion Battery via Deep Reinforcement Learning | - | 0
Leveraging neural network interatomic potentials for a foundation model of chemistry | - | 0
Adaptive Mask-guided K-space Diffusion for Accelerated MRI Reconstruction | - | 0
Infant Cry Emotion Recognition Using Improved ECAPA-TDNN with Multiscale Feature Fusion and Attention Enhancement | Code | 0
Transforming H&E images into IHC: A Variance-Penalized GAN for Precision Oncology | - | 0
Dynamic Hybrid Modeling: Incremental Identification and Model Predictive Control | - | 0
VHU-Net: Variational Hadamard U-Net for Body MRI Bias Field Correction | - | 0
Transformer World Model for Sample Efficient Multi-Agent Reinforcement Learning | Code | 0
T-CPDL: A Temporal Causal Probabilistic Description Logic for Developing Logic-RAG Agent | - | 0
Structured Kolmogorov-Arnold Neural ODEs for Interpretable Learning and Symbolic Discovery of Nonlinear Dynamics | Code | 0
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling | Code | 1
Neural Total Variation Distance Estimators for Changepoint Detection in News Data | - | 0
Survey of HPC in US Research Institutions | - | 0
A Large Language Model-based Multi-Agent Framework for Analog Circuits' Sizing Relationships Extraction | - | 0
Bayesian Evolutionary Swarm Architecture: A Formal Epistemic System Grounded in Truth-Based Competition | - | 0
Improving Student-AI Interaction Through Pedagogical Prompting: An Example in Computer Science Education | - | 0
TRIZ Agents: A Multi-Agent LLM Approach for TRIZ-Based Innovation | - | 0
Let Your Video Listen to Your Music! | - | 0
OmniAvatar: Efficient Audio-Driven Avatar Video Generation with Adaptive Body Animation | - | 0
Plan for Speed -- Dilated Scheduling for Masked Diffusion Language Models | - | 0
IndieFake Dataset: A Benchmark Dataset for Audio Deepfake Detection | - | 0
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners | - | 0
Understanding Software Engineering Agents: A Study of Thought-Action-Result Trajectories | - | 0
Phase retrieval with rank d measurements -- descending algorithms phase transitions | - | 0
Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement | - | 0
A Modular Taxonomy for Hate Speech Definitions and Its Impact on Zero-Shot LLM Classification Performance | Code | 0
What You Think Is What You Get: Bridge User Intent and Transfer Function Design through Multimodal Large Language Models | Code | 1
GradualDiff-Fed: A Federated Learning Specialized Framework for Large Language Model | - | 0
Efficient Beam Selection for ISAC in Cell-Free Massive MIMO via Digital Twin-Assisted Deep Reinforcement Learning | - | 0
NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis | Code | 1
Robots and Children that Learn Together : Improving Knowledge Retention by Teaching Peer-Like Interactive Robots | - | 0
LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR | Code | 1
Phase transition of descending phase retrieval algorithms | - | 0
Blind Source Separation in Biomedical Signals Using Variational Methods | - | 0
SOF: Sorted Opacity Fields for Fast Unbounded Surface Reconstruction | - | 0
Reading Smiles: Proxy Bias in Foundation Models for Facial Emotion Recognition | - | 0
Enhanced Hybrid Transducer and Attention Encoder Decoder with Text Data | - | 0
SHAMaNS: Sound Localization with Hybrid Alpha-Stable Spatial Measure and Neural Steerer | - | 0
On the algorithmic construction of deep ReLU networks | Code | 0
The Debugging Decay Index: Rethinking Debugging Strategies for Code LLMs | - | 0
PuckTrick: A Library for Making Synthetic Data More Realistic | - | 0
Benchmarking Music Generation Models and Metrics via Human Preference Studies | - | 0
End-to-End Spoken Grammatical Error Correction | - | 0
LIGHTHOUSE: Fast and precise distance to shoreline calculations from anywhere on earth | Code | 1
AI-Generated Song Detection via Lyrics Transcripts | Code | 0
Page 288 of 9486