SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 376400 of 659983 papers

TitleStatusHype
YOLOv10 with Kolmogorov-Arnold networks and vision-language foundation models for interpretable object detection and trustworthy multimodal AI in computer vision perception0
HUydra: Full-Range Lung CT Synthesis via Multiple HU Interval Generative Modelling0
Assessing the Robustness of Climate Foundation Models under No-Analog Distribution Shifts0
Machine Learning Models for the Early Detection of Burnout in Software Engineering: a Systematic Literature Review0
AuthorMix: Modular Authorship Style Transfer via Layer-wise Adapter Mixing0
MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models0
General Machine Learning: Theory for Learning Under Variable Regimes0
I Came, I Saw, I Explained: Benchmarking Multimodal LLMs on Figurative Meaning in Memes0
PERMA: Benchmarking Personalized Memory Agents via Event-Driven Preference and Realistic Task Environments0
GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL0
MemCollab: Cross-Agent Memory Collaboration via Contrastive Trajectory Distillation0
Online library learning in human visual puzzle solving0
Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning0
GO-Renderer: Generative Object Rendering with 3D-aware Controllable Video Diffusion Models0
SynForceNet: A Force-Driven Global-Local Latent Representation Framework for Lithium-Ion Battery Fault Diagnosis0
SafeSeek: Universal Attribution of Safety Circuits in Language Models0
Not All Tokens Are Created Equal: Query-Efficient Jailbreak Fuzzing for LLMs0
A Multimodal Framework for Human-Multi-Agent Interaction0
Multi-Modal Image Fusion via Intervention-Stable Feature Learning0
CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection0
Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook0
A Comparative Study of Machine Learning Models for Hourly Forecasting of Air Temperature and Relative Humidity0
WaveSFNet: A Wavelet-Based Codec and Spatial--Frequency Dual-Domain Gating Network for Spatiotemporal Prediction0
LLM Olympiad: Why Model Evaluation Needs a Sealed Exam0
Mamba-driven MRI-to-CT Synthesis for MRI-only Radiotherapy Planning0
Show:102550
← PrevPage 16 of 26400Next →