SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 40014025 of 661570 papers

TitleStatusHype
EchoGen: Cycle-Consistent Learning for Unified Layout-Image Generation and Understanding0
Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models0
Universal Sparse Autoencoders: Interpretable Cross-Model Concept Alignment0
Surrogate Model for Heat Transfer Prediction in Impinging Jet Arrays using Dynamic Inlet/Outlet and Flow Rate Control0
Entity-Specific Cyber Risk Assessment using InsurTech Empowered Risk Factors0
MMSearch-Plus: Benchmarking Provenance-Aware Search for Multimodal Browsing Agents0
Enhancing Reinforcement Learning Fine-Tuning with an Online Refiner0
Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)Code0
Simulation to Rules: A Dual-VLM Framework for Formal Visual Planning0
An Improved Model-Free Decision-Estimation Coefficient with Applications in Adversarial MDPs0
Bridging Earth and Space: A Survey on HAPS for Non-Terrestrial Networks0
Seeing Beyond the Image: ECG and Anatomical Knowledge-Guided Myocardial Scar Segmentation from Late Gadolinium-Enhanced Images0
DuoTeach: Dual Role Self-Teaching for Coarse-to-Fine Decision Coordination in Vision--Language Models0
Embedding Physical Reasoning into Diffusion-Based Shadow Generation0
GriDiT: Factorized Grid-Based Diffusion for Efficient Long Image Sequence Generation0
SF-RAG: Structure-Fidelity Retrieval-Augmented Generation for Academic Question Answering0
Causality is Key for Interpretability Claims to Generalise0
Thousand-GPU Large-Scale Training and Optimization Recipe for AI-Native Cloud Embodied Intelligence Infrastructure0
Systematic Scaling Analysis of Jailbreak Attacks in Large Language Models0
MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent SystemsCode0
SoulX-LiveAct: Towards Hour-Scale Real-Time Human Animation with Neighbor Forcing and ConvKV Memory0
A Stability-Aware Frozen Euler Autoencoder for Physics-Informed Tracking in Continuum Mechanics (SAFE-PIT-CM)0
Spectral Edge Dynamics of Training Trajectories: Signal--Noise Geometry Across Scales0
AsgardBench -- Evaluating Visually Grounded Interactive Planning Under Minimal Feedback0
Generative Replica-Exchange: A Flow-based Framework for Accelerating Replica Exchange Simulations0
Show:102550
← PrevPage 161 of 26463Next →