SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 78267850 of 474278 papers

TitleStatusHype
Machine and Deep Learning for Indoor UWB Jammer LocalizationCode0
CosmoBench: A Multiscale, Multiview, Multitask Cosmology Benchmark for Geometric Deep Learning0
Actial: Activate Spatial Reasoning Ability of Multimodal Large Language Models0
Cold-Start Active Preference Learning in Socio-Economic Domains0
SynBrain: Enhancing Visual-to-fMRI Synthesis via Probabilistic Representation LearningCode0
ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models0
Kineo: Calibration-Free Metric Motion Capture From Sparse RGB Cameras0
Towards Robust Mathematical Reasoning0
Trove: A Flexible Toolkit for Dense Retrieval0
FreeArt3D: Training-Free Articulated Object Generation using 3D Diffusion0
|\,\,BUS\,|: A Large and Diverse Multimodal Benchmark for evaluating the ability of Vision-Language Models to understand Rebus Puzzles0
CMI-MTL: Cross-Mamba interaction based multi-task learning for medical visual question answeringCode0
EPAN: Robust Pedestrian Re-Identification via Enhanced Alignment Network for IoT SurveillanceCode0
UniLumos: Fast and Unified Image and Video Relighting with Physics-Plausible FeedbackCode0
GTAlign: Game-Theoretic Alignment of LLM Assistants for Social WelfareCode0
MedREK: Retrieval-Based Editing for Medical LLMs with Key-Aware PromptsCode0
Benchmark-Ready 3D Anatomical Shape ClassificationCode0
Open Character Training: Shaping the Persona of AI Assistants through Constitutional AICode0
Learnable Fractional Reaction-Diffusion Dynamics for Under-Display ToF Imaging and BeyondCode0
Edge AI in Highly Volatile Environments: Is Fairness Worth the Accuracy Trade-off?Code0
TRACE: Textual Reasoning for Affordance Coordinate ExtractionCode0
Learning Intractable Multimodal Policies with Reparameterization and Diversity RegularizationCode0
DAMBench: A Multi-Modal Benchmark for Deep Learning-based Atmospheric Data AssimilationCode0
Fast, memory-efficient genomic interval tokenizers for modern machine learningCode0
ShortV: Efficient Multimodal Large Language Models by Freezing Visual Tokens in Ineffective LayersCode0
Show:102550
← PrevPage 314 of 18972Next →