SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4251 to 4275 of 661,570 papers

| Title | Status | Hype |
| --- | --- | --- |
| FailureMem: A Failure-Aware Multimodal Framework for Autonomous Software Repair | | 0 |
| Trust the Unreliability: Inward Backward Dynamic Unreliability Driven Coreset Selection for Medical Image Classification | | 0 |
| End-to-end data-driven prediction of urban airflow and pollutant dispersion | | 0 |
| VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation | | 0 |
| Temporal Narrative Monitoring in Dynamic Information Environments | | 0 |
| Do Language Models Encode Semantic Relations? Probing and Sparse Feature Analysis | | 0 |
| A Multi-Agent System for Building-Age Cohort Mapping to Support Urban Energy Planning | | 0 |
| Atomic Trajectory Modeling with State Space Models for Biomolecular Dynamics | | 0 |
| DSS-GAN: Directional State Space GAN with Mamba backbone for Class-Conditional Image Synthesis | | 0 |
| Towards Infinitely Long Neural Simulations: Self-Refining Neural Surrogate Models for Dynamical Systems | | 0 |
| VeriGrey: Greybox Agent Validation | | 0 |
| Interpretable Cross-Domain Few-Shot Learning with Rectified Target-Domain Local Alignment | | 0 |
| Few-Step Diffusion Sampling Through Instance-Aware Discretizations | | 0 |
| Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards | | 0 |
| Illumination-Aware Contactless Fingerprint Spoof Detection via Paired Flash-Non-Flash Imaging | | 0 |
| WeatherReasonSeg: A Benchmark for Weather-Aware Reasoning Segmentation in Visual Language Models | | 0 |
| Sensi: Learn One Thing at a Time -- Curriculum-Based Test-Time Learning for LLM Game Agents | | 0 |
| Does YOLO Really Need to See Every Training Image in Every Epoch? | | 0 |
| Objective Mispricing Detection for Shortlisting Undervalued Football Players via Market Dynamics and News Signals | | 0 |
| Stochastic set-valued optimization and its application to robust learning | | 0 |
| Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos | | 0 |
| Exploring parameter-efficient fine-tuning (PEFT) of billion-parameter vision models with QLoRA and DoRA: insights into generalization for limited-data image classification under a 98:1 test-to-train regime | | 0 |
| AERR-Nav: Adaptive Exploration-Recovery-Reminiscing Strategy for Zero-Shot Object Navigation | | 0 |
| PC-CrossDiff: Point-Cluster Dual-Level Cross-Modal Differential Attention for Unified 3D Referring and Segmentation | | 0 |
| Evidence Packing for Cross-Domain Image Deepfake Detection with LVLMs | | 0 |