SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 5501-5525 of 177340 papers

Title | Status | Hype
Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models | Code | 2
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 | Code | 2
Scattertext: a Browser-Based Tool for Visualizing how Corpora Differ | Code | 2
ScaleKD: Strong Vision Transformers Could Be Excellent Teachers | Code | 2
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats | Code | 2
NeRF-RPN: A general framework for object detection in NeRFs | Code | 2
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models | Code | 2
Automatic Differentiation-based Full Waveform Inversion with Flexible Workflows | Code | 2
AirMorph: Topology-Preserving Deep Learning for Pulmonary Airway Analysis | Code | 2
Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey | Code | 2
SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding | Code | 2
An Empirical Study of Qwen3 Quantization | Code | 2
One-shot Entropy Minimization | Code | 2
KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation | Code | 2
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks | Code | 2
Safe Delta: Consistently Preserving Safety when Fine-Tuning LLMs on Diverse Datasets | Code | 2
VidMuse: A Simple Video-to-Music Generation Framework with Long-Short-Term Modeling | Code | 2
ColorizeDiffusion v2: Enhancing Reference-based Sketch Colorization Through Separating Utilities | Code | 2
ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | Code | 2
nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation Benchmark | Code | 2
X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation | Code | 2
A Transformer-Based Siamese Network for Change Detection | Code | 2
Focal Modulation Networks | Code | 2
An Embodied Generalist Agent in 3D World | Code | 2
Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Code | 2