SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 20012025 of 177340 papers

TitleStatusHype
Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMsCode4
PyTorch Frame: A Modular Framework for Multi-Modal Tabular LearningCode4
GPAvatar: Generalizable and Precise Head Avatar from Image(s)Code4
One Diffusion to Generate Them AllCode4
TRIPS: Trilinear Point Splatting for Real-Time Radiance Field RenderingCode4
Simple Baselines for Image RestorationCode4
YOLOv11-RGBT: Towards a Comprehensive Single-Stage Multispectral Object Detection FrameworkCode4
AgentKit: Structured LLM Reasoning with Dynamic GraphsCode4
ServerlessLLM: Low-Latency Serverless Inference for Large Language ModelsCode4
MarkLLM: An Open-Source Toolkit for LLM WatermarkingCode4
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted TreesCode4
One Step Diffusion via Shortcut ModelsCode4
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language ModelsCode4
DiffuEraser: A Diffusion Model for Video InpaintingCode4
Accelerating Visual-Policy Learning through Parallel Differentiable SimulationCode4
ActionStudio: A Lightweight Framework for Data and Training of Large Action ModelsCode4
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level SupervisionCode4
End-to-End Hybrid Refractive-Diffractive Lens Design with Differentiable Ray-Wave ModelCode4
Taking a turn for the better: Conversation redirection throughout the course of mental-health therapyCode4
FLARE: Toward Universal Dataset Purification against Backdoor AttacksCode4
ReasonGraph: Visualisation of Reasoning PathsCode4
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation ControlCode4
Halu-J: Critique-Based Hallucination JudgeCode4
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion PriorCode4
CPGD: Toward Stable Rule-based Reinforcement Learning for Language ModelsCode4
Show:102550
← PrevPage 81 of 7094Next →