SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 29763000 of 661570 papers

TitleStatusHype
94% on CIFAR-10 in 3.29 Seconds on a Single GPUCode3
MapTRv2: An End-to-End Framework for Online Vectorized HD Map ConstructionCode3
On the Trajectory Regularity of ODE-based Diffusion SamplingCode3
Amplifier: Bringing Attention to Neglected Low-Energy Components in Time Series ForecastingCode3
IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language ModelsCode3
SyncTalk: The Devil is in the Synchronization for Talking Head SynthesisCode3
Single-Image Shadow Removal Using Deep Learning: A Comprehensive SurveyCode3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
Parameter-Efficient Fine-Tuning in Spectral Domain for Point Cloud LearningCode3
GaussianEditor: Swift and Controllable 3D Editing with Gaussian SplattingCode3
GraphStorm: all-in-one graph machine learning framework for industry applicationsCode3
TokenPacker: Efficient Visual Projector for Multimodal LLMCode3
WeatherMesh-3: Fast and accurate operational global weather forecastingCode3
NdLinear Is All You Need for Representation LearningCode3
Bake off redux: a review and experimental evaluation of recent time series classification algorithmsCode3
TrafficLLM: Enhancing Large Language Models for Network Traffic Analysis with Generic Traffic RepresentationCode3
CameraHMR: Aligning People with PerspectiveCode3
DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World KnowledgeCode3
DEFOM-Stereo: Depth Foundation Model Based Stereo MatchingCode3
Rainbow: Combining Improvements in Deep Reinforcement LearningCode3
Mambular: A Sequential Model for Tabular Deep LearningCode3
Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference OptimizationCode3
WHAC: World-grounded Humans and CamerasCode3
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic EvaluationsCode3
Generative AI Act II: Test Time Scaling Drives Cognition EngineeringCode3
Show:102550
← PrevPage 120 of 26463Next →