SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 20512075 of 177340 papers

TitleStatusHype
Discovering faster matrix multiplication algorithms with reinforcement learningCode4
SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory TreeCode4
TotalSegmentator: robust segmentation of 104 anatomical structures in CT imagesCode4
Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented TasksCode4
Benchmarking Neural Network Training AlgorithmsCode4
RenderFormer: Transformer-based Neural Rendering of Triangle Meshes with Global IlluminationCode4
XVerse: Consistent Multi-Subject Control of Identity and Semantic Attributes via DiT ModulationCode4
Deepchecks: A Library for Testing and Validating Machine Learning Models and DataCode4
Effective Whole-body Pose Estimation with Two-stages DistillationCode4
CLAY: A Controllable Large-scale Generative Model for Creating High-quality 3D AssetsCode4
The Importance of Directional Feedback for LLM-based OptimizersCode4
Rerender A Video: Zero-Shot Text-Guided Video-to-Video TranslationCode4
Theseus: A Library for Differentiable Nonlinear OptimizationCode4
SnAG: Scalable and Accurate Video GroundingCode4
From Discrete Tokens to High-Fidelity Audio Using Multi-Band DiffusionCode4
DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer ModelsCode4
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual GuidanceCode4
Old Optimizer, New Norm: An AnthologyCode4
Time-LLM: Time Series Forecasting by Reprogramming Large Language ModelsCode4
The Llama 3 Herd of ModelsCode4
ControlVAE: Tuning, Analytical Properties, and Performance AnalysisCode4
UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2heightCode4
Diffusion Policy Policy OptimizationCode4
Scaling Granite Code Models to 128K ContextCode4
AgentSociety: Large-Scale Simulation of LLM-Driven Generative Agents Advances Understanding of Human Behaviors and SocietyCode4
Show:102550
← PrevPage 83 of 7094Next →