SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 976–1000 of 659,983 papers

| Title | Status | Hype |
| --- | --- | --- |
| LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders | Code | 5 |
| SpeechAlign: Aligning Speech Generation to Human Preferences | Code | 5 |
| Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer | Code | 5 |
| MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators | Code | 5 |
| Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators | Code | 5 |
| SpatialTracker: Tracking Any 2D Pixels in 3D Space | Code | 5 |
| ReFT: Representation Finetuning for Language Models | Code | 5 |
| Masked Completion via Structured Diffusion with White-Box Transformers | Code | 5 |
| Long-context LLMs Struggle with Long In-context Learning | Code | 5 |
| CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians | Code | 5 |
| Measuring Taiwanese Mandarin Language Understanding | Code | 5 |
| TFB: Towards Comprehensive and Fair Benchmarking of Time Series Forecasting Methods | Code | 5 |
| InstantSplat: Sparse-view SfM-free Gaussian Splatting in Seconds | Code | 5 |
| GauStudio: A Modular Framework for 3D Gaussian Splatting and Beyond | Code | 5 |
| UniDepth: Universal Monocular Metric Depth Estimation | Code | 5 |
| ChatDBG: Augmenting Debugging with Large Language Models | Code | 5 |
| StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text | Code | 5 |
| MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images | Code | 5 |
| Mora: Enabling Generalist Video Generation via A Multi-Agent Framework | Code | 5 |
| Evolutionary Optimization of Model Merging Recipes | Code | 5 |
| FeatUp: A Model-Agnostic Framework for Features at Any Resolution | Code | 5 |
| Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator | Code | 5 |
| Fundamental Components of Deep Learning: A category-theoretic approach | Code | 5 |
| WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Code | 5 |
| Efficient Diffusion Model for Image Restoration by Residual Shifting | Code | 5 |