SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 476500 of 659983 papers

TitleStatusHype
Foundation Models for Time Series Analysis: A Tutorial and SurveyCode7
One-Step Image Translation with Text-to-Image ModelsCode7
DSP: Dynamic Sequence Parallelism for Multi-Dimensional TransformersCode7
CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language Models to Coding PreferencesCode7
GenAD: Generalized Predictive Model for Autonomous DrivingCode7
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image GenerationCode7
Chronos: Learning the Language of Time SeriesCode7
DragAnything: Motion Control for Anything using Entity RepresentationCode7
Better than classical? The subtle art of benchmarking quantum machine learning modelsCode7
DeepSeek-VL: Towards Real-World Vision-Language UnderstandingCode7
Improving Diffusion Models for Authentic Virtual Try-on in the WildCode7
Symmetry Considerations for Learning Task Symmetric Robot PoliciesCode7
Cradle: Empowering Foundation Agents Towards General Computer ControlCode7
SoftTiger: A Clinical Foundation Model for Healthcare WorkflowsCode7
Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale RecommendationCode7
TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous VariablesCode7
StarCoder 2 and The Stack v2: The Next GenerationCode7
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language ModelsCode7
Transparent Image Layer Diffusion using Latent TransparencyCode7
Dynamic Evaluation of Large Language Models by Meta Probing AgentsCode7
Revisiting Feature Prediction for Learning Visual Representations from VideoCode7
On the Vulnerability of LLM/VLM-Controlled RoboticsCode7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language ModelsCode7
Fast Timing-Conditioned Latent Audio DiffusionCode7
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content CreationCode7
Show:102550
← PrevPage 20 of 26400Next →