The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers


Showing 476–500 of 659,983 papers

Title	Date	Tasks	Status	Hype
Foundation Models for Time Series Analysis: A Tutorial and Survey	Mar 21, 2024	Survey, Time Series	Code Available	7
One-Step Image Translation with Text-to-Image Models	Mar 18, 2024	Denoising, Translation	Code Available	7
DSP: Dynamic Sequence Parallelism for Multi-Dimensional Transformers	Mar 15, 2024	Text Generation, Video Generation	Code Available	7
CodeUltraFeedback: An LLM-as-a-Judge Dataset for Aligning Large Language Models to Coding Preferences	Mar 14, 2024	HumanEval	Code Available	7
GenAD: Generalized Predictive Model for Autonomous Driving	Mar 14, 2024	Autonomous Driving, model	Code Available	7
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation	Mar 13, 2024	Image Generation, Prompt Engineering	Code Available	7
Chronos: Learning the Language of Time Series	Mar 12, 2024	Gaussian Processes, Language Modeling	Code Available	7
DragAnything: Motion Control for Anything using Entity Representation	Mar 12, 2024	Object, Video Generation	Code Available	7
Better than classical? The subtle art of benchmarking quantum machine learning models	Mar 11, 2024	Benchmarking, Binary Classification	Code Available	7
DeepSeek-VL: Towards Real-World Vision-Language Understanding	Mar 8, 2024	Chatbot, Language Modelling	Code Available	7
Improving Diffusion Models for Authentic Virtual Try-on in the Wild	Mar 8, 2024	Virtual Try-on	Code Available	7
Symmetry Considerations for Learning Task Symmetric Robot Policies	Mar 7, 2024	Data Augmentation, Deep Reinforcement Learning	Code Available	7
Cradle: Empowering Foundation Agents Towards General Computer Control	Mar 5, 2024	Efficient Exploration	Code Available	7
SoftTiger: A Clinical Foundation Model for Healthcare Workflows	Mar 1, 2024	Language Modelling, Large Language Model	Code Available	7
Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation	Mar 1, 2024		Code Available	7
TimeXer: Empowering Transformers for Time Series Forecasting with Exogenous Variables	Feb 29, 2024	Time Series, Time Series Forecasting	Code Available	7
StarCoder 2 and The Stack v2: The Next Generation	Feb 29, 2024	Code Completion, Code Generation	Code Available	7
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models	Feb 29, 2024	Language Modelling, Mamba	Code Available	7
Transparent Image Layer Diffusion using Latent Transparency	Feb 27, 2024		Code Available	7
Dynamic Evaluation of Large Language Models by Meta Probing Agents	Feb 21, 2024	Data Augmentation	Code Available	7
Revisiting Feature Prediction for Learning Visual Representations from Video	Feb 15, 2024	Prediction	Code Available	7
On the Vulnerability of LLM/VLM-Controlled Robotics	Feb 15, 2024	Language Modelling, Robot Manipulation	Code Available	7
SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models	Feb 8, 2024	Benchmarking, Diversity	Code Available	7
Fast Timing-Conditioned Latent Audio Diffusion	Feb 7, 2024	Audio Generation, GPU	Code Available	7
LGM: Large Multi-View Gaussian Model for High-Resolution 3D Content Creation	Feb 7, 2024		Code Available	7