SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 1,026–1,050 of 659,983 papers

Title | Status | Hype
OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement | Code | 5
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases | Code | 5
How NeRFs and 3D Gaussian Splatting are Reshaping SLAM: a Survey | Code | 5
VADv2: End-to-End Vectorized Autonomous Driving via Probabilistic Planning | Code | 5
A Survey on Knowledge Distillation of Large Language Models | Code | 5
Efficient Multimodal Learning from Data-centric Perspective | Code | 5
Trust Regions for Explanations via Black-Box Probabilistic Certification | Code | 5
BlackJAX: Composable Bayesian inference in JAX | Code | 5
DataDreamer: A Tool for Synthetic Data Generation and Reproducible LLM Workflows | Code | 5
GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting | Code | 5
OS-Copilot: Towards Generalist Computer Agents with Self-Improvement | Code | 5
Online Iterative Reinforcement Learning from Human Feedback with General Preference Model | Code | 5
WebLINX: Real-World Website Navigation with Multi-Turn Dialogue | Code | 5
MobileVLM V2: Faster and Stronger Baseline for Vision Language Model | Code | 5
EasyInstruct: An Easy-to-use Instruction Processing Framework for Large Language Models | Code | 5
Unified Training of Universal Time Series Forecasting Transformers | Code | 5
Break the Sequential Dependency of LLM Inference Using Lookahead Decoding | Code | 5
Audio Flamingo: A Novel Audio Language Model with Few-Shot Learning and Dialogue Abilities | Code | 5
Executable Code Actions Elicit Better LLM Agents | Code | 5
BootsTAP: Bootstrapped Training for Tracking-Any-Point | Code | 5
SymbolicAI: A framework for logic-based approaches combining generative models and solvers | Code | 5
MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Code | 5
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval | Code | 5
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research | Code | 5
OpenMoE: An Early Effort on Open Mixture-of-Experts Language Models | Code | 5
Page 42 of 26,400