The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers; 248,104 code links; 4,818 tasks

Papers

Showing 1,826–1,850 of 177,339 papers

Title	Date	Tasks	Status	Hype	Score
Guiding Instruction-based Image Editing via Multimodal Large Language Models	Sep 29, 2023	Image Manipulation, Response Generation	Code Available	4	5
Taming Scalable Visual Tokenizer for Autoregressive Image Generation	Dec 3, 2024	Image Generation, Image Reconstruction	Code Available	4	5
SocialED: A Python Library for Social Event Detection	Dec 18, 2024	CPU, Event Detection	Code Available	4	5
OLMoE: Open Mixture-of-Experts Language Models	Sep 3, 2024	Language Modeling, Language Modelling	Code Available	4	5
Eagle: Exploring The Design Space for Multimodal LLMs with Mixture of Encoders	Aug 28, 2024	Optical Character Recognition	Code Available	4	5
On the limits of agency in agent-based models	Sep 14, 2024	Computational Efficiency, counterfactual	Code Available	4	5
Visual Attention Network	Feb 20, 2022	image-classification, Image Classification	Code Available	4	5
Large Language Models for Time Series: A Survey	Feb 2, 2024	Quantization, Survey	Code Available	4	5
LLMMapReduce: Simplified Long-Sequence Processing using Large Language Models	Oct 12, 2024	document understanding	Code Available	4	5
OvercookedV2: Rethinking Overcooked for Zero-Shot Coordination	Mar 22, 2025		Code Available	4	5
TradeMaster: A Holistic Quantitative Trading Platform Empowered by Reinforcement Learning	Sep 26, 2023		Code Available	4	5
Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster	Apr 6, 2023		Code Available	4	5
Large Language Model-Based Agents for Software Engineering: A Survey	Sep 4, 2024	AI Agent, Language Modeling	Code Available	4	5
R1-Onevision: An Open-Source Multimodal Large Language Model Capable of Deep Reasoning	Feb 24, 2025	Language Modeling, Language Modelling	Code Available	4	5
Training Sparse Mixture Of Experts Text Embedding Models	Feb 11, 2025	Mixture-of-Experts, RAG	Code Available	4	5
PyTorch Adapt	Nov 28, 2022	Domain Adaptation	Code Available	4	5
Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality	May 5, 2025	Retrieval	Code Available	4	5
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models	Apr 11, 2024	Language Modelling	Code Available	4	5
MOS: Model Surgery for Pre-Trained Model-Based Class-Incremental Learning	Dec 12, 2024	class-incremental learning, Class Incremental Learning	Code Available	4	5
Images Speak in Images: A Generalist Painter for In-Context Visual Learning	Dec 5, 2022	In-Context Learning, Keypoint Detection	Code Available	4	5
DreamGen: Unlocking Generalization in Robot Learning through Video World Models	May 19, 2025	Video Generation	Code Available	4	5
MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning	Mar 10, 2025	Multimodal Reasoning, Reinforcement Learning (RL)	Code Available	4	5
Cognitive Architectures for Language Agents	Sep 5, 2023	Decision Making	Code Available	4	5
AnimateLCM: Computation-Efficient Personalized Style Video Generation without Personalized Video Data	Feb 1, 2024	Conditional Image Generation, Denoising	Code Available	4	5
Ignore This Title and HackAPrompt: Exposing Systemic Vulnerabilities of LLMs through a Global Scale Prompt Hacking Competition	Oct 24, 2023		Code Available	4	5