The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1501–1525 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
LESS: Selecting Influential Data for Targeted Instruction Tuning	Feb 6, 2024		CodeCode Available	4	5
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search	Jun 6, 2024		CodeCode Available	4	5
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation	Apr 5, 2024	Few-Shot LearningScene Segmentation	CodeCode Available	4	5
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors	Aug 21, 2023		CodeCode Available	4	5
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation	Aug 5, 2024		CodeCode Available	4	5
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner	May 23, 2024	3D Generation3D geometry	CodeCode Available	4	5
UniTok: A Unified Tokenizer for Visual Generation and Understanding	Feb 27, 2025	Quantization	CodeCode Available	4	5
LangCell: Language-Cell Pre-training for Cell Identity Understanding	May 9, 2024		CodeCode Available	4	5
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow Estimation	May 1, 2024	Optical Flow Estimation	CodeCode Available	4	5
Kwai Keye-VL Technical Report	Jul 2, 2025	Instruction FollowingReinforcement Learning (RL)	CodeCode Available	4	5
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs	Jun 14, 2024	Language ModelingLanguage Modelling	CodeCode Available	4	5
Towards One-shot Federated Learning: Advances, Challenges, and Future Directions	May 5, 2025	Federated LearningSurvey	CodeCode Available	4	5
s3: You Don't Need That Much Data to Train a Search Agent via RL	May 20, 2025	RAGReinforcement Learning (RL)	CodeCode Available	4	5
lmgame-Bench: How Good are LLMs at Playing Games?	May 21, 2025	Language ModelingLanguage Modelling	CodeCode Available	4	5
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation	May 26, 2025	Human-Domain Subject-to-VideoOpen-Domain Subject-to-Video	CodeCode Available	4	5
DemoFusion: Democratising High-Resolution Image Generation With No $	Nov 24, 2023	Image Generation	CodeCode Available	4	5
Look Once to Hear: Target Speech Hearing with Noisy Examples	May 10, 2024	CPUSpeech Extraction	CodeCode Available	4	5
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World	Feb 29, 2024	AllHallucination	CodeCode Available	4	5
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay	Apr 4, 2025		CodeCode Available	4	5
Eureka: Human-Level Reward Design via Coding Large Language Models	Oct 19, 2023	Decision MakingIn-Context Learning	CodeCode Available	4	5
High Fidelity Neural Audio Compression	Oct 24, 2022	Audio CompressionAudio Signal Processing	CodeCode Available	4	5
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis	Jul 2, 2024	AttributeImage Generation	CodeCode Available	4	5
Qiskit Machine Learning: an open-source library for quantum machine learning tasks at scale on quantum hardware and classical simulators	May 23, 2025	Quantum Machine Learning	CodeCode Available	4	5
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis	Jun 19, 2022	Generative Adversarial NetworkImage Generation	CodeCode Available	4	5
CoTracker: It is Better to Track Together	Jul 14, 2023	GPUmotion prediction	CodeCode Available	4	5