The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1751–1775 of 659983 papers

Title	Date	Tasks	Status	Hype
SimPO: Simple Preference Optimization with a Reference-Free Reward	May 23, 2024	ChatbotInstruction Following	CodeCode Available	4
FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training	Mar 3, 2023	Federated LearningGPU	CodeCode Available	4
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation	Apr 21, 2025	Video Generation	CodeCode Available	4
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning	Aug 4, 2024	DecoderImitation Learning	CodeCode Available	4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges	Jan 4, 2025	FairnessHallucination	CodeCode Available	4
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities	Dec 13, 2022	Decoder	CodeCode Available	4
LESS: Selecting Influential Data for Targeted Instruction Tuning	Feb 6, 2024		CodeCode Available	4
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search	Jun 6, 2024		CodeCode Available	4
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation	Apr 5, 2024	Few-Shot LearningScene Segmentation	CodeCode Available	4
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors	Aug 21, 2023		CodeCode Available	4
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation	Aug 5, 2024		CodeCode Available	4
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner	May 23, 2024	3D Generation3D geometry	CodeCode Available	4
UniTok: A Unified Tokenizer for Visual Generation and Understanding	Feb 27, 2025	Quantization	CodeCode Available	4
LangCell: Language-Cell Pre-training for Cell Identity Understanding	May 9, 2024		CodeCode Available	4
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow Estimation	May 1, 2024	Optical Flow Estimation	CodeCode Available	4
Kwai Keye-VL Technical Report	Jul 2, 2025	Instruction FollowingReinforcement Learning (RL)	CodeCode Available	4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs	Jun 14, 2024	Language ModelingLanguage Modelling	CodeCode Available	4
Towards One-shot Federated Learning: Advances, Challenges, and Future Directions	May 5, 2025	Federated LearningSurvey	CodeCode Available	4
s3: You Don't Need That Much Data to Train a Search Agent via RL	May 20, 2025	RAGReinforcement Learning (RL)	CodeCode Available	4
lmgame-Bench: How Good are LLMs at Playing Games?	May 21, 2025	Language ModelingLanguage Modelling	CodeCode Available	4
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation	May 26, 2025	Human-Domain Subject-to-VideoOpen-Domain Subject-to-Video	CodeCode Available	4
DemoFusion: Democratising High-Resolution Image Generation With No $	Nov 24, 2023	Image Generation	CodeCode Available	4
Look Once to Hear: Target Speech Hearing with Noisy Examples	May 10, 2024	CPUSpeech Extraction	CodeCode Available	4
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World	Feb 29, 2024	AllHallucination	CodeCode Available	4
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay	Apr 4, 2025		CodeCode Available	4