The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 376–400 of 659983 papers

Title	Date	Tasks	Status	Hype
The Prompt Report: A Systematic Survey of Prompting Techniques	Jun 6, 2024	Prompt EngineeringSurvey	CodeCode Available	7
Qwen2.5-Omni Technical Report	Mar 26, 2025	Automatic Speech Recognition (ASR)GSM8K	CodeCode Available	7
Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large-Scale Recommendation	Mar 1, 2024		CodeCode Available	7
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems	Mar 31, 2025	AutoMLContinual Learning	CodeCode Available	7
Labeling supervised fine-tuning data with the scaling law	May 5, 2024	coreference-resolutionCoreference Resolution	CodeCode Available	7
A Survey of Graph Retrieval-Augmented Generation for Customized Large Language Models	Jan 21, 2025	RAGRetrieval	CodeCode Available	7
When LLMs step into the 3D World: A Survey and Meta-Analysis of 3D Tasks via Multi-modal Large Language Models	May 16, 2024	In-Context LearningQuestion Answering	CodeCode Available	7
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines	Oct 5, 2023	Language ModelingLanguage Modelling	CodeCode Available	7
TotalSegmentator MRI: Robust Sequence-independent Segmentation of Multiple Anatomic Structures in MRI	May 29, 2024	MRI segmentation	CodeCode Available	7
RouteLLM: Learning to Route LLMs with Preference Data	Jun 26, 2024	Data AugmentationTransfer Learning	CodeCode Available	7
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation	Apr 3, 2024	Image GenerationText to Image Generation	CodeCode Available	7
YOLOv12: Attention-Centric Real-Time Object Detectors	Feb 18, 2025	GPUObject	CodeCode Available	7
Long-form music generation with latent diffusion	Apr 16, 2024	Audio GenerationForm	CodeCode Available	7
LLM-AutoDiff: Auto-Differentiate Any LLM Workflow	Jan 28, 2025	Prompt EngineeringQuestion Answering	CodeCode Available	7
Global Structure-from-Motion Revisited	Jul 29, 2024	16k	CodeCode Available	7
Revisiting Feature Prediction for Learning Visual Representations from Video	Feb 15, 2024	Prediction	CodeCode Available	7
Fast Text-to-Audio Generation with Adversarial Post-Training	May 13, 2025	ARCAudio Generation	CodeCode Available	7
GLM-4-Voice: Towards Intelligent and Human-Like End-to-End Spoken Chatbot	Dec 3, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	7
V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning	Jun 11, 2025	Action AnticipationLarge Language Model	CodeCode Available	7
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention	Jun 16, 2025	Mixture-of-ExpertsReinforcement Learning (RL)	CodeCode Available	7
Flow Matching Guide and Code	Dec 9, 2024	Text Generation	CodeCode Available	7
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads	Jan 19, 2024		CodeCode Available	7
ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI	Oct 1, 2024	GPUImitation Learning	CodeCode Available	7
Improving Diffusion Models for Authentic Virtual Try-on in the Wild	Mar 8, 2024	Virtual Try-on	CodeCode Available	7
Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena	Jun 9, 2023	ChatbotLanguage Modelling	CodeCode Available	7