The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1876–1900 of 177339 papers

Title	Date	Tasks	Status	Hype	Score
DeepRetrieval: Hacking Real Search Engines and Retrievers with Large Language Models via Reinforcement Learning	Feb 28, 2025	Information Retrievalreinforcement-learning	CodeCode Available	4	5
LLMMapReduce-V2: Entropy-Driven Convolutional Test-Time Scaling for Generating Long-Form Articles from Extremely Long Resources	Apr 8, 2025	ArticlesForm	CodeCode Available	4	5
KISS-Matcher: Fast and Robust Point Cloud Registration Revisited	Sep 23, 2024	Point Cloud Registration	CodeCode Available	4	5
Cosmos-Transfer1: Conditional World Generation with Adaptive Multimodal Control	Mar 18, 2025		CodeCode Available	4	5
Prototypical Verbalizer for Prompt-based Few-shot Tuning	Mar 18, 2022	Contrastive LearningEntity Typing	CodeCode Available	4	5
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning	May 2, 2024	Autonomous Drivingcounterfactual	CodeCode Available	4	5
NUWA-Infinity: Autoregressive over Autoregressive Generation for Infinite Visual Synthesis	Jul 20, 2022	Image OutpaintingText-to-Image Generation	CodeCode Available	4	5
Autoregressive Video Generation without Vector Quantization	Dec 18, 2024	Image GenerationPrediction	CodeCode Available	4	5
Best-of-N Jailbreaking	Dec 4, 2024		CodeCode Available	4	5
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems	Oct 21, 2024	Automated Theorem ProvingCPU	CodeCode Available	4	5
Continual Learning of Large Language Models: A Comprehensive Survey	Apr 25, 2024	Continual LearningSurvey	CodeCode Available	4	5
KTO: Model Alignment as Prospect Theoretic Optimization	Feb 2, 2024	Attributemodel	CodeCode Available	4	5
Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval	Sep 14, 2024	Contrastive LearningImage Retrieval	CodeCode Available	4	5
Text2SQL is Not Enough: Unifying AI and Databases with TAG	Aug 27, 2024	RAGRetrieval-augmented Generation	CodeCode Available	4	5
Towards No.1 in CLUE Semantic Matching Challenge: Pre-trained Language Model Erlangshen with Propensity-Corrected Loss	Aug 5, 2022	Language ModelingLanguage Modelling	CodeCode Available	4	5
Convolutional Differentiable Logic Gate Networks	Nov 7, 2024		CodeCode Available	4	5
Billion-scale similarity search with GPUs	Feb 28, 2017	GPUImage Similarity Search	CodeCode Available	4	5
Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers	Sep 30, 2024		CodeCode Available	4	5
Faster Neighborhood Attention: Reducing the O(n^2) Cost of Self Attention at the Threadblock Level	Mar 7, 2024		CodeCode Available	4	5
OpenEMMA: Open-Source Multimodal Model for End-to-End Autonomous Driving	Dec 19, 2024	Autonomous Driving	CodeCode Available	4	5
MineDojo: Building Open-Ended Embodied Agents with Internet-Scale Knowledge	Jun 17, 2022	Atari GamesMinecraft	CodeCode Available	4	5
GLIGEN: Open-Set Grounded Text-to-Image Generation	Jan 17, 2023	Conditional Text-to-Image SynthesisImage Generation	CodeCode Available	4	5
Simulation-free Schrödinger bridges via score and flow matching	Jul 7, 2023		CodeCode Available	4	5
Constitutional AI: Harmlessness from AI Feedback	Dec 15, 2022	Decision Making	CodeCode Available	4	5
Revisiting Self-Attentive Sequential Recommendation	Apr 13, 2025	DecoderRecommendation Systems	CodeCode Available	4	5