The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers


Showing 401–425 of 659,983 papers

Title	Date	Tasks	Status	Hype
PowerPM: Foundation Model for Power Systems	Aug 7, 2024	Contrastive Learning, model	Code Available	7
Segment Anything in Medical Images and Videos: Benchmark and Deployment	Aug 6, 2024	Benchmarking, Segmentation	Code Available	7
Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining	Aug 5, 2024	Decoder, Depth Estimation	Code Available	7
Global Structure-from-Motion Revisited	Jul 29, 2024	16k	Code Available	7
RT-DETRv2: Improved Baseline with Bag-of-Freebies for Real-Time Detection Transformer	Jul 24, 2024	Data Augmentation, Decoder	Code Available	7
Stable Audio Open	Jul 19, 2024	Audio Generation, Text-to-Music Generation	Code Available	7
ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness?	Jul 19, 2024	Benchmarking, Code Generation	Code Available	7
MMAU: A Holistic Benchmark of Agent Capabilities Across Diverse Domains	Jul 18, 2024		Code Available	7
Qwen2-Audio Technical Report	Jul 15, 2024	Instruction Following, Language Modelling	Code Available	7
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditions	Jul 11, 2024	Image Animation	Code Available	7
MambaVision: A Hybrid Mamba-Transformer Vision Backbone	Jul 10, 2024	Image Classification, Instance Segmentation	Code Available	7
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large Multimodal Models	Jul 10, 2024	Video Question Answering, Zero-Shot Video Question Answer	Code Available	7
PEER: Expertizing Domain-Specific Tasks with a Multi-Agent Framework and Tuning Methods	Jul 9, 2024	Information Retrieval, LEMMA	Code Available	7
Agentless: Demystifying LLM-based Software Engineering Agents	Jul 1, 2024	Program Repair	Code Available	7
ColPali: Efficient Document Retrieval with Vision Language Models	Jun 27, 2024	document understanding, RAG	Code Available	7
RouteLLM: Learning to Route LLMs with Preference Data	Jun 26, 2024	Data Augmentation, Transfer Learning	Code Available	7
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO	Jun 25, 2024	reinforcement-learning, Reinforcement Learning	Code Available	7
EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees	Jun 24, 2024		Code Available	7
Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation	Jun 24, 2024	parameter-efficient fine-tuning, Sentence	Code Available	7
Mooncake: A KVCache-centric Disaggregated Architecture for LLM Serving	Jun 24, 2024	CPU, GPU	Code Available	7
Grants4Companies: Applying Declarative Methods for Recommending and Reasoning About Business Grants in the Austrian Public Administration (System Description)	Jun 21, 2024		Code Available	7
NAVSIM: Data-Driven Non-Reactive Autonomous Vehicle Simulation and Benchmarking	Jun 21, 2024	Autonomous Driving, Benchmarking	Code Available	7
DataComp-LM: In search of the next generation of training sets for language models	Jun 17, 2024	Language Modelling, MMLU	Code Available	7
Grounding Image Matching in 3D with MASt3R	Jun 14, 2024	3D Reconstruction	Code Available	7
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers	Jun 14, 2024	Decoder	Code Available	7