The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8651–8700 of 661570 papers

Title	Date	Tasks	Status	Hype
CHGNet: Pretrained universal neural network potential for charge-informed atomistic modeling	Feb 28, 2023	Atomic ForcesGraph Neural Network	CodeCode Available	2
LogAI: A Library for Log Analytics and Intelligence	Jan 31, 2023	Anomaly DetectionLog Parsing	CodeCode Available	2
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model	Apr 3, 2023	DenoisingDiversity	CodeCode Available	2
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints	Aug 3, 2023	Image GenerationLanguage Modelling	CodeCode Available	2
Geometric Latent Diffusion Models for 3D Molecule Generation	May 2, 2023	3D Molecule GenerationUnconditional Molecule Generation	CodeCode Available	2
Accelerating Self-Play Learning in Go	Feb 27, 2019	Game of Go	CodeCode Available	2
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models	May 23, 2023	Common Sense ReasoningImage Generation	CodeCode Available	2
MoEUT: Mixture-of-Experts Universal Transformers	May 25, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms	May 27, 2025	Bayesian OptimizationBenchmarking	CodeCode Available	2
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction	Jul 16, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs	Apr 25, 2024	Visual GroundingVisual Question Answering	CodeCode Available	2
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization	Sep 9, 2023	Language ModellingLarge Language Model	CodeCode Available	2
Cross-Image Relational Knowledge Distillation for Semantic Segmentation	Apr 14, 2022	Knowledge DistillationSegmentation	CodeCode Available	2
MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation	Oct 5, 2023	BenchmarkingDecision Making	CodeCode Available	2
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection	Dec 16, 2024	LLM-generated Text DetectionText Detection	CodeCode Available	2
R3LIVE: A Robust, Real-time, RGB-colored, LiDAR-Inertial-Visual tightly-coupled state Estimation and mapping package	Sep 10, 2021	Sensor FusionState Estimation	CodeCode Available	2
ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents	Feb 21, 2024	Active LearningPosition	CodeCode Available	2
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval	Jul 21, 2024	General KnowledgeHighlight Detection	CodeCode Available	2
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization	Sep 20, 2021	AutoMLBayesian Optimization	CodeCode Available	2
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device	Sep 27, 2021	Video RecognitionVideo Understanding	CodeCode Available	2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities	May 27, 2024	Autonomous DrivingOut-of-Distribution Detection	CodeCode Available	2
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment	May 29, 2024	Instruction Following	CodeCode Available	2
FEC: Fast Euclidean Clustering for Point Cloud Segmentation	Aug 16, 2022	ClusteringInstance Segmentation	CodeCode Available	2
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging	Jan 5, 2024	Medical Report GenerationMedical Visual Question Answering	CodeCode Available	2
Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt	Jun 6, 2024	Language ModellingLarge Language Model	CodeCode Available	2
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor	Jun 10, 2024	RAGRetrieval	CodeCode Available	2
Exploring Orthogonality in Open World Object Detection	Jan 1, 2024	Incremental LearningObject	CodeCode Available	2
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects	Dec 13, 2024	Large Language Model	CodeCode Available	2
Equinox: neural networks in JAX via callable PyTrees and filtered transformations	Oct 30, 2021		CodeCode Available	2
Deep Architectures for Content Moderation and Movie Content Rating	Dec 8, 2022	Action RecognitionGenre classification	CodeCode Available	2
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models	Jun 24, 2024	Referring ExpressionReferring Expression Comprehension	CodeCode Available	2
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration	Jun 26, 2024	Contrastive LearningDeblurring	CodeCode Available	2
Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction	Nov 22, 2021	GPUNeRF	CodeCode Available	2
Investigating Tradeoffs in Real-World Video Super-Resolution	Nov 24, 2021	BenchmarkingSuper-Resolution	CodeCode Available	2
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search	May 21, 2021		CodeCode Available	2
Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI	Dec 30, 2021		CodeCode Available	2
POCO: Point Convolution for Surface Reconstruction	Jan 5, 2022	3D ReconstructionSurface Reconstruction	CodeCode Available	2
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control	Aug 15, 2022	Humanoid Control	CodeCode Available	2
Speech Denoising in the Waveform Domain with Self-Attention	Feb 15, 2022	DecoderDenoising	CodeCode Available	2
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework	Feb 15, 2022	3D Point Cloud ClassificationPoint Cloud Segmentation	CodeCode Available	2
Differentiable and Learnable Robot Models	Feb 22, 2022		CodeCode Available	2
OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for Robotics	Mar 1, 2022		CodeCode Available	2
Recovering 3D Human Mesh from Monocular Images: A Survey	Mar 3, 2022	3D human pose and shape estimationHuman Mesh Recovery	CodeCode Available	2
SoftGroup for 3D Instance Segmentation on Point Clouds	Mar 3, 2022	3D Instance Segmentation3D Object Detection	CodeCode Available	2
Freeform Body Motion Generation from Speech	Mar 4, 2022	DiversityMotion Generation	CodeCode Available	2
Class-incremental Learning for Time Series: Benchmark and Evaluation	Feb 19, 2024	Activity RecognitionBenchmarking	CodeCode Available	2
MotionCLIP: Exposing Human Motion Generation to CLIP Space	Mar 15, 2022	DisentanglementMotion Generation	CodeCode Available	2
Real-time Object Detection for Streaming Perception	Mar 23, 2022	Autonomous DrivingObject	CodeCode Available	2
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?	Nov 6, 2024		CodeCode Available	2
Latent Modulated Function for Computational Optimal Continuous Image Representation	Apr 25, 2024	Computational EfficiencySuper-Resolution	CodeCode Available	2