The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 12451–12500 of 474278 papers

Title	Date	Tasks	Status	Hype
Improving Causal Reasoning in Large Language Models: A Survey	Oct 22, 2024	Decision MakingSurvey	CodeCode Available	2
Parameter-Inverted Image Pyramid Networks for Visual Perception and Multimodal Understanding	Jan 14, 2025	image-classificationImage Classification	CodeCode Available	2
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World	Aug 3, 2023	AllQuestion Answering	CodeCode Available	2
LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis	Aug 18, 2023	Facial Expression RecognitionKnowledge Distillation	CodeCode Available	2
RelationField: Relate Anything in Radiance Fields	Dec 18, 2024	3d scene graph generationGraph Generation	CodeCode Available	2
Effector: A Python package for regional explanations	Apr 3, 2024		CodeCode Available	2
FastDiff: A Fast Conditional Diffusion Model for High-Quality Speech Synthesis	Apr 21, 2022	DenoisingGPU	CodeCode Available	2
Structure-informed Language Models Are Protein Designers	Feb 3, 2023		CodeCode Available	2
RETA-LLM: A Retrieval-Augmented Large Language Model Toolkit	Jun 8, 2023	Answer GenerationFact Checking	CodeCode Available	2
Scaling Multi-Camera 3D Object Detection through Weak-to-Strong Eliciting	Apr 10, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
Reviving The Classics: Active Reward Modeling in Large Language Model Alignment	Feb 4, 2025	Computational EfficiencyExperimental Design	CodeCode Available	2
Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning	May 29, 2022	Few-Shot Text ClassificationMemorization	CodeCode Available	2
TimeMarker: A Versatile Video-LLM for Long and Short Video Understanding with Superior Temporal Localization Ability	Nov 27, 2024	Temporal LocalizationVideo Understanding	CodeCode Available	2
Vox-Fusion: Dense Tracking and Mapping with Voxel-based Neural Implicit Representation	Oct 28, 2022		CodeCode Available	2
LeanVec: Searching vectors faster by making them fit	Dec 26, 2023	Cross-Modal RetrievalDimensionality Reduction	CodeCode Available	2
BIGCity: A Universal Spatiotemporal Model for Unified Trajectory and Traffic State Data Analysis	Dec 1, 2024		CodeCode Available	2
CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design	Jan 14, 2024	Model-based Reinforcement LearningModel Predictive Control	CodeCode Available	2
Incremental Sequence Labeling: A Tale of Two Shifts	Feb 16, 2024	Incremental LearningKnowledge Distillation	CodeCode Available	2
Comprehensive Verilog Design Problems: A Next-Generation Benchmark Dataset for Evaluating Large Language Models and Agents on RTL Design and Verification	Jun 17, 2025	Code Generation	CodeCode Available	2
mLoRA: Fine-Tuning LoRA Adapters via Highly-Efficient Pipeline Parallelism in Multiple GPUs	Dec 5, 2023	GPULarge Language Model	CodeCode Available	2
OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering	Mar 26, 2023		CodeCode Available	2
Graphic Design with Large Multimodal Model	Apr 22, 2024	Layout Generationmodel	CodeCode Available	2
Humanoid Agents: Platform for Simulating Human-like Generative Agents	Oct 9, 2023	Unity	CodeCode Available	2
What Are Expected Queries in End-to-End Object Detection?	Jun 2, 2022	Instance Segmentationobject-detection	CodeCode Available	2
Woodpecker: Hallucination Correction for Multimodal Large Language Models	Oct 24, 2023	Hallucination	CodeCode Available	2
Mini Honor of Kings: A Lightweight Environment for Multi-Agent Reinforcement Learning	Jun 6, 2024	Multi-agent Reinforcement Learning	CodeCode Available	2
radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction	Oct 11, 2024	Multi-Task Learning	CodeCode Available	2
A Model or 603 Exemplars: Towards Memory-Efficient Class-Incremental Learning	May 26, 2022	class-incremental learningClass Incremental Learning	CodeCode Available	2
SNP-S3: Shared Network Pre-training and Significant Semantic Strengthening for Various Video-Text Tasks	Jan 31, 2024	Sentence	CodeCode Available	2
SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution	May 27, 2025	Reinforcement Learning (RL)	CodeCode Available	2
Off-Policy Evaluation for Large Action Spaces via Embeddings	Feb 13, 2022	Multi-Armed BanditsOff-policy evaluation	CodeCode Available	2
Interaction2Code: Benchmarking MLLM-based Interactive Webpage Code Generation from Interactive Prototyping	Nov 5, 2024	BenchmarkingCode Generation	CodeCode Available	2
Aerial Lifting: Neural Urban Semantic and Building Instance Lifting from Aerial Imagery	Mar 18, 2024	Instance SegmentationNeRF	CodeCode Available	2
DPoser: Diffusion Model as Robust 3D Human Pose Prior	Dec 9, 2023	DenoisingHuman Mesh Recovery	CodeCode Available	2
Asynchronous Large Language Model Enhanced Planner for Autonomous Driving	Jun 20, 2024	Autonomous DrivingLanguage Modeling	CodeCode Available	2
BanditPAM++: Faster k-medoids Clustering	Sep 21, 2023		CodeCode Available	2
TransST: Transfer Learning Embedded Spatial Factor Modeling of Spatial Transcriptomics Data	Apr 15, 2025	Transfer Learning	CodeCode Available	2
Recent Advances in Medical Imaging Segmentation: A Survey	May 14, 2025	Domain AdaptationFew-Shot Learning	CodeCode Available	2
Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion	Feb 6, 2025	image-classificationImage Classification	CodeCode Available	2
Model and Data Transfer for Cross-Lingual Sequence Labelling in Zero-Resource Settings	Oct 23, 2022	Cross-Lingual NERCross-Lingual Transfer	CodeCode Available	2
EarthLoc: Astronaut Photography Localization by Indexing Earth from Space	Mar 11, 2024	Data AugmentationDisaster Response	CodeCode Available	2
Revisiting Generative Policies: A Simpler Reinforcement Learning Algorithmic Perspective	Dec 2, 2024	Density EstimationOffline RL	CodeCode Available	2
Revisiting Tampered Scene Text Detection in the Era of Generative AI	Jul 31, 2024	MisinformationScene Text Detection	CodeCode Available	2
LinSATNet: The Positive Linear Satisfiability Neural Networks	Jul 18, 2024	Graph Matching	CodeCode Available	2
OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI	Jun 18, 2024	Benchmarkingscientific discovery	CodeCode Available	2
Discover and Mitigate Multiple Biased Subgroups in Image Classifiers	Mar 19, 2024	Dimensionality ReductionSubgroup Discovery	CodeCode Available	2
LinK3D: Linear Keypoints Representation for 3D LiDAR Point Cloud	Jun 13, 2022	3D Object Detectionobject-detection	CodeCode Available	2
HANet: A Hierarchical Attention Network for Change Detection With Bitemporal Very-High-Resolution Remote Sensing Images	Apr 14, 2024	Change DetectionDeep Learning	CodeCode Available	2
Digital Player: Evaluating Large Language Models based Human-like Agent in Games	Feb 28, 2025	Decision Making	CodeCode Available	2
RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style	Oct 21, 2024	BenchmarkingLanguage Modeling	CodeCode Available	2