The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers


Showing 6,101–6,150 of 177,340 papers

Title	Date	Tasks	Status	Hype	Score
Disentangling Length from Quality in Direct Preference Optimization	Mar 28, 2024	reinforcement-learning, Reinforcement Learning	Code Available	2	5
WaveMixSR-V2: Enhancing Super-resolution with Higher Efficiency	Sep 16, 2024	Image Super-Resolution, Super-Resolution	Code Available	2	5
Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation	Apr 5, 2024	Image Generation	Code Available	2	5
SymbolFit: Automatic Parametric Modeling with Symbolic Regression	Nov 15, 2024	Form, regression	Code Available	2	5
Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex Scenarios	Jan 30, 2024	Benchmarking	Code Available	2	5
An open dataset for oracle bone script recognition and decipherment	Jan 27, 2024	Decipherment	Code Available	2	5
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation	Dec 19, 2023	Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation	Code Available	2	5
Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Jan 17, 2025	Response Generation	Code Available	2	5
EyeCLIP: A visual-language foundation model for multi-modal ophthalmic image analysis	Sep 10, 2024	Contrastive Learning, Cross-Modal Retrieval	Code Available	2	5
Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling	Oct 10, 2024	Protein Folding	Code Available	2	5
LLM-ESR: Large Language Models Enhancement for Long-tailed Sequential Recommendation	May 31, 2024	Recommendation Systems, Sequential Recommendation	Code Available	2	5
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos	May 29, 2024	EgoSchema, MME	Code Available	2	5
Reward Design with Language Models	Feb 27, 2023	Language Modelling, Large Language Model	Code Available	2	5
Vision-Language Model for Object Detection and Segmentation: A Review and Evaluation	Apr 13, 2025	Domain Adaptation, Language Modeling	Code Available	2	5
Relevance-guided Supervision for OpenQA with ColBERT	Jul 1, 2020	Natural Questions, Open-Domain Question Answering	Code Available	2	5
Modern Evolution Strategies for Creativity: Fitting Concrete Images and Abstract Concepts	Sep 18, 2021	Evolutionary Algorithms	Code Available	2	5
Mixed-curvature decision trees and random forests	Oct 3, 2024	Link Prediction, regression	Code Available	2	5
XLB: A differentiable massively parallel lattice Boltzmann library in Python	Nov 27, 2023	CPU, GPU	Code Available	2	5
OmniXAI: A Library for Explainable AI	Jun 1, 2022	counterfactual, Counterfactual Explanation	Code Available	2	5
Time-MMD: Multi-Domain Multimodal Dataset for Time Series Analysis	Jun 12, 2024	Time Series, Time Series Analysis	Code Available	2	5
AI-Driven Automation Can Become the Foundation of Next-Era Science of Science Research	May 17, 2025	scientific discovery	Code Available	2	5
Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems	Oct 13, 2022	Recommendation Systems	Code Available	2	5
Learned Image Compression with Dictionary-based Entropy Model	Apr 1, 2025	Image Compression, model	Code Available	2	5
Context Autoencoder for Self-Supervised Representation Learning	Feb 7, 2022	Decoder, Instance Segmentation	Code Available	2	5
Cross-Domain Few-Shot Object Detection via Enhanced Open-Set Object Detector	Feb 5, 2024	Cross-Domain Few-Shot, Cross-Domain Few-Shot Object Detection	Code Available	2	5
SSL4EO-S12: A Large-Scale Multi-Modal, Multi-Temporal Dataset for Self-Supervised Learning in Earth Observation	Nov 13, 2022	Earth Observation, Multi-Label Image Classification	Code Available	2	5
Audio-FLAN: A Preliminary Release	Feb 23, 2025	Zero-Shot Learning	Code Available	2	5
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models	Feb 21, 2024	Question Answering	Code Available	2	5
VCoder: Versatile Vision Encoders for Multimodal Large Language Models	Dec 21, 2023	Image Captioning, Image Generation	Code Available	2	5
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance	May 23, 2024	Image Generation, Personalized Image Generation	Code Available	2	5
3D Gaussian Splatting with Deferred Reflection	Apr 29, 2024	Novel View Synthesis	Code Available	2	5
Centroid-Based Efficient Minimum Bayes Risk Decoding	Feb 17, 2024	de-en, Translation	Code Available	2	5
VectorMapNet: End-to-end Vectorized HD Map Learning	Jun 17, 2022	3D Lane Detection, Autonomous Driving	Code Available	2	5
SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection	Jan 28, 2024		Code Available	2	5
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization	Mar 19, 2024	Quantization	Code Available	2	5
TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models	Aug 7, 2023	Hallucination, Object Hallucination	Code Available	2	5
Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance	Sep 2, 2024		Code Available	2	5
Measuring Re-identification Risk	Apr 12, 2023		Code Available	2	5
DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents	Jan 2, 2022	Image Generation, Vocal Bursts Intensity Prediction	Code Available	2	5
RingFormer: A Neural Vocoder with Ring Attention and Convolution-Augmented Transformer	Jan 2, 2025	Audio Generation, text-to-speech	Code Available	2	5
Transformer-Based Visual Segmentation: A Survey	Apr 19, 2023	Autonomous Driving, Point Cloud Segmentation	Code Available	2	5
Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process	Sep 29, 2023	Change Data Generation, Change Detection	Code Available	2	5
MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning	Jul 23, 2024	Benchmarking, Decision Making	Code Available	2	5
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention	Mar 13, 2023	image-classification, Image Classification	Code Available	2	5
YOLOPoint Joint Keypoint and Object Detection	Feb 6, 2024	Object, object-detection	Code Available	2	5
chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics	Aug 28, 2024		Code Available	2	5
VeriThinker: Learning to Verify Makes Reasoning Model Efficient	May 23, 2025	model	Code Available	2	5
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars	Mar 2, 2022	Action Detection, Online Action Detection	Code Available	2	5
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models	Dec 18, 2024	Reasoning Segmentation, Segmentation	Code Available	2	5
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking	Jul 28, 2023	Multi-Object Tracking, Multiple Object Tracking	Code Available	2	5