The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 12901–12950 of 474278 papers

Title	Date	Tasks	Status	Hype
Leveraging Passage Embeddings for Efficient Listwise Reranking with Large Language Models	Jun 21, 2024	Learning-To-Rank, Passage Ranking	Code Available	2
DsDm: Model-Aware Dataset Selection with Datamodels	Jan 23, 2024	Language Modeling, Language Modelling	Code Available	2
CLIP Itself is a Strong Fine-tuner: Achieving 85.7% and 88.0% Top-1 Accuracy with ViT-B and ViT-L on ImageNet	Dec 12, 2022		Code Available	2
Benchmarking Laparoscopic Surgical Image Restoration and Beyond	May 25, 2025	Benchmarking, Image Restoration	Code Available	2
ULIP: Learning a Unified Representation of Language, Images, and Point Clouds for 3D Understanding	Dec 10, 2022	3D Architecture, 3D Classification	Code Available	2
Monocular, One-stage, Regression of Multiple 3D People	Aug 27, 2020	3D Depth Estimation, 3D Human Pose Estimation	Code Available	2
Giraffe: Adventures in Expanding Context Lengths in LLMs	Aug 21, 2023	16k, 4k	Code Available	2
Effect of Choosing Loss Function when Using T-batching for Representation Learning on Dynamic Networks	Aug 13, 2023	Graph Representation Learning, Link Prediction	Code Available	2
Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition	Apr 10, 2023	image-classification, Image Classification	Code Available	2
What is a Goldilocks Face Verification Test Set?	May 24, 2024	Face Recognition, Face Verification	Code Available	2
DiffArtist: Towards Structure and Appearance Controllable Image Stylization	Jul 22, 2024	Disentanglement, Image Stylization	Code Available	2
Structure-Aligned Protein Language Model	May 22, 2025	Contrastive Learning, Language Modeling	Code Available	2
Detecting music deepfakes is easy but actually hard	May 7, 2024	DeepFake Detection, Face Swapping	Code Available	2
Denoising Diffusion Bridge Models	Sep 29, 2023	Denoising, Image Generation	Code Available	2
Test-time Alignment of Diffusion Models without Reward Over-optimization	Jan 10, 2025	Diversity	Code Available	2
Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model	Feb 8, 2025	Image Generation	Code Available	2
AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML	Oct 3, 2024	AutoML, Code Generation	Code Available	2
Towards Collaborative Autonomous Driving: Simulation Platform and End-to-End System	Apr 15, 2024	Autonomous Driving	Code Available	2
ReservoirComputing.jl: An Efficient and Modular Library for Reservoir Computing Models	Apr 8, 2022		Code Available	2
LoRA-Ensemble: Efficient Uncertainty Modelling for Self-attention Networks	May 23, 2024	Decision Making	Code Available	2
"I'm sorry to hear that": Finding New Biases in Language Models with a Holistic Descriptor Dataset	May 18, 2022	Sentence	Code Available	2
Music Understanding LLaMA: Advancing Text-to-Music Generation with Question Answering and Captioning	Aug 22, 2023	Caption Generation, Large Language Model	Code Available	2
Large Language Models on Graphs: A Comprehensive Survey	Dec 5, 2023	Language Modelling, Survey	Code Available	2
SegEarth-R1: Geospatial Pixel Reasoning via Large Language Model	Apr 13, 2025	Language Modeling, Language Modelling	Code Available	2
Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot	Feb 20, 2023	Efficient Exploration, reinforcement-learning	Code Available	2
Towards Better Dynamic Graph Learning: New Architecture and Unified Library	Mar 23, 2023	Dynamic Link Prediction, Dynamic Node Classification	Code Available	2
City3D: Large-Scale Building Reconstruction from Airborne LiDAR Point Clouds	Jan 25, 2022	Surface Reconstruction	Code Available	2
Fast-DDPM: Fast Denoising Diffusion Probabilistic Models for Medical Image-to-Image Generation	May 23, 2024	Denoising, Image Denoising	Code Available	2
GenRL: Multimodal-foundation world models for generalization in embodied agents	Jun 26, 2024	Benchmarking, Reinforcement Learning (RL)	Code Available	2
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs	Oct 14, 2024	Computational Efficiency, Question Answering	Code Available	2
Towards Lightweight Super-Resolution with Dual Regression Learning	Jul 16, 2022	Image Super-Resolution, Model Compression	Code Available	2
Scale Decoupled Distillation	Mar 20, 2024	Knowledge Distillation	Code Available	2
MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders	Feb 20, 2025	Computational Efficiency	Code Available	2
Explicit Visual Prompting for Low-Level Structure Segmentations	Mar 20, 2023	Camouflaged Object Segmentation, Defocus Blur Detection	Code Available	2
Tree-Ring Watermarks: Fingerprints for Diffusion Images that are Invisible and Robust	May 31, 2023	Image Generation	Code Available	2
Omni Aggregation Networks for Lightweight Image Super-Resolution	Apr 20, 2023	Image Super-Resolution, Super-Resolution	Code Available	2
You Only Look at Once for Real-time and Generic Multi-Task	Oct 2, 2023	Autonomous Driving, Drivable Area Detection	Code Available	2
Domino: Discovering Systematic Errors with Cross-Modal Embeddings	Mar 24, 2022	Representation Learning, Slice Discovery	Code Available	2
h-Edit: Effective and Flexible Diffusion-Based Editing via Doob's h-Transform	Mar 4, 2025		Code Available	2
InterFusion: Text-Driven Generation of 3D Human-Object Interaction	Mar 22, 2024	3D Generation, global-optimization	Code Available	2
SystolicAttention: Fusing FlashAttention within a Single Systolic Array	Jul 15, 2025	Scheduling	Code Available	2
TAB: Unified Benchmarking of Time Series Anomaly Detection Methods	Jun 22, 2025	Anomaly Detection, Benchmarking	Code Available	2
Named Entity Recognition in Twitter: A Dataset and Analysis on Short-Term Temporal Shifts	Oct 7, 2022	Articles, Language Modeling	Code Available	2
AdaReTaKe: Adaptive Redundancy Reduction to Perceive Longer for Video-language Understanding	Mar 16, 2025	Video Understanding	Code Available	2
RAM: Retrieval-Based Affordance Transfer for Generalizable Zero-Shot Robotic Manipulation	Jul 5, 2024	Human-Object Interaction Detection, Retrieval	Code Available	2
Exploring Diffusion Transformer Designs via Grafting	Jun 5, 2025		Code Available	2
LinkAlign: Scalable Schema Linking for Real-World Large-Scale Multi-Database Text-to-SQL	Mar 24, 2025	Retrieval, Text to SQL	Code Available	2
Wildfire Smoke Detection with Computer Vision	Jan 12, 2023	Object Detection	Code Available	2
Top Leaderboard Ranking = Top Coding Proficiency, Always? EvoEval: Evolving Coding Benchmarks via LLM	Mar 28, 2024	Code Generation, HumanEval	Code Available	2
Process Reward Model with Q-Value Rankings	Oct 15, 2024	Decision Making, Language Modeling	Code Available	2