The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 20,951–21,000 of 474,278 papers

Title	Date	Tasks	Status	Hype
AdapTrack: Adaptive Thresholding-Based Matching For Multi-object Tracking	Sep 27, 2024	Multi-Object Tracking, Object Tracking	Code Available	1
CESNET-TimeSeries24: Time Series Dataset for Network Traffic Anomaly Detection and Forecasting	Sep 27, 2024	Anomaly Detection, Time Series	Code Available	1
RepairBench: Leaderboard of Frontier Models for Program Repair	Sep 27, 2024	Program Repair	Code Available	1
FlashMix: Fast Map-Free LiDAR Localization via Feature Mixing and Contrastive-Constrained Accelerated Training	Sep 27, 2024	Metric Learning, Position	Code Available	1
Prompt-Driven Temporal Domain Adaptation for Nighttime UAV Tracking	Sep 27, 2024	Domain Adaptation	Code Available	1
CurricuLLM: Automatic Task Curricula Design for Learning Complex Robot Skills using Large Language Models	Sep 27, 2024	Reinforcement Learning (RL), World Knowledge	Code Available	1
AL-GTD: Deep Active Learning for Gaze Target Detection	Sep 27, 2024	Active Learning	Code Available	1
URIEL+: Enhancing Linguistic Inclusion and Usability in a Typological and Multilingual Knowledge Base	Sep 27, 2024		Code Available	1
HR-Extreme: A High-Resolution Dataset for Extreme Weather Forecasting	Sep 27, 2024	Deep Learning, Prediction	Code Available	1
From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long Video Understanding	Sep 27, 2024	Video Understanding, Visual Reasoning	Code Available	1
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models	Sep 27, 2024	Instruction Following	Code Available	1
Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs	Sep 27, 2024	GPU, Recommendation Systems	Code Available	1
Dual Cone Gradient Descent for Training Physics-Informed Neural Networks	Sep 27, 2024		Code Available	1
From Vision to Audio and Beyond: A Unified Model for Audio-Visual Representation and Generation	Sep 27, 2024	Audio Classification, Audio Generation	Code Available	1
Underwater Image Enhancement with Physical-based Denoising Diffusion Implicit Models	Sep 27, 2024	Denoising, Image Enhancement	Code Available	1
ARLBench: Flexible and Efficient Benchmarking for Hyperparameter Optimization in Reinforcement Learning	Sep 27, 2024	AutoML, Benchmarking	Code Available	1
A comprehensive review and new taxonomy on superpixel segmentation	Sep 27, 2024	Superpixels	Code Available	1
Generative AI for fast and accurate statistical computation of fluids	Sep 27, 2024	Operator learning	Code Available	1
LML-DAP: Language Model Learning a Dataset for Data-Augmented Prediction	Sep 27, 2024	Classification, Feature Engineering	Code Available	1
Improving Visual Object Tracking through Visual Prompting	Sep 27, 2024	Object	Code Available	1
Cottention: Linear Transformers With Cosine Attention	Sep 27, 2024		Code Available	1
DualAD: Dual-Layer Planning for Reasoning in Autonomous Driving	Sep 26, 2024	Autonomous Driving, Language Modeling	Code Available	1
IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning	Sep 26, 2024	Image Captioning, Retrieval	Code Available	1
HydraViT: Stacking Heads for a Scalable ViT	Sep 26, 2024		Code Available	1
Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental Learning	Sep 26, 2024	Class Incremental Learning	Code Available	1
A Simple but Strong Baseline for Sounding Video Generation: Effective Adaptation of Audio and Video Diffusion Models for Joint Generation	Sep 26, 2024	Inductive Bias, Video Generation	Code Available	1
MIO: A Foundation Model on Multimodal Tokens	Sep 26, 2024	model, Text Generation	Code Available	1
InterNet: Unsupervised Cross-modal Homography Estimation Based on Interleaved Modality Transfer and Self-supervised Homography Prediction	Sep 26, 2024	Domain Generalization, Homography Estimation	Code Available	1
Realistic Evaluation of Model Merging for Compositional Generalization	Sep 26, 2024	Image Classification	Code Available	1
Uni-Med: A Unified Medical Generalist Foundation Model For Multi-Task Learning Via Connector-MoE	Sep 26, 2024	Image Classification	Code Available	1
MECD: Unlocking Multi-Event Causal Discovery in Video Reasoning	Sep 26, 2024	Causal Discovery, Causal Discovery in Video Reasoning	Code Available	1
DarkSAM: Fooling Segment Anything Model to Segment Nothing	Sep 26, 2024	model	Code Available	1
Revisiting Deep Ensemble Uncertainty for Enhanced Medical Anomaly Detection	Sep 26, 2024	Anomaly Detection	Code Available	1
CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors	Sep 26, 2024		Code Available	1
Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body Shapes	Sep 26, 2024	3D Human Pose Estimation, Pose Estimation	Code Available	1
An Adversarial Perspective on Machine Unlearning for AI Safety	Sep 26, 2024	Machine Unlearning	Code Available	1
RED QUEEN: Safeguarding Large Language Models against Concealed Multi-Turn Jailbreaking	Sep 26, 2024	Red Teaming	Code Available	1
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search	Sep 26, 2024	Math, Mathematical Problem-Solving	Code Available	1
A Time Series is Worth Five Experts: Heterogeneous Mixture of Experts for Traffic Flow Prediction	Sep 26, 2024	Mixture-of-Experts, Prediction	Code Available	1
LightAvatar: Efficient Head Avatar as Dynamic Neural Light Field	Sep 26, 2024	GPU, NeRF	Code Available	1
Wavelet-Driven Generalizable Framework for Deepfake Face Forgery Detection	Sep 26, 2024	DeepFake Detection, Face Swapping	Code Available	1
Self-Distilled Depth Refinement with Noisy Poisson Fusion	Sep 26, 2024	Depth Estimation	Code Available	1
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation	Sep 26, 2024	6D Pose Estimation, 6D Pose Estimation using RGB	Code Available	1
Autonomous Network Defence using Reinforcement Learning	Sep 26, 2024	Reinforcement Learning	Code Available	1
Trustworthy Text-to-Image Diffusion Models: A Timely and Focused Survey	Sep 26, 2024	Fairness, Image Generation	Code Available	1
A Framework for Standardizing Similarity Measures in a Rapidly Evolving Field	Sep 26, 2024		Code Available	1
MALPOLON: A Framework for Deep Species Distribution Modeling	Sep 26, 2024	Benchmarking, GPU	Code Available	1
Taming Diffusion Prior for Image Super-Resolution with Domain Shift SDEs	Sep 26, 2024	Image Restoration, Image Super-Resolution	Code Available	1
GrEmLIn: A Repository of Green Baseline Embeddings for 87 Low-Resource Languages Injected with Multilingual Graph Knowledge	Sep 26, 2024	Natural Language Inference, Sentiment Analysis	Code Available	1
DMC-VB: A Benchmark for Representation Learning for Control with Visual Distractors	Sep 26, 2024	Continuous Control	Code Available	1