The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 15001–15050 of 474278 papers

Title	Date	Tasks	Status	Hype
Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking	Jun 25, 2025	GPU, Visual Tracking	Code Available	1
Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch	Jun 25, 2025	Computational Efficiency, GPR	Code Available	1
A foundation model with multi-variate parallel attention to generate neuronal activity	Jun 25, 2025	Seizure Detection, Time Series	Code Available	1
Q-resafe: Assessing Safety Risks and Quantization-aware Safety Patching for Quantized Large Language Models	Jun 25, 2025	Quantization	Code Available	1
WattsOnAI: Measuring, Analyzing, and Visualizing Energy and Carbon Footprint of AI Workloads	Jun 25, 2025	Benchmarking	Code Available	1
DPLib: A Standard Benchmark Library for Distributed Power System Analysis and Optimization	Jun 25, 2025	Distributed Optimization	Code Available	1
MAM: Modular Multi-Agent Framework for Multi-Modal Medical Diagnosis via Role-Specialized Collaboration	Jun 24, 2025	Diagnostic, Medical Diagnosis	Code Available	1
Augmenting Multi-Agent Communication with State Delta Trajectory	Jun 24, 2025		Code Available	1
Self-Supervised Multimodal NeRF for Autonomous Driving	Jun 24, 2025	Autonomous Driving, NeRF	Code Available	1
HERCULES: Hierarchical Embedding-based Recursive Clustering Using LLMs for Efficient Summarization	Jun 24, 2025	Clustering	Code Available	1
SMARTIES: Spectrum-Aware Multi-Sensor Auto-Encoder for Remote Sensing Images	Jun 24, 2025		Code Available	1
Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models	Jun 24, 2025	Camouflaged Object Segmentation, Segmentation	Code Available	1
ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model	Jun 24, 2025		Code Available	1
Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting	Jun 24, 2025	Denoising, Weather Forecasting	Code Available	1
MATE: LLM-Powered Multi-Agent Translation Environment for Accessibility Applications	Jun 24, 2025		Code Available	1
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality	Jun 24, 2025	Hallucination, Hallucination Evaluation	Code Available	1
EvDetMAV: Generalized MAV Detection from Moving Event Cameras	Jun 24, 2025		Code Available	1
Fast and Distributed Equivariant Graph Neural Networks by Virtual Node Learning	Jun 24, 2025	Graph Learning	Code Available	1
Introducing EG-IPT and ipt~: a novel electric guitar dataset and a new Max/MSP object for real-time classification of instrumental playing techniques	Jun 24, 2025		Code Available	1
Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture	Jun 24, 2025	Decoder	Code Available	1
EBC-ZIP: Improving Blockwise Crowd Counting with Zero-Inflated Poisson Regression	Jun 24, 2025	Crowd Counting, Density Estimation	Code Available	1
LEGATO: Large-scale End-to-end Generalizable Approach to Typeset OMR	Jun 23, 2025	Decoder	Code Available	1
NIC-RobustBench: A Comprehensive Open-Source Toolkit for Neural Image Compression and Robustness Analysis	Jun 23, 2025	Adversarial Robustness, Image Compression	Code Available	1
The Within-Orbit Adaptive Leapfrog No-U-Turn Sampler	Jun 23, 2025		Code Available	1
Riemannian generative decoder	Jun 23, 2025	Decoder, Representation Learning	Code Available	1
Morse: Dual-Sampling for Lossless Acceleration of Diffusion Models	Jun 23, 2025	Image Generation	Code Available	1
MCN-SLAM: Multi-Agent Collaborative Neural SLAM with Hybrid Implicit Neural Scene Representation	Jun 23, 2025	3D Reconstruction, NeRF	Code Available	1
MedTVT-R1: A Multimodal LLM Empowering Medical Reasoning and Diagnosis	Jun 23, 2025	Diagnostic, Large Language Model	Code Available	1
Sequential keypoint density estimator: an overlooked baseline of skeleton-based video anomaly detection	Jun 23, 2025	Anomaly Detection, Video Anomaly Detection	Code Available	1
Parallel Continuous Chain-of-Thought with Jacobi Iteration	Jun 23, 2025		Code Available	1
Advancing Talking Head Generation: A Comprehensive Survey of Multi-Modal Methodologies, Datasets, Evaluation Metrics, and Loss Functions	Jun 23, 2025	NeRF, Talking Head Generation	Code Available	1
What You Think Is What You Get: Bridge User Intent and Transfer Function Design through Multimodal Large Language Models	Jun 23, 2025		Code Available	1
Rethinking Decoder Design: Improving Biomarker Segmentation Using Depth-to-Space Restoration and Residual Linear Attention	Jun 23, 2025	Decoder, Image Segmentation	Code Available	1
CommVQ: Commutative Vector Quantization for KV Cache Compression	Jun 23, 2025	GPU, GSM8K	Code Available	1
Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review	Jun 23, 2025	Medical Image Analysis, Prompt Learning	Code Available	1
DuetGen: Music Driven Two-Person Dance Generation via Hierarchical Masked Modeling	Jun 23, 2025	Motion Synthesis	Code Available	1
DIP: Unsupervised Dense In-Context Post-training of Visual Representations	Jun 23, 2025	GPU, Meta-Learning	Code Available	1
LIGHTHOUSE: Fast and precise distance to shoreline calculations from anywhere on earth	Jun 23, 2025	CPU	Code Available	1
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective	Jun 22, 2025	In-Context Learning, Large Language Model	Code Available	1
h-calibration: Rethinking Classifier Recalibration with Probabilistic Error-Bounded Objective	Jun 22, 2025	scoring rule	Code Available	1
MiCo: Multiple Instance Learning with Context-Aware Clustering for Whole Slide Image Analysis	Jun 22, 2025	Clustering, Multiple Instance Learning	Code Available	1
OmniESI: A unified framework for enzyme-substrate interaction prediction with progressive conditional deep learning	Jun 22, 2025	Parameter Prediction, Prediction	Code Available	1
DRAMA-X: A Fine-grained Intent Prediction and Risk Reasoning Benchmark For Driving	Jun 21, 2025	Autonomous Driving, Descriptive	Code Available	1
AbRank: A Benchmark Dataset and Metric-Learning Framework for Antibody-Antigen Affinity Ranking	Jun 21, 2025	Metric Learning, Protein Language Model	Code Available	1
ConsumerBench: Benchmarking Generative AI Applications on End-User Devices	Jun 21, 2025	Benchmarking, CPU	Code Available	1
TeXpert: A Multi-Level Benchmark for Evaluating LaTeX Code Generation by LLMs	Jun 20, 2025	Code Generation	Code Available	1
TextBraTS: Text-Guided Volumetric Brain Tumor Segmentation with Innovative Dataset Development and Fusion Module Exploration	Jun 20, 2025	Brain Tumor Segmentation, Image Segmentation	Code Available	1
Visual-Instructed Degradation Diffusion for All-in-One Image Restoration	Jun 20, 2025	All, Deblurring	Code Available	1
Mesh-Informed Neural Operator : A Transformer Generative Approach	Jun 20, 2025	Operator learning	Code Available	1
A Minimalist Optimizer Design for LLM Pretraining	Jun 20, 2025		Code Available	1