The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 17151–17200 of 474278 papers

Title	Date	Tasks	Status	Hype
Terrier: A Deep Learning Repeat Classifier	Mar 12, 2025	Deep Learning	CodeCode Available	1
Fair Federated Medical Image Classification Against Quality Shift via Inter-Client Progressive State Matching	Mar 12, 2025	FairnessFederated Learning	CodeCode Available	1
CoRe^2: Collect, Reflect and Refine to Generate Better and Faster	Mar 12, 2025		CodeCode Available	1
BIMBA: Selective-Scan Compression for Long-Range Video Question Answering	Mar 12, 2025	Video Question AnsweringZero-Shot Video Question Answer	CodeCode Available	1
Revisiting semi-supervised learning in the era of foundation models	Mar 12, 2025	parameter-efficient fine-tuningPseudo Label	CodeCode Available	1
Robust Multimodal Survival Prediction with the Latent Differentiation Conditional Variational AutoEncoder	Mar 12, 2025	Survival Predictionwhole slide images	CodeCode Available	1
RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling	Mar 12, 2025	3D GenerationText to 3D	CodeCode Available	1
Prompt to Restore, Restore to Prompt: Cyclic Prompting for Universal Adverse Weather Removal	Mar 12, 2025	Image RestorationPrompt Learning	CodeCode Available	1
AdvAD: Exploring Non-Parametric Diffusion for Imperceptible Adversarial Attacks	Mar 12, 2025	DenoisingSSIM	CodeCode Available	1
AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents	Mar 12, 2025		CodeCode Available	1
CyberLLMInstruct: A New Dataset for Analysing Safety of Fine-Tuned LLMs Using Cyber Security Data	Mar 12, 2025	Adversarial AttackMalware Analysis	CodeCode Available	1
CASTLE: Benchmarking Dataset for Static Code Analyzers and LLMs towards CWE Detection	Mar 12, 2025	BenchmarkingCode Classification	CodeCode Available	1
Towards Next-Generation Recommender Systems: A Benchmark for Personalized Recommendation Assistant with LLMs	Mar 12, 2025	Recommendation Systems	CodeCode Available	1
Motion Blender Gaussian Splatting for Dynamic Scene Reconstruction	Mar 12, 2025	Dynamic ReconstructionSimulated Gaussian Manipulation	CodeCode Available	1
How Well Does Your Tabular Generator Learn the Structure of Tabular Data?	Mar 12, 2025		CodeCode Available	1
MP-HSIR: A Multi-Prompt Framework for Universal Hyperspectral Image Restoration	Mar 12, 2025	Image RestorationSpectral Reconstruction	CodeCode Available	1
PerCoV2: Improved Ultra-Low Bit-Rate Perceptual Image Compression with Implicit Hierarchical Masked Image Modeling	Mar 12, 2025	Image Compression	CodeCode Available	1
MOAT: Evaluating LMMs for Capability Integration and Instruction Grounding	Mar 12, 2025		CodeCode Available	1
AnyMoLe: Any Character Motion In-betweening Leveraging Video Diffusion Models	Mar 11, 2025	Motion Generationmotion in-betweening	CodeCode Available	1
Regulatory DNA sequence Design with Reinforcement Learning	Mar 11, 2025	reinforcement-learningReinforcement Learning	CodeCode Available	1
Learning to Detect Objects from Multi-Agent LiDAR Scans without Manual Labels	Mar 11, 2025	3D Object DetectionObject	CodeCode Available	1
CoLMDriver: LLM-based Negotiation Benefits Cooperative Autonomous Driving	Mar 11, 2025	Autonomous Driving	CodeCode Available	1
Detecting Backdoor Attacks in Federated Learning via Direction Alignment Inspection	Mar 11, 2025	Federated Learning	CodeCode Available	1
EgoBlind: Towards Egocentric Visual Assistance for the Blind	Mar 11, 2025		CodeCode Available	1
NullFace: Training-Free Localized Face Anonymization	Mar 11, 2025	AttributeFace Anonymization	CodeCode Available	1
X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction	Mar 11, 2025	3D ReconstructionComputed Tomography (CT)	CodeCode Available	1
VFM-UDA++: Improving Network Architectures and Data Strategies for Unsupervised Domain Adaptive Semantic Segmentation	Mar 11, 2025	Domain AdaptationSemantic Segmentation	CodeCode Available	1
BiasEdit: Debiasing Stereotyped Language Models via Model Editing	Mar 11, 2025	counterfactualLanguage Modeling	CodeCode Available	1
Chemical reasoning in LLMs unlocks steerable synthesis planning and reaction mechanism elucidation	Mar 11, 2025		CodeCode Available	1
Rethinking Diffusion Model in High Dimension	Mar 11, 2025	model	CodeCode Available	1
PhysVLM: Enabling Visual Language Models to Understand Robotic Physical Reachability	Mar 11, 2025	Visual Reasoning	CodeCode Available	1
Aligning Text to Image in Diffusion Models is Easier Than You Think	Mar 11, 2025	Contrastive LearningImage Generation	CodeCode Available	1
MegaSR: Mining Customized Semantics and Expressive Guidance for Image Super-Resolution	Mar 11, 2025	Image Super-ResolutionSuper-Resolution	CodeCode Available	1
SAS: Segment Any 3D Scene with Integrated 2D Priors	Mar 11, 2025	Instance SegmentationSemantic Segmentation	CodeCode Available	1
^RFLAV: Rolling Flow matching for infinite Audio Video generation	Mar 11, 2025	Video Generation	CodeCode Available	1
Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention	Mar 11, 2025	In-Context LearningRetrieval	CodeCode Available	1
Using Powerful Prior Knowledge of Diffusion Model in Deep Unfolding Networks for Image Compressive Sensing	Mar 11, 2025	Compressive SensingImage Compressed Sensing	CodeCode Available	1
VRMDiff: Text-Guided Video Referring Matting Generation of Diffusion	Mar 11, 2025	Image MattingVideo Alignment	CodeCode Available	1
Can We Detect Failures Without Failure Data? Uncertainty-Aware Runtime Failure Detection for Imitation Learning Policies	Mar 11, 2025	Conformal PredictionImitation Learning	CodeCode Available	1
Enhancing Large Language Models for Hardware Verification: A Novel SystemVerilog Assertion Dataset	Mar 11, 2025		CodeCode Available	1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees	Mar 11, 2025	ChatbotLanguage Modeling	CodeCode Available	1
Towards Interpretable Protein Structure Prediction with Sparse Autoencoders	Mar 11, 2025	PredictionProtein Structure Prediction	CodeCode Available	1
EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments	Mar 11, 2025		CodeCode Available	1
Controlling Latent Diffusion Using Latent CLIP	Mar 11, 2025	DenoisingDescriptive	CodeCode Available	1
Chain-of-Thought Reasoning In The Wild Is Not Always Faithful	Mar 11, 2025		CodeCode Available	1
CFNet: Optimizing Remote Sensing Change Detection through Content-Aware Enhancement	Mar 11, 2025	Change Detection	CodeCode Available	1
AG-VPReID: A Challenging Large-Scale Benchmark for Aerial-Ground Video-based Person Re-Identification	Mar 11, 2025	Person Re-IdentificationVideo-Based Person Re-Identification	CodeCode Available	1
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis	Mar 11, 2025	AllDataset Generation	CodeCode Available	1
STEAD: Spatio-Temporal Efficient Anomaly Detection for Time and Compute Sensitive Applications	Mar 11, 2025	Anomaly DetectionAnomaly Detection In Surveillance Videos	CodeCode Available	1
Source-free domain adaptation based on label reliability for cross-domain bearing fault diagnosis	Mar 11, 2025	Data AugmentationDomain Adaptation	CodeCode Available	1