The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 17,901–17,950 of 474,278 papers

Title	Date	Tasks	Status	Hype
FOCUS - Multi-View Foot Reconstruction From Synthetically Trained Dense Correspondences	Feb 10, 2025	Surface Reconstruction	Code Available	1
HODDI: A Dataset of High-Order Drug-Drug Interactions for Computational Pharmacovigilance	Feb 10, 2025	Pharmacovigilance	Code Available	1
evclust: Python library for evidential clustering	Feb 10, 2025	Clustering	Code Available	1
Calibrating LLMs with Information-Theoretic Evidential Deep Learning	Feb 10, 2025	Computational Efficiency, Deep Learning	Code Available	1
Leveraging Allophony in Self-Supervised Speech Models for Atypical Pronunciation Assessment	Feb 10, 2025		Code Available	1
LawGPT: Knowledge-Guided Data Generation and Its Application to Legal LLM	Feb 10, 2025	Legal Reasoning	Code Available	1
Self-Correcting Decoding with Generative Feedback for Mitigating Hallucinations in Large Vision-Language Models	Feb 10, 2025	Image Generation, Response Generation	Code Available	1
The Case for Cleaner Biosignals: High-fidelity Neural Compressor Enables Transfer from Cleaner iEEG to Noisier EEG	Feb 10, 2025	EEG, Electroencephalogram (EEG)	Code Available	1
RALLRec: Improving Retrieval Augmented Large Language Model Recommendation with Representation Learning	Feb 10, 2025	Language Modeling, Language Modelling	Code Available	1
Krutrim LLM: Multilingual Foundational Model for over a Billion People	Feb 10, 2025		Code Available	1
From Pixels to Components: Eigenvector Masking for Visual Representation Learning	Feb 10, 2025	image-classification, Image Classification	Code Available	1
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video Environments	Feb 10, 2025	Benchmarking, Optical Character Recognition	Code Available	1
Prompt-SID: Learning Structural Representation Prompt via Latent Diffusion for Single-Image Denoising	Feb 10, 2025	Denoising, Image Denoising	Code Available	1
When Data Manipulation Meets Attack Goals: An In-depth Survey of Attacks for VLMs	Feb 10, 2025		Code Available	1
UniZyme: A Unified Protein Cleavage Site Predictor Enhanced with Enzyme Active-Site Knowledge	Feb 10, 2025		Code Available	1
Retrieving Filter Spectra in CNN for Explainable Sleep Stage Classification	Feb 10, 2025	EEG	Code Available	1
SAVE: Self-Attention on Visual Embedding for Zero-Shot Generic Object Counting	Feb 10, 2025	Exemplar-Free Counting, Object	Code Available	1
A Data-Efficient Pan-Tumor Foundation Model for Oncology CT Interpretation	Feb 10, 2025	Lesion Segmentation, Structured Report Generation	Code Available	1
CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis	Feb 10, 2025	Person Re-Identification	Code Available	1
WyckoffDiff -- A Generative Diffusion Model for Crystal Symmetry	Feb 10, 2025	model, Position	Code Available	1
ProjectTest: A Project-level LLM Unit Test Generation Benchmark and Impact of Error Fixing Mechanisms	Feb 10, 2025		Code Available	1
Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty	Feb 10, 2025	image-classification, Image Classification	Code Available	1
Conditional diffusion model with spatial attention and latent embedding for medical image segmentation	Feb 10, 2025	Hippocampus, Image Segmentation	Code Available	1
A Simple yet Effective DDG Predictor is An Unsupervised Antibody Optimizer and Explainer	Feb 10, 2025		Code Available	1
Implicit Language Models are RNNs: Balancing Parallelization and Expressivity	Feb 10, 2025	Language Modeling, Language Modelling	Code Available	1
Foundation Model of Electronic Medical Records for Adaptive Risk Estimation	Feb 10, 2025	Benchmarking	Code Available	1
DGenNO: A Novel Physics-aware Neural Operator for Solving Forward and Inverse PDE Problems based on Deep, Generative Probabilistic Modeling	Feb 10, 2025		Code Available	1
RelGNN: Composite Message Passing for Relational Deep Learning	Feb 10, 2025	Deep Learning, Graph Attention	Code Available	1
LANTERN++: Enhancing Relaxed Speculative Decoding with Static Tree Drafting for Visual Auto-regressive Models	Feb 10, 2025	Text Generation	Code Available	1
Habitizing Diffusion Planning for Efficient and Effective Decision Making	Feb 10, 2025	CPU, D4RL	Code Available	1
Combining Large Language Models with Static Analyzers for Code Review Generation	Feb 10, 2025	RAG, Retrieval-augmented Generation	Code Available	1
Geometry-aware RL for Manipulation of Varying Shapes and Deformable Objects	Feb 10, 2025		Code Available	1
AIMS.au: A Dataset for the Analysis of Modern Slavery Countermeasures in Corporate Statements	Feb 10, 2025	Sentence	Code Available	1
Large Language Models Meet Symbolic Provers for Logical Reasoning Evaluation	Feb 10, 2025	Logical Reasoning	Code Available	1
Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE	Feb 10, 2025	Diversity, Language Modeling	Code Available	1
Learning Clustering-based Prototypes for Compositional Zero-shot Learning	Feb 10, 2025	Attribute, Clustering	Code Available	1
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation	Feb 9, 2025	Scene Generation	Code Available	1
DexVLA: Vision-Language Model with Plug-In Diffusion Expert for General Robot Control	Feb 9, 2025	Language Modeling, Language Modelling	Code Available	1
MERGE^3: Efficient Evolutionary Merging on Consumer-grade GPUs	Feb 9, 2025	GPU	Code Available	1
LM2: Large Memory Models	Feb 9, 2025	Decoder, MMLU	Code Available	1
UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control	Feb 9, 2025	Image Restoration	Code Available	1
Reinforced Lifelong Editing for Language Models	Feb 9, 2025	Model Editing	Code Available	1
Preventing Rogue Agents Improves Multi-Agent Collaboration	Feb 9, 2025	Action Detection	Code Available	1
Training Language Models for Social Deduction with Multi-Agent Reinforcement Learning	Feb 9, 2025	Multi-agent Reinforcement Learning	Code Available	1
Audio-Visual Representation Learning via Knowledge Distillation from Speech Foundation Models	Feb 9, 2025	Audio-Visual Speech Recognition, Automatic Speech Recognition	Code Available	1
Semantic Role Labeling: A Systematical Survey	Feb 9, 2025	Semantic Role Labeling, Survey	Code Available	1
Beyond Fine-Tuning: A Systematic Study of Sampling Techniques in Personalized Image Generation	Feb 9, 2025	Image Generation, Personalized Image Generation	Code Available	1
Known Unknowns: Out-of-Distribution Property Prediction in Materials and Molecules	Feb 9, 2025	Known Unknowns, Property Prediction	Code Available	1
DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations	Feb 9, 2025	Multi-Task Learning	Code Available	1
Injecting Universal Jailbreak Backdoors into LLMs in Minutes	Feb 9, 2025	Model Editing	Code Available	1