The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 9801–9850 of 661570 papers

Title	Date	Tasks	Status	Hype
Self-Supervised Speech Quality Estimation and Enhancement Using Only Clean Speech	Feb 26, 2024	QuantizationSpeech Enhancement	CodeCode Available	2
HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields	Feb 26, 2024	3D Hand Pose Estimationhand-object pose	CodeCode Available	2
Pretrained Visual Uncertainties	Feb 26, 2024	Retrieval	CodeCode Available	2
DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models	Feb 26, 2024	MambaState Space Models	CodeCode Available	2
CLAP: Learning Transferable Binary Code Representations with Natural Language Supervision	Feb 26, 2024	Representation LearningTransfer Learning	CodeCode Available	2
Pandora's White-Box: Precise Training Data Detection and Extraction in Large Language Models	Feb 26, 2024	Language Modelling	CodeCode Available	2
TEaR: Improving LLM-based Machine Translation with Systematic Self-Refinement	Feb 26, 2024	Machine TranslationTranslation	CodeCode Available	2
CodeS: Towards Building Open-source Language Models for Text-to-SQL	Feb 26, 2024	Data AugmentationDiagnostic	CodeCode Available	2
Feedback Efficient Online Fine-Tuning of Diffusion Models	Feb 26, 2024	reinforcement-learningReinforcement Learning	CodeCode Available	2
DecompDiff: Diffusion Models with Decomposed Priors for Structure-Based Drug Design	Feb 26, 2024	AvgDrug Design	CodeCode Available	2
Defending LLMs against Jailbreaking Attacks via Backtranslation	Feb 26, 2024	Language Modelling	CodeCode Available	2
CARTE: Pretraining and Transfer for Tabular Learning	Feb 26, 2024	Data IntegrationTransfer Learning	CodeCode Available	2
DEYO: DETR with YOLO for End-to-End Object Detection	Feb 26, 2024	DecoderGPU	CodeCode Available	2
An Integrated Data Processing Framework for Pretraining Foundation Models	Feb 26, 2024		CodeCode Available	2
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers	Feb 25, 2024	In-Context LearningSafety Alignment	CodeCode Available	2
GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction	Feb 25, 2024	3D ReconstructionActive 3D Reconstruction	CodeCode Available	2
VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Feb 25, 2024	Pose EstimationTransfer Learning	CodeCode Available	2
HiGPT: Heterogeneous Graph Language Model	Feb 25, 2024	Graph LearningLanguage Modeling	CodeCode Available	2
GraphWiz: An Instruction-Following Language Model for Graph Problems	Feb 25, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	2
Deep Homography Estimation for Visual Place Recognition	Feb 25, 2024	Homography EstimationRe-Ranking	CodeCode Available	2
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation	Feb 24, 2024		CodeCode Available	2
Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning	Feb 24, 2024	ClassificationFine-Grained Image Recognition	CodeCode Available	2
Reliable Conflictive Multi-View Learning	Feb 24, 2024	MULTI-VIEW LEARNING	CodeCode Available	2
HIR-Diff: Unsupervised Hyperspectral Image Restoration Via Improved Diffusion Models	Feb 24, 2024	DenoisingImage Restoration	CodeCode Available	2
MACRec: a Multi-Agent Collaboration Framework for Recommendation	Feb 23, 2024	Conversational RecommendationDecision Making	CodeCode Available	2
Morphological Symmetries in Robotics	Feb 23, 2024	Data Augmentation	CodeCode Available	2
EasyRL4Rec: An Easy-to-use Library for Reinforcement Learning Based Recommender Systems	Feb 23, 2024	Recommendation SystemsReinforcement Learning (RL)	CodeCode Available	2
Machine Unlearning of Pre-trained Large Language Models	Feb 23, 2024	Machine Unlearning	CodeCode Available	2
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning	Feb 23, 2024	Arithmetic ReasoningAutomated Theorem Proving	CodeCode Available	2
ToMBench: Benchmarking Theory of Mind in Large Language Models	Feb 23, 2024	BenchmarkingMultiple-choice	CodeCode Available	2
Foundation Policies with Hilbert Representations	Feb 23, 2024	Reinforcement Learning (RL)Unsupervised Pre-training	CodeCode Available	2
EMIFF: Enhanced Multi-scale Image Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection	Feb 23, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
GraphEdit: Large Language Models for Graph Structure Learning	Feb 23, 2024	Graph structure learning	CodeCode Available	2
Fast Adversarial Attacks on Language Models In One GPU Minute	Feb 23, 2024	Adversarial AttackComputational Efficiency	CodeCode Available	2
RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation	Feb 23, 2024		CodeCode Available	2
ChunkAttention: Efficient Self-Attention with Prefix-Aware KV Cache and Two-Phase Partition	Feb 23, 2024		CodeCode Available	2
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition	Feb 23, 2024	Image GenerationPersonalized Image Generation	CodeCode Available	2
The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)	Feb 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Grasp, See, and Place: Efficient Unknown Object Rearrangement with Policy Structure Prior	Feb 23, 2024	ObjectObject Rearrangement	CodeCode Available	2
Symbolic Music Generation with Non-Differentiable Rule Guided Diffusion	Feb 22, 2024	Music Generation	CodeCode Available	2
HyperFast: Instant Classification for Tabular Data	Feb 22, 2024	AutoMLClassification	CodeCode Available	2
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models	Feb 22, 2024	AllMixture-of-Experts	CodeCode Available	2
Measuring Multimodal Mathematical Reasoning with MATH-Vision Dataset	Feb 22, 2024	DiversityMath	CodeCode Available	2
Batch and match: black-box variational inference with a score-based divergence	Feb 22, 2024	Variational Inference	CodeCode Available	2
HINT: High-quality INPainting Transformer with Mask-Aware Encoding and Enhanced Attention	Feb 22, 2024	Image Inpaintingspeech-recognition	CodeCode Available	2
GeneOH Diffusion: Towards Generalizable Hand-Object Interaction Denoising via Denoising Diffusion	Feb 22, 2024	Denoising	CodeCode Available	2
tinyBenchmarks: evaluating LLMs with fewer examples	Feb 22, 2024	MMLUMultiple-choice	CodeCode Available	2
WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition	Feb 22, 2024	Image-level Supervised Instance Segmentationobject-detection	CodeCode Available	2
Data Science with LLMs and Interpretable Models	Feb 22, 2024	Additive modelsQuestion Answering	CodeCode Available	2
Stable Neural Stochastic Differential Equations in Analyzing Irregular Time Series Data	Feb 22, 2024	Irregular Time SeriesMissing Values	CodeCode Available	2