The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 20501–20550 of 474278 papers

Title	Date	Tasks	Status	Hype
MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning	Oct 12, 2024	Domain Adaptation, Multi-Task Learning	Code Available	1
Towards Efficient Visual-Language Alignment of the Q-Former for Visual Reasoning Tasks	Oct 12, 2024	parameter-efficient fine-tuning, Visual Reasoning	Code Available	1
Rethinking Data Selection at Scale: Random Selection is Almost All You Need	Oct 12, 2024	All	Code Available	1
LogLM: From Task-based to Instruction-based Automated Log Analysis	Oct 12, 2024	Anomaly Detection, Log Parsing	Code Available	1
DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention	Oct 11, 2024	image-classification, Image Classification	Code Available	1
OpenCity: A Scalable Platform to Simulate Urban Activities with Massive LLM Agents	Oct 11, 2024		Code Available	1
Synth-SONAR: Sonar Image Synthesis with Enhanced Diversity and Realism via Dual Diffusion Models and GPT Prompting	Oct 11, 2024	Diversity, Image Generation	Code Available	1
Learning General Representation of 12-Lead Electrocardiogram with a Joint-Embedding Predictive Architecture	Oct 11, 2024	ECG Classification, Electrocardiography (ECG)	Code Available	1
Cross-Modal Bidirectional Interaction Model for Referring Remote Sensing Image Segmentation	Oct 11, 2024	Benchmarking, Image Segmentation	Code Available	1
AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation	Oct 11, 2024	Safety Alignment	Code Available	1
Distillation of Discrete Diffusion through Dimensional Correlations	Oct 11, 2024		Code Available	1
Refusal-Trained LLMs Are Easily Jailbroken As Browser Agents	Oct 11, 2024	Chatbot, Red Teaming	Code Available	1
Retraining-Free Merging of Sparse MoE via Hierarchical Clustering	Oct 11, 2024	Clustering, Language Modeling	Code Available	1
Parameter-Efficient Fine-Tuning of State Space Models	Oct 11, 2024	Language Modeling, Language Modelling	Code Available	1
Chain-of-Restoration: Multi-Task Image Restoration Models are Zero-Shot Step-by-Step Universal Image Restorers	Oct 11, 2024	Image Restoration	Code Available	1
SPORTU: A Comprehensive Sports Understanding Benchmark for Multimodal Large Language Models	Oct 11, 2024	Few-Shot Learning, Multiple-choice	Code Available	1
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization	Oct 11, 2024		Code Available	1
DiffPO: A causal diffusion model for learning distributions of potential outcomes	Oct 11, 2024	Causal Inference, Decision Making	Code Available	1
E-Motion: Future Motion Simulation via Event Sequence Diffusion	Oct 11, 2024		Code Available	1
Zeroth-Order Fine-Tuning of LLMs in Random Subspaces	Oct 11, 2024	Language Modeling, Language Modelling	Code Available	1
Conjugated Semantic Pool Improves OOD Detection with Pre-trained Vision-Language Models	Oct 11, 2024	Out of Distribution (OOD) Detection	Code Available	1
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars	Oct 11, 2024		Code Available	1
Hespi: A pipeline for automatically detecting information from hebarium specimen sheets	Oct 11, 2024	Handwritten Text Recognition, HTR	Code Available	1
Do Unlearning Methods Remove Information from Language Model Weights?	Oct 11, 2024	Language Modeling, Language Modelling	Code Available	1
KinDEL: DNA-Encoded Library Dataset for Kinase Inhibitors	Oct 11, 2024	Drug Discovery	Code Available	1
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding	Oct 11, 2024	Hallucination, Moment Retrieval	Code Available	1
Language Imbalance Driven Rewarding for Multilingual Self-improving	Oct 11, 2024	Arithmetic Reasoning, Instruction Following	Code Available	1
MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices	Oct 11, 2024		Code Available	1
Low-complexity Attention-based Unsupervised Anomalous Sound Detection exploiting Separable Convolutions and Angular Loss	Oct 11, 2024	Anomaly Detection, Task 2	Code Available	1
When Graph meets Multimodal: Benchmarking on Multimodal Attributed Graphs Learning	Oct 11, 2024	Attribute, Benchmarking	Code Available	1
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning	Oct 11, 2024	Diversity, MuJoCo	Code Available	1
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping	Oct 11, 2024	MME, Question Answering	Code Available	1
Batched Energy-Entropy acquisition for Bayesian Optimization	Oct 11, 2024	Bayesian Optimization, Gaussian Processes	Code Available	1
MiRAGeNews: Multimodal Realistic AI-Generated News Detection	Oct 11, 2024		Code Available	1
Mentor-KD: Making Small Language Models Better Multi-step Reasoners	Oct 11, 2024	Knowledge Distillation	Code Available	1
Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models	Oct 11, 2024	Model-based Reinforcement Learning, reinforcement-learning	Code Available	1
Zero-Shot Offline Imitation Learning via Optimal Transport	Oct 11, 2024	Imitation Learning	Code Available	1
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient	Oct 11, 2024	Mamba, Model-based Reinforcement Learning	Code Available	1
PEAR: A Robust and Flexible Automation Framework for Ptychography Enabled by Multiple Large Language Model Agents	Oct 11, 2024	Code Generation, Language Modeling	Code Available	1
PoisonBench: Assessing Large Language Model Vulnerability to Data Poisoning	Oct 11, 2024	Data Poisoning, Language Modeling	Code Available	1
A foundation model for generalizable disease diagnosis in chest X-ray images	Oct 11, 2024	Self-Supervised Learning	Code Available	1
SmartPretrain: Model-Agnostic and Dataset-Agnostic Representation Learning for Motion Prediction	Oct 11, 2024	Autonomous Vehicles, motion prediction	Code Available	1
Recovering complex ecological dynamics from time series using state-space universal dynamic equations	Oct 11, 2024	Time Series	Code Available	1
DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection	Oct 11, 2024	General Knowledge, object-detection	Code Available	1
CrackSegDiff: Diffusion Probability Model-based Multi-modal Crack Segmentation	Oct 10, 2024	Crack Segmentation, Denoising	Code Available	1
Divide and Translate: Compositional First-Order Logic Translation and Verification for Complex Logical Reasoning	Oct 10, 2024	Language Modelling, Large Language Model	Code Available	1
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models	Oct 10, 2024		Code Available	1
Executing Arithmetic: Fine-Tuning Large Language Models as Turing Machines	Oct 10, 2024		Code Available	1
Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining	Oct 10, 2024	Language Modeling, Language Modelling	Code Available	1
SPA: 3D Spatial-Awareness Enables Effective Embodied Representation	Oct 10, 2024	GPU, Neural Rendering	Code Available	1