The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 18751–18800 of 474278 papers

Title	Date	Tasks	Status	Hype
Generating Traffic Scenarios via In-Context Learning to Learn Better Motion Planner	Dec 24, 2024	Autonomous DrivingDataset Generation	CodeCode Available	1
Beyond Gradient Averaging in Parallel Optimization: Improved Robustness through Gradient Agreement Filtering	Dec 24, 2024	image-classificationImage Classification	CodeCode Available	1
Extract Free Dense Misalignment from CLIP	Dec 24, 2024	HallucinationImage Generation	CodeCode Available	1
Towards Modality Generalization: A Benchmark and Prospective Analysis	Dec 24, 2024		CodeCode Available	1
LongDocURL: a Comprehensive Multimodal Long Document Benchmark Integrating Understanding, Reasoning, and Locating	Dec 24, 2024	document understandingQuestion Answering	CodeCode Available	1
An Automatic Graph Construction Framework based on Large Language Models for Recommendation	Dec 24, 2024	graph constructionQuantization	CodeCode Available	1
Accelerating AIGC Services with Latent Action Diffusion Scheduling in Edge Networks	Dec 24, 2024	Scheduling	CodeCode Available	1
Underwater Image Restoration via Polymorphic Large Kernel CNNs	Dec 24, 2024	Computational EfficiencyFeature Importance	CodeCode Available	1
Learning to engineer protein flexibility	Dec 24, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
VisionGRU: A Linear-Complexity RNN Model for Efficient Image Analysis	Dec 24, 2024	image-classificationImage Classification	CodeCode Available	1
Improving Pareto Set Learning for Expensive Multi-objective Optimization via Stein Variational Hypernetworks	Dec 23, 2024	Gaussian Processes	CodeCode Available	1
Towards Unsupervised Model Selection for Domain Adaptive Object Detection	Dec 23, 2024	Model Selectionobject-detection	CodeCode Available	1
Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection	Dec 23, 2024	object-detectionObject Detection	CodeCode Available	1
The Superposition of Diffusion Models Using the Itô Density Estimator	Dec 23, 2024		CodeCode Available	1
Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding	Dec 23, 2024	EEGElectroencephalogram (EEG)	CodeCode Available	1
CARL-GT: Evaluating Causal Reasoning Capabilities of Large Language Models	Dec 23, 2024	Decision MakingMath	CodeCode Available	1
QTSeg: A Query Token-Based Architecture for Efficient 2D Medical Image Segmentation	Dec 23, 2024	Breast Cancer DetectionDecoder	CodeCode Available	1
Brain-to-Text Benchmark '24: Lessons Learned	Dec 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
Resource-Aware Arabic LLM Creation: Model Adaptation, Integration, and Multi-Domain Testing	Dec 23, 2024	ArabicMMLUDialect Identification	CodeCode Available	1
A Survey on LLM-based Multi-Agent System: Recent Advances and New Frontiers in Application	Dec 23, 2024		CodeCode Available	1
GraphHash: Graph Clustering Enables Parameter Efficiency in Recommender Systems	Dec 23, 2024	Click-Through Rate PredictionClustering	CodeCode Available	1
LegalAgentBench: Evaluating LLM Agents in Legal Domain	Dec 23, 2024	Decision Making	CodeCode Available	1
BrainMAP: Learning Multiple Activation Pathways in Brain Networks	Dec 23, 2024	MambaMixture-of-Experts	CodeCode Available	1
Multi-Modal Grounded Planning and Efficient Replanning For Learning Embodied Agents with A Few Examples	Dec 23, 2024	Common Sense ReasoningTask Planning	CodeCode Available	1
Progressive Boundary Guided Anomaly Synthesis for Industrial Anomaly Detection	Dec 23, 2024	Anomaly DetectionBinary Classification	CodeCode Available	1
On the Generalization Ability of Machine-Generated Text Detectors	Dec 23, 2024	BenchmarkingMisinformation	CodeCode Available	1
Hierarchical Vector Quantization for Unsupervised Action Segmentation	Dec 23, 2024	Action SegmentationClustering	CodeCode Available	1
Efficient fine-tuning methodology of text embedding models for information retrieval: contrastive learning penalty (clp)	Dec 23, 2024	Contrastive LearningInformation Retrieval	CodeCode Available	1
Kernel-Aware Graph Prompt Learning for Few-Shot Anomaly Detection	Dec 23, 2024	Anomaly DetectionPrompt Learning	CodeCode Available	1
Unity is Strength: Unifying Convolutional and Transformeral Features for Better Person Re-Identification	Dec 23, 2024	Person Re-IdentificationUnity	CodeCode Available	1
AFANet: Adaptive Frequency-Aware Network for Weakly-Supervised Few-Shot Semantic Segmentation	Dec 23, 2024	Few-Shot LearningFew-Shot Semantic Segmentation	CodeCode Available	1
Multimodal Learning with Uncertainty Quantification based on Discounted Belief Fusion	Dec 23, 2024	Decision MakingMulti-modal Classification	CodeCode Available	1
CodeV: Issue Resolving with Visual Data	Dec 23, 2024		CodeCode Available	1
Neural Spatial-Temporal Tensor Representation for Infrared Small Target Detection	Dec 23, 2024		CodeCode Available	1
Uncertainty-Participation Context Consistency Learning for Semi-supervised Semantic Segmentation	Dec 23, 2024	Semantic SegmentationSemi-Supervised Semantic Segmentation	CodeCode Available	1
SMAC-Hard: Enabling Mixed Opponent Strategy Script and Self-play on SMAC	Dec 23, 2024	BenchmarkingMulti-agent Reinforcement Learning	CodeCode Available	1
LiveIdeaBench: Evaluating LLMs' Scientific Creativity and Idea Generation with Minimal Context	Dec 23, 2024		CodeCode Available	1
VarAD: Lightweight High-Resolution Image Anomaly Detection via Visual Autoregressive Modeling	Dec 23, 2024	Anomaly DetectionMamba	CodeCode Available	1
Knowledge Editing through Chain-of-Thought	Dec 23, 2024	knowledge editingWorld Knowledge	CodeCode Available	1
WildPPG: A Real-World PPG Dataset of Long Continuous Recordings	Dec 23, 2024	Heart rate estimationPhotoplethysmography (PPG)	CodeCode Available	1
Seamless Detection: Unifying Salient Object Detection and Camouflaged Object Detection	Dec 22, 2024	DecoderObject	CodeCode Available	1
Optimal signal transmission and timescale diversity in a model of human brain operating near criticality	Dec 22, 2024	Diversity	CodeCode Available	1
Empirical evaluation of normalizing flows in Markov Chain Monte Carlo	Dec 22, 2024		CodeCode Available	1
Learning to Generate Gradients for Test-Time Adaptation via Test-Time Training Layers	Dec 22, 2024	MemorizationTest-time Adaptation	CodeCode Available	1
A Conditional Diffusion Model for Electrical Impedance Tomography Image Reconstruction	Dec 22, 2024	DenoisingImage Reconstruction	CodeCode Available	1
SAIL: Sample-Centric In-Context Learning for Document Information Extraction	Dec 22, 2024	In-Context Learning	CodeCode Available	1
LLM-Powered User Simulator for Recommender System	Dec 22, 2024	Recommendation Systemsreinforcement-learning	CodeCode Available	1
Grams: Gradient Descent with Adaptive Momentum Scaling	Dec 22, 2024		CodeCode Available	1
Interactive Classification Metrics: A graphical application to build robust intuition for classification model evaluation	Dec 22, 2024	Binary ClassificationClassification	CodeCode Available	1
Online Preference-based Reinforcement Learning with Self-augmented Feedback from Large Language Model	Dec 22, 2024	Language ModelingLanguage Modelling	CodeCode Available	1