The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 17701–17750 of 474278 papers

Title	Date	Tasks	Status	Hype
K-Paths: Reasoning over Graph Paths for Drug Repurposing and Drug Interaction Prediction	Feb 18, 2025	Drug DiscoveryKnowledge Graphs	CodeCode Available	1
Learning-Guided Rolling Horizon Optimization for Long-Horizon Flexible Job-Shop Scheduling	Feb 18, 2025	Combinatorial OptimizationJob Shop Scheduling	CodeCode Available	1
MSE-Adapter: A Lightweight Plugin Endowing LLMs with the Capability to Perform Multimodal Sentiment Analysis and Emotion Recognition	Feb 18, 2025	Emotion RecognitionLarge Language Model	CodeCode Available	1
Toward Foundational Model for Sleep Analysis Using a Multimodal Hybrid Self-Supervised Learning Framework	Feb 18, 2025	Contrastive LearningDiagnostic	CodeCode Available	1
tn4ml: Tensor Network Training and Customization for Machine Learning	Feb 18, 2025	Tensor Networks	CodeCode Available	1
Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection	Feb 18, 2025	Contrastive LearningDomain Generalization	CodeCode Available	1
MaxSup: Overcoming Representation Collapse in Label Smoothing	Feb 18, 2025	image-classificationImage Classification	CodeCode Available	1
WeedsGalore: A Multispectral and Multitemporal UAV-based Dataset for Crop and Weed Segmentation in Agricultural Maize Fields	Feb 18, 2025	Instance SegmentationManagement	CodeCode Available	1
Score-Based Diffusion Policy Compatible with Reinforcement Learning via Optimal Transport	Feb 18, 2025	Imitation Learning	CodeCode Available	1
PartSDF: Part-Based Implicit Neural Representation for Composite 3D Shape Parametrization and Optimization	Feb 18, 2025	3D Shape RepresentationDecoder	CodeCode Available	1
PTQ1.61: Push the Real Limit of Extremely Low-Bit Post-Training Quantization Methods for Large Language Models	Feb 18, 2025	BinarizationQuantization	CodeCode Available	1
Myna: Masking-Based Contrastive Learning of Musical Representations	Feb 18, 2025	Contrastive LearningData Augmentation	CodeCode Available	1
A Cognitive Writing Perspective for Constrained Long-Form Text Generation	Feb 18, 2025	FormText Generation	CodeCode Available	1
DemonAgent: Dynamically Encrypted Multi-Backdoor Implantation Attack on LLM-based Agent	Feb 18, 2025		CodeCode Available	1
Universal Embedding Function for Traffic Classification via QUIC Domain Recognition Pretraining: A Transfer Learning Success	Feb 18, 2025	Traffic ClassificationTransfer Learning	CodeCode Available	1
CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space	Feb 18, 2025	Embodied Question AnsweringQuestion Answering	CodeCode Available	1
Scientific Machine Learning of Flow Resistance Using Universal Shallow Water Equations with Differentiable Programming	Feb 18, 2025	Sensitivity	CodeCode Available	1
Uncertainty-Aware Graph Structure Learning	Feb 18, 2025	Graph structure learning	CodeCode Available	1
k-Graph: A Graph Embedding for Interpretable Time Series Clustering	Feb 18, 2025	ClusteringGraph Embedding	CodeCode Available	1
Towards Text-Image Interleaved Retrieval	Feb 18, 2025	Information RetrievalLanguage Modeling	CodeCode Available	1
Demonstrating specification gaming in reasoning models	Feb 18, 2025		CodeCode Available	1
R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs	Feb 18, 2025	HallucinationKnowledge Graphs	CodeCode Available	1
Automating Prompt Leakage Attacks on Large Language Models Using Agentic Approach	Feb 18, 2025		CodeCode Available	1
Iron Sharpens Iron: Defending Against Attacks in Machine-Generated Text Detection with Adversarial Training	Feb 18, 2025	Adversarial AttackText Detection	CodeCode Available	1
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?	Feb 18, 2025	BenchmarkingBlocking	CodeCode Available	1
Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking	Feb 18, 2025		CodeCode Available	1
Enhancing Audio-Visual Spiking Neural Networks through Semantic-Alignment and Cross-Modal Residual Learning	Feb 18, 2025		CodeCode Available	1
Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Search	Feb 18, 2025	Retrieval	CodeCode Available	1
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity	Feb 18, 2025		CodeCode Available	1
Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements	Feb 18, 2025	Decision MakingFraud Detection	CodeCode Available	1
Disentangling Long-Short Term State Under Unknown Interventions for Online Time Series Forecasting	Feb 18, 2025	DisentanglementTime Series	CodeCode Available	1
MVCNet: Multi-View Contrastive Network for Motor Imagery Classification	Feb 18, 2025	Brain Computer InterfaceContrastive Learning	CodeCode Available	1
RealSyn: An Effective and Scalable Multimodal Interleaved Document Transformation Paradigm	Feb 18, 2025	Representation LearningRetrieval	CodeCode Available	1
G-Refer: Graph Retrieval-Augmented Large Language Model for Explainable Recommendation	Feb 18, 2025	Collaborative FilteringExplainable Recommendation	CodeCode Available	1
VidCapBench: A Comprehensive Benchmark of Video Captioning for Controllable Text-to-Video Generation	Feb 18, 2025	Text-to-Video GenerationVideo Captioning	CodeCode Available	1
UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models	Feb 18, 2025	Text Generation	CodeCode Available	1
Positional Encoding in Transformer-Based Time Series Models: A Survey	Feb 17, 2025	Anomaly DetectionBenchmarking	CodeCode Available	1
video-SALMONN-o1: Reasoning-enhanced Audio-visual Large Language Model	Feb 17, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Causal Inference for Qualitative Outcomes	Feb 17, 2025	Causal Inference	CodeCode Available	1
Maximum Entropy Reinforcement Learning with Diffusion Policy	Feb 17, 2025	Efficient ExplorationMuJoCo	CodeCode Available	1
Towards Mechanistic Interpretability of Graph Transformers via Attention Graphs	Feb 17, 2025	Node Classification	CodeCode Available	1
A Physics-Informed Blur Learning Framework for Imaging Systems	Feb 17, 2025	Deblurring	CodeCode Available	1
VANPY: Voice Analysis Framework	Feb 17, 2025	Action DetectionActivity Detection	CodeCode Available	1
SMART: Self-Aware Agent for Tool Overuse Mitigation	Feb 17, 2025	GSM8KLarge Language Model	CodeCode Available	1
Deep Learning of Proteins with Local and Global Regions of Disorder	Feb 17, 2025	Protein Structure Prediction	CodeCode Available	1
VRoPE: Rotary Position Embedding for Video Large Language Models	Feb 17, 2025	PositionVideo Understanding	CodeCode Available	1
M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis	Feb 17, 2025	Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA)	CodeCode Available	1
Learning Dexterous Bimanual Catch Skills through Adversarial-Cooperative Heterogeneous-Agent Reinforcement Learning	Feb 17, 2025		CodeCode Available	1
Revisiting the Test-Time Scaling of o1-like Models: Do they Truly Possess Test-Time Scaling Capabilities?	Feb 17, 2025		CodeCode Available	1
Leveraging Labelled Data Knowledge: A Cooperative Rectification Learning Network for Semi-supervised 3D Medical Image Segmentation	Feb 17, 2025	Image SegmentationMedical Image Segmentation	CodeCode Available	1