The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 12251–12300 of 474278 papers

Title	Date	Tasks	Status	Hype
Marco-Bench-MIF: On Multilingual Instruction-Following Capability of Large Language Models	Jul 16, 2025		CodeCode Available	0
Topology Enhanced MARL for Multi-Vehicle Cooperative Decision-Making of CAVs	Jul 16, 2025		CodeCode Available	0
Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language Models	Jul 16, 2025		CodeCode Available	0
Advancing Retrieval-Augmented Generation for Structured Enterprise and Internal Data	Jul 16, 2025		CodeCode Available	0
Vidi: Large Multimodal Models for Video Understanding and Editing	Jul 16, 2025		—Unverified	0
CT-ScanGaze: A Dataset and Baselines for 3D Volumetric Scanpath Modeling	Jul 16, 2025		—Unverified	0
Improving physics-informed neural network extrapolation via transfer learning and adaptive activation functions	Jul 16, 2025		CodeCode Available	0
Mixture of Raytraced Experts	Jul 16, 2025		CodeCode Available	0
MOFSimBench: Evaluating Universal Machine Learning Interatomic Potentials In Metal--Organic Framework Molecular Modeling	Jul 16, 2025		CodeCode Available	0
CorrMoE: Mixture of Experts with De-stylization Learning for Cross-Scene and Cross-Domain Correspondence Pruning	Jul 16, 2025		CodeCode Available	0
CompressedVQA-HDR: Generalized Full-reference and No-reference Quality Assessment Models for Compressed High Dynamic Range Videos	Jul 16, 2025		CodeCode Available	0
Dataset Ownership Verification for Pre-trained Masked Models	Jul 16, 2025		CodeCode Available	0
MS-DETR: Towards Effective Video Moment Retrieval and Highlight Detection by Joint Motion-Semantic Learning	Jul 16, 2025		CodeCode Available	0
Open-Vocabulary Indoor Object Grounding with 3D Hierarchical Scene Graph	Jul 16, 2025		CodeCode Available	0
Wavelet-based Decoupling Framework for low-light Stereo Image Enhancement	Jul 16, 2025		CodeCode Available	0
Text-driven Multiplanar Visual Interaction for Semi-supervised Medical Image Segmentation	Jul 16, 2025		CodeCode Available	0
QuRe: Query-Relevant Retrieval through Hard Negative Sampling in Composed Image Retrieval	Jul 16, 2025		CodeCode Available	0
CytoSAE: Interpretable Cell Embeddings for Hematology	Jul 16, 2025		CodeCode Available	0
DeltaDiff: Reality-Driven Diffusion with AnchorResiduals for Faithful SR	Jul 16, 2025		CodeCode Available	0
DyG-RAG: Dynamic Graph Retrieval-Augmented Generation with Event-Centric Reasoning	Jul 16, 2025		CodeCode Available	0
Cross-modal Ship Re-Identification via Optical and SAR Imagery: A Novel Dataset and Method	Jul 16, 2025		CodeCode Available	0
TRIQA: Image Quality Assessment by Contrastive Pretraining on Ordered Distortion Triplets	Jul 16, 2025		CodeCode Available	0
The benefits of query-based KGQA systems for complex and temporal questions in LLM era	Jul 16, 2025		CodeCode Available	0
Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification	Jul 16, 2025		CodeCode Available	0
Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs	Jul 16, 2025		CodeCode Available	0
The Evolving Role of Large Language Models in Scientific Innovation: Evaluator, Collaborator, and Scientist	Jul 16, 2025		CodeCode Available	0
AU-Blendshape for Fine-grained Stylized 3D Facial Expression Manipulation	Jul 16, 2025		CodeCode Available	0
BOOKCOREF: Coreference Resolution at Book Scale	Jul 16, 2025		CodeCode Available	0
Out-of-distribution data supervision towards biomedical semantic segmentation	Jul 16, 2025		CodeCode Available	0
RadioDiff-3D: A 3D3D Radio Map Dataset and Generative Diffusion Based Benchmark for 6G Environment-Aware Communication	Jul 16, 2025		CodeCode Available	0
Text-ADBench: Text Anomaly Detection Benchmark based on LLMs Embedding	Jul 16, 2025		CodeCode Available	0
Mono-InternVL-1.5: Towards Cheaper and Faster Monolithic Multimodal Large Language Models	Jul 16, 2025		CodeCode Available	0
MS-DGCNN++: A Multi-Scale Fusion Dynamic Graph Neural Network with Biological Knowledge Integration for LiDAR Tree Species Classification	Jul 16, 2025		CodeCode Available	0
FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale	Jul 16, 2025	Computational EfficiencyGPU	CodeCode Available	3
Second-Order Bounds for [0,1]-Valued Regression via Betting Loss	Jul 16, 2025	Distributional Reinforcement Learningregression	—Unverified	0
A Bayesian Incentive Mechanism for Poison-Resilient Federated Learning	Jul 16, 2025	Data PoisoningFederated Learning	—Unverified	0
YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association	Jul 16, 2025	Multi-Object TrackingObject Tracking	CodeCode Available	0
Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?	Jul 16, 2025	Hyperparameter OptimizationMeta-Learning	—Unverified	0
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training	Jul 16, 2025	Code GenerationMath	—Unverified	0
Information-Theoretic Generalization Bounds of Replay-based Continual Learning	Jul 16, 2025	Continual LearningGeneralization Bounds	—Unverified	0
Non-Adaptive Adversarial Face Generation	Jul 16, 2025	AttributeFace Generation	—Unverified	0
A Privacy-Preserving Framework for Advertising Personalization Incorporating Federated Learning and Differential Privacy	Jul 16, 2025	Anomaly DetectionFederated Learning	—Unverified	0
Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts	Jul 16, 2025	Feature Importanceregression	—Unverified	0
Heat Kernel Goes Topological	Jul 16, 2025	Computational EfficiencyProperty Prediction	—Unverified	0
A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning	Jul 16, 2025	ObjectPoint Cloud Registration	—Unverified	0
Fly, Fail, Fix: Iterative Game Repair with Reinforcement Learning and Large Multimodal Models	Jul 16, 2025	Game DesignReinforcement Learning (RL)	—Unverified	0
Federated Learning in Open- and Closed-Loop EMG Decoding: A Privacy and Performance Perspective	Jul 16, 2025	Federated LearningPrivacy Preserving	—Unverified	0
Safeguarding Federated Learning-based Road Condition Classification	Jul 16, 2025	Autonomous DrivingClassification	—Unverified	0
Ranking Vectors Clustering: Theory and Applications	Jul 16, 2025	Clustering	—Unverified	0
Draw an Ugly Person An Exploration of Generative AIs Perceptions of Ugliness	Jul 16, 2025	Large Language Model	—Unverified	0