The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7,976–8,000 of 474,278 papers

Title	Date	Status
MM-OPERA: Benchmarking Open-ended Association Reasoning for Large Vision-Language Models	Oct 30, 2025	Code Available
Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench	Oct 30, 2025	Unverified
Scaling Tractable Probabilistic Circuits: A Systems Perspective	Oct 30, 2025	Code Available
Incremental Human-Object Interaction Detection with Invariant Relation Representation Learning	Oct 30, 2025	Code Available
SpinalSAM-R1: A Vision-Language Multimodal Interactive System for Spine CT Segmentation	Oct 30, 2025	Code Available
Loquetier: A Virtualized Multi-LoRA Framework for Unified LLM Fine-tuning and Serving	Oct 30, 2025	Code Available
The Denario project: Deep knowledge AI agents for scientific discovery	Oct 30, 2025	Code Available
Cross-view Localization and Synthesis -- Datasets, Challenges and Opportunities	Oct 30, 2025	Code Available
D-HUMOR: Dark Humor Understanding via Multimodal Open-ended Reasoning -- A Benchmark Dataset and Method	Oct 30, 2025	Code Available
Defeating the Training-Inference Mismatch via FP16	Oct 30, 2025	Unverified
From One to More: Contextual Part Latents for 3D Generation	Oct 30, 2025	Unverified
Locality in Image Diffusion Models Emerges from Data Statistics	Oct 30, 2025	Unverified
UNO-Bench: A Unified Benchmark for Exploring the Compositional Law Between Uni-modal and Omni-modal in Omni Models	Oct 30, 2025	Unverified
Multi-Agent Evolve: LLM Self-Improve through Co-evolution	Oct 30, 2025	Unverified
Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs	Oct 30, 2025	Code Available
EgoExo-Con: Exploring View-Invariant Video Temporal Understanding	Oct 30, 2025	Unverified
FullPart: Generating each 3D Part at Full Resolution	Oct 30, 2025	Unverified
Spiking Patches: Asynchronous, Sparse, and Efficient Tokens for Event Cameras	Oct 30, 2025	Unverified
AMO-Bench: Large Language Models Still Struggle in High School Math Competitions	Oct 30, 2025	Unverified
OmniX: From Unified Panoramic Generation and Perception to Graphics-Ready 3D Scenes	Oct 30, 2025	Unverified
Are Video Models Ready as Zero-Shot Reasoners? An Empirical Study with the MME-CoF Benchmark	Oct 30, 2025	Unverified
The Quest for Generalizable Motion Generation: Data, Model, and Evaluation	Oct 30, 2025	Unverified
C-LoRA: Contextual Low-Rank Adaptation for Uncertainty Estimation in Large Language Models	Oct 30, 2025	Code Available
Smoothing Slot Attention Iterations and Recurrences	Oct 30, 2025	Code Available
Holographic Transformers for Complex-Valued Signal Processing: Integrating Phase Interference into Self-Attention	Oct 30, 2025	Code Available