The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7001–7025 of 474278 papers

Title	Date	Status
Conversational LLMs Simplify Secure Clinical Data Access, Understanding, and Analysis	Nov 23, 2025	—Unverified
Health system learning achieves generalist neuroimaging models	Nov 23, 2025	—Unverified
Assessing Historical Structural Oppression Worldwide via Rule-Guided Prompting of Large Language Models	Nov 23, 2025	CodeCode Available
Kitty: Accurate and Efficient 2-bit KV Cache Quantization with Dynamic Channel-wise Precision Boost	Nov 23, 2025	CodeCode Available
Xmodel-2.5: 1.3B Data-Efficient Reasoning SLM	Nov 23, 2025	CodeCode Available
In Search of Goodness: Large Scale Benchmarking of Goodness Functions for the Forward-Forward Algorithm	Nov 23, 2025	CodeCode Available
Prompt Optimization as a State-Space Search Problem	Nov 23, 2025	CodeCode Available
An Analysis of Constraint-Based Multi-Agent Pathfinding Algorithms	Nov 23, 2025	CodeCode Available
End-to-End Visual Autonomous Parking via Control-Aided Attention	Nov 23, 2025	CodeCode Available
Hyperspectral Variational Autoencoders for Joint Data Compression and Component Extraction	Nov 23, 2025	CodeCode Available
A Diffusion Model to Shrink Proteins While Maintaining Their Function	Nov 23, 2025	—Unverified
Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data	Nov 23, 2025	—Unverified
General Agentic Memory Via Deep Research	Nov 23, 2025	—Unverified
VPN: Visual Prompt Navigation	Nov 23, 2025	CodeCode Available
DocPTBench: Benchmarking End-to-End Photographed Document Parsing and Translation	Nov 23, 2025	CodeCode Available
NAF: Zero-Shot Feature Upsampling via Neighborhood Attention Filtering	Nov 23, 2025	CodeCode Available
ReCoGS: Real-time ReColoring for Gaussian Splatting scenes	Nov 23, 2025	CodeCode Available
Towards Robust and Fair Next Visit Diagnosis Prediction under Noisy Clinical Notes with Large Language Models	Nov 23, 2025	CodeCode Available
UPLME: Uncertainty-Aware Probabilistic Language Modelling for Robust Empathy Regression	Nov 23, 2025	CodeCode Available
Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO	Nov 23, 2025	CodeCode Available
HEAL: Learning-Free Source Free Unsupervised Domain Adaptation for Cross-Modality Medical Image Segmentation	Nov 22, 2025	CodeCode Available
AutoHFormer: Efficient Hierarchical Autoregressive Transformer for Time Series Prediction	Nov 22, 2025	CodeCode Available
Matching-Based Few-Shot Semantic Segmentation Models Are Interpretable by Design	Nov 22, 2025	CodeCode Available
Fine-Grained GRPO for Precise Preference Alignment in Flow Models	Nov 22, 2025	—Unverified
Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs	Nov 22, 2025	—Unverified