The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 2801–2850 of 659,983 papers

Title	Date	Status
TxSum: User-Centered Ethereum Transaction Understanding with Micro-Level Semantic Grounding	Mar 18, 2026	Unverified
KeyframeFace: Language-Driven Facial Animation via Semantic Keyframes	Mar 18, 2026	Unverified
Speculative Decoding: Performance or Illusion?	Mar 18, 2026	Unverified
A Comprehensive Benchmark of Histopathology Foundation Models for Kidney Digital Pathology Images	Mar 18, 2026	Unverified
Trajectory-Optimized Time Reparameterization for Learning-Compatible Reduced-Order Modeling of Stiff Dynamical Systems	Mar 18, 2026	Unverified
When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education	Mar 18, 2026	Unverified
DexGrasp-Zero: A Morphology-Aligned Policy for Zero-Shot Cross-Embodiment Dexterous Grasping	Mar 18, 2026	Unverified
ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling	Mar 18, 2026	Unverified
VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm	Mar 18, 2026	Unverified
FrescoDiffusion: 4K Image-to-Video with Prior-Regularized Tiled Diffusion	Mar 18, 2026	Unverified
KA2L: A Knowledge-Aware Active Learning Framework for LLMs	Mar 18, 2026	Unverified
ReLaGS: Relational Language Gaussian Splatting	Mar 18, 2026	Unverified
Part-Aware Open-Vocabulary 3D Affordance Grounding via Prototypical Semantic and Geometric Alignment	Mar 18, 2026	Unverified
Governed Memory: A Production Architecture for Multi-Agent Workflows	Mar 18, 2026	Unverified
Feeling the Space: Egomotion-Aware Video Representation for Efficient and Accurate 3D Scene Understanding	Mar 18, 2026	Unverified
"I'm Not Reading All of That": Understanding Software Engineers' Level of Cognitive Engagement with Agentic Coding Assistants	Mar 18, 2026	Unverified
HarmMetric Eval: Benchmarking Metrics and Judges for LLM Harmfulness Assessment	Mar 18, 2026	Unverified
Interpretable Context Methodology: Folder Structure as Agentic Architecture	Mar 18, 2026	Unverified
Simple Additions, Substantial Gains: Expanding Scripts, Languages, and Lineage Coverage in URIEL+	Mar 18, 2026	Unverified
CoT-PL: Chain-of-Thought Pseudo-Labeling for Open-Vocabulary Object Detection	Mar 18, 2026	Code Available
Deep learning and the rate of approximation by flows	Mar 18, 2026	Unverified
Hyperparameter Trajectory Inference with Conditional Lagrangian Optimal Transport	Mar 18, 2026	Unverified
AR-Flow VAE: A Structured Autoregressive Flow Prior Variational Autoencoder for Unsupervised Blind Source Separation	Mar 18, 2026	Unverified
The Comprehension-Gated Agent Economy: A Robustness-First Architecture for AI Economic Agency	Mar 18, 2026	Unverified
World Reconstruction From Inconsistent Views	Mar 18, 2026	Unverified
Neural Pushforward Samplers for the Fokker-Planck Equation on Embedded Riemannian Manifolds	Mar 18, 2026	Unverified
Attention-guided Evidence Grounding for Spoken Question Answering	Mar 18, 2026	Unverified
Explanations Go Linear: Post-hoc Explainability for Tabular Data with Interpretable Meta-Encoding	Mar 18, 2026	Unverified
Hebbian Physics Networks: A Self-Organizing Computational Architecture Based on Local Physical Laws	Mar 18, 2026	Unverified
ReviewScore: Misinformed Peer Review Detection with Large Language Models	Mar 18, 2026	Unverified
On the identifiability of causal graphs with multiple environments	Mar 18, 2026	Unverified
Provably Safe Model Updates	Mar 18, 2026	Unverified
Efficient Cross-Domain Offline Reinforcement Learning with Dynamics- and Value-Aligned Data Filtering	Mar 18, 2026	Unverified
The Moralization Corpus: Frame-Based Annotation and Analysis of Moralizing Speech Acts across Diverse Text Genres	Mar 18, 2026	Unverified
Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning	Mar 18, 2026	Unverified
Global Optimization By Gradient From Hierarchical Score-Matching Spaces	Mar 18, 2026	Unverified
Federated Causal Representation Learning in State-Space Systems for Decentralized Counterfactual Reasoning	Mar 18, 2026	Unverified
CogGen: Cognitive-Load-Informed Fully Unsupervised Deep Generative Modeling for Compressively Sampled MRI Reconstruction	Mar 18, 2026	Unverified
LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis	Mar 18, 2026	Unverified
Event-Driven Video Generation	Mar 18, 2026	Unverified
Next-Frame Decoding for Ultra-Low-Bitrate Image Compression with Video Diffusion Priors	Mar 18, 2026	Unverified
NV-Bench: Benchmark of Nonverbal Vocalization Synthesis for Expressive Text-to-Speech Generation	Mar 18, 2026	Unverified
EngGPT2: Sovereign, Efficient and Open Intelligence	Mar 18, 2026	Unverified
TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas	Mar 18, 2026	Unverified
HGP-Mamba: Integrating Histology and Generated Protein Features for Mamba-based Multimodal Survival Risk Prediction	Mar 18, 2026	Code Available
Draft-and-Prune: Improving the Reliability of Auto-formalization for Logical Reasoning	Mar 18, 2026	Unverified
ConfusionBench: An Expert-Validated Benchmark for Confusion Recognition and Localization in Educational Videos	Mar 18, 2026	Unverified
Directing the Narrative: A Finetuning Method for Controlling Coherence and Style in Story Generation	Mar 18, 2026	Unverified
Embedding World Knowledge into Tabular Models: Towards Best Practices for Embedding Pipeline Design	Mar 18, 2026	Unverified
Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing	Mar 18, 2026	Unverified