The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5501–5525 of 661570 papers

Title	Date	Status	Hype
Exposing Hidden Biases in Text-to-Image Models via Automated Prompt Search	Mar 16, 2026	—Unverified	0
Sharing State Between Prompts and Programs	Mar 16, 2026	—Unverified	1
Token-Level LLM Collaboration via FusionRoute	Mar 16, 2026	—Unverified	0
Time-Annealed Perturbation Sampling: Diverse Generation for Diffusion Language Models	Mar 16, 2026	—Unverified	0
On Theoretically-Driven LLM Agents for Multi-Dimensional Discourse Analysis	Mar 16, 2026	—Unverified	0
Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning	Mar 16, 2026	—Unverified	0
MAWARITH: A Dataset and Benchmark for Legal Inheritance Reasoning with LLMs	Mar 16, 2026	—Unverified	0
Beyond Polarity: Multi-Dimensional LLM Sentiment Signals for WTI Crude Oil Futures Return Prediction	Mar 16, 2026	—Unverified	0
Overcoming the Modality Gap in Context-Aided Forecasting	Mar 16, 2026	—Unverified	0
BrainBench: Exposing the Commonsense Reasoning Gap in Large Language Models	Mar 16, 2026	—Unverified	0
Transition Flow Matching	Mar 16, 2026	—Unverified	0
Loosely-Structured Software: Engineering Context, Structure, and Evolution Entropy in Runtime-Rewired Multi-Agent Systems	Mar 16, 2026	—Unverified	0
Tackling Over-smoothing on Hypergraphs: A Ricci Flow-guided Neural Diffusion Approach	Mar 16, 2026	—Unverified	0
LLM-Driven Discovery of High-Entropy Catalysts via Retrieval-Augmented Generation	Mar 16, 2026	—Unverified	0
Embedding-Aware Feature Discovery: Bridging Latent Representations and Interpretable Features in Event Sequences	Mar 16, 2026	—Unverified	0
Meta-TTRL: A Metacognitive Framework for Self-Improving Test-Time Reinforcement Learning in Unified Multimodal Models	Mar 16, 2026	—Unverified	0
S2Act: Simple Spiking Actor	Mar 16, 2026	—Unverified	0
ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems	Mar 16, 2026	—Unverified	0
You've Got a Golden Ticket: Improving Generative Robot Policies With A Single Noise Vector	Mar 16, 2026	—Unverified	0
Simulation Distillation: Pretraining World Models in Simulation for Rapid Real-World Adaptation	Mar 16, 2026	—Unverified	0
CorrectionPlanner: Self-Correction Planner with Reinforcement Learning in Autonomous Driving	Mar 16, 2026	—Unverified	0
Domain Adaptation Without the Compute Burden for Efficient Whole Slide Image Analysis	Mar 16, 2026	—Unverified	0
Parallelised Differentiable Straightest Geodesics for 3D Meshes	Mar 16, 2026	—Unverified	0
Evolving Contextual Safety in Multi-Modal Large Language Models via Inference-Time Self-Reflective Memory	Mar 16, 2026	—Unverified	0
Mask Is What DLLM Needs: A Masked Data Training Paradigm for Diffusion LLMs	Mar 16, 2026	—Unverified	0