The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6426–6450 of 474278 papers

Title	Date	Status
Beyond the Exploration-Exploitation Trade-off: A Hidden State Approach for LLM Reasoning in RLVR	Dec 4, 2025	—Unverified
HUME: Measuring the Human-Model Performance Gap in Text Embedding Tasks	Dec 4, 2025	—Unverified
Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction	Dec 4, 2025	—Unverified
ShadowDraw: From Any Object to Shadow-Drawing Compositional Art	Dec 4, 2025	—Unverified
Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning	Dec 4, 2025	CodeCode Available
TTRV: Test-Time Reinforcement Learning for Vision Language Models	Dec 4, 2025	—Unverified
EoS-FM: Can an Ensemble of Specialist Models act as a Generalist Feature Extractor?	Dec 4, 2025	CodeCode Available
ViDiC: Video Difference Captioning	Dec 4, 2025	—Unverified
In-Context Representation Hijacking	Dec 4, 2025	—Unverified
FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring	Dec 4, 2025	—Unverified
X-Humanoid: Robotize Human Videos to Generate Humanoid Videos at Scale	Dec 4, 2025	—Unverified
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates	Dec 4, 2025	—Unverified
LiteVGGT: Boosting Vanilla VGGT via Geometry-aware Cached Token Merging	Dec 4, 2025	—Unverified
Aligned but Stereotypical? The Hidden Influence of System Prompts on Social Bias in LVLM-Based Text-to-Image Models	Dec 4, 2025	—Unverified
Reflection Removal through Efficient Adaptation of Diffusion Transformers	Dec 4, 2025	—Unverified
Deep Forcing: Training-Free Long Video Generation with Deep Sink and Participative Compression	Dec 4, 2025	—Unverified
ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning	Dec 4, 2025	—Unverified
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation	Dec 4, 2025	—Unverified
Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model	Dec 4, 2025	—Unverified
Inferring Compositional 4D Scenes without Ever Seeing One	Dec 4, 2025	—Unverified
The SAM2-to-SAM3 Gap in the Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation	Dec 4, 2025	—Unverified
Flowing Backwards: Improving Normalizing Flows via Reverse Representation Alignment	Dec 4, 2025	CodeCode Available
Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models	Dec 4, 2025	CodeCode Available
Ground-Truth Subgraphs for Better Training and Evaluation of Knowledge Graph Augmented LLMs	Dec 4, 2025	—Unverified
A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA)	Dec 4, 2025	CodeCode Available