The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 9401–9425 of 474278 papers

Title	Date	Status
VLA-RFT: Vision-Language-Action Reinforcement Fine-tuning with Verified Rewards in World Simulators	Oct 1, 2025	—Unverified
LongCodeZip: Compress Long Context for Code Language Models	Oct 1, 2025	—Unverified
Efficient Multi-modal Large Language Models via Progressive Consistency Distillation	Oct 1, 2025	—Unverified
Can World Models Benefit VLMs for World Dynamics?	Oct 1, 2025	—Unverified
QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL	Oct 1, 2025	—Unverified
CurES: From Gradient Analysis to Efficient Curriculum Learning for Reasoning LLMs	Oct 1, 2025	—Unverified
Pay-Per-Search Models are Abstention Models	Oct 1, 2025	—Unverified
TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments	Oct 1, 2025	—Unverified
WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents	Oct 1, 2025	CodeCode Available
Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model	Oct 1, 2025	CodeCode Available
Collaborative-Distilled Diffusion Models (CDDM) for Accelerated and Lightweight Trajectory Prediction	Oct 1, 2025	CodeCode Available
ReSWD: ReSTIR'd, not shaken. Combining Reservoir Sampling and Sliced Wasserstein Distance for Variance Reduction	Oct 1, 2025	—Unverified
MathSticks: A Benchmark for Visual Symbolic Compositional Reasoning with Matchstick Puzzles	Oct 1, 2025	CodeCode Available
DACoN: DINO for Anime Paint Bucket Colorization with Any Number of Reference Images	Oct 1, 2025	CodeCode Available
GIM: Improved Interpretability for Large Language Models	Oct 1, 2025	CodeCode Available
Steering When Necessary: Flexible Steering Large Language Models with Backtracking	Oct 1, 2025	CodeCode Available
CogVLA: Cognition-Aligned Vision-Language-Action Model via Instruction-Driven Routing & Sparsification	Oct 1, 2025	CodeCode Available
Efficient Probabilistic Tensor Networks	Oct 1, 2025	CodeCode Available
Domain-Specialized Interactive Segmentation Framework for Meningioma Radiotherapy Planning	Oct 1, 2025	CodeCode Available
Enhancing Rating Prediction with Off-the-Shelf LLMs Using In-Context User Reviews	Oct 1, 2025	CodeCode Available
Relative-Absolute Fusion: Rethinking Feature Extraction in Image-Based Iterative Method Selection for Solving Sparse Linear Systems	Oct 1, 2025	CodeCode Available
Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum	Oct 1, 2025	CodeCode Available
InfVSR: Breaking Length Limits of Generic Video Super-Resolution	Oct 1, 2025	CodeCode Available
JEPA-T: Joint-Embedding Predictive Architecture with Text Fusion for Image Generation	Oct 1, 2025	CodeCode Available
Multi-Actor Multi-Critic Deep Deterministic Reinforcement Learning with a Novel Q-Ensemble Method	Oct 1, 2025	CodeCode Available