The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7901–7925 of 474278 papers

Title	Date	Status
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems	Nov 1, 2025	—Unverified
PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity	Nov 1, 2025	—Unverified
Kimi Linear: An Expressive, Efficient Attention Architecture	Nov 1, 2025	—Unverified
Reject Only Critical Tokens: Pivot-Aware Speculative Decoding	Nov 1, 2025	CodeCode Available
Diff4Splat: Controllable 4D Scene Generation with Latent Dynamic Reconstruction Models	Nov 1, 2025	—Unverified
GDROS: A Geometry-Guided Dense Registration Framework for Optical-SAR Images under Large Geometric Transformations	Nov 1, 2025	CodeCode Available
Chain of Retrieval: Multi-Aspect Iterative Search Expansion and Post-Order Search Aggregation for Full Paper Retrieval	Nov 1, 2025	CodeCode Available
RL Fine-Tuning Heals OOD Forgetting in SFT	Nov 1, 2025	CodeCode Available
Enhancing Adversarial Transferability by Balancing Exploration and Exploitation with Gradient-Guided Sampling	Nov 1, 2025	CodeCode Available
Friend or Foe: How LLMs' Safety Mind Gets Fooled by Intent Shift Attack	Nov 1, 2025	CodeCode Available
Enhancing Heavy Rain Nowcasting with Multimodal Data: Integrating Radar and Satellite Observations	Nov 1, 2025	CodeCode Available
Emotion Detection in Speech Using Lightweight and Transformer-Based Models: A Comparative and Ablation Study	Nov 1, 2025	CodeCode Available
MIFO: Learning and Synthesizing Multi-Instance from One Image	Nov 1, 2025	CodeCode Available
MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools	Nov 1, 2025	CodeCode Available
OSMGen: Highly Controllable Satellite Image Synthesis using OpenStreetMap Data	Nov 1, 2025	CodeCode Available
PADBen: A Comprehensive Benchmark for Evaluating AI Text Detectors Against Paraphrase Attacks	Nov 1, 2025	CodeCode Available
ToxicTextCLIP: Text-Based Poisoning and Backdoor Attacks on CLIP Pre-training	Nov 1, 2025	CodeCode Available
Why Federated Optimization Fails to Achieve Perfect Fitting? A Theoretical Perspective on Client-Side Optima	Nov 1, 2025	CodeCode Available
FedMGP: Personalized Federated Learning with Multi-Group Text-Visual Prompts	Nov 1, 2025	CodeCode Available
ToM: Leveraging Tree-oriented MapReduce for Long-Context Reasoning in Large Language Models	Nov 1, 2025	CodeCode Available
Three-dimensional narrow volume reconstruction method with unconditional stability based on a phase-field Lagrange multiplier approach	Nov 1, 2025	CodeCode Available
Foundation Models for Trajectory Planning in Autonomous Driving: A Review of Progress and Open Challenges	Oct 31, 2025	CodeCode Available
DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models	Oct 31, 2025	—Unverified
ConnectomeBench: Can LLMs Proofread the Connectome?	Oct 31, 2025	CodeCode Available
H2-Cache: A Novel Hierarchical Dual-Stage Cache for High-Performance Acceleration of Generative Diffusion Models	Oct 31, 2025	CodeCode Available