The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6476–6500 of 474278 papers

Title	Date	Status
Soft Decision Tree classifier: explainable and extendable PyTorch implementation	Dec 3, 2025	CodeCode Available
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle	Dec 3, 2025	—Unverified
SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL	Dec 3, 2025	—Unverified
CoDA: From Text-to-Image Diffusion Models to Training-Free Dataset Distillation	Dec 3, 2025	CodeCode Available
InteractiveOmni: A Unified Omni-modal Model for Audio-Visual Multi-turn Dialogue	Dec 3, 2025	—Unverified
DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling	Dec 3, 2025	—Unverified
Divide, then Ground: Adapting Frame Selection to Query Types for Long-Form Video Understanding	Dec 3, 2025	—Unverified
PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation	Dec 3, 2025	—Unverified
SkillFactory: Self-Distillation For Learning Cognitive Behaviors	Dec 3, 2025	—Unverified
PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design	Dec 3, 2025	—Unverified
Thinking with Programming Vision: Towards a Unified View for Thinking with Images	Dec 3, 2025	CodeCode Available
Look Around and Pay Attention: Multi-camera Point Tracking Reimagined with Transformers	Dec 3, 2025	CodeCode Available
Addressing Logical Fallacies In Scientific Reasoning From Large Language Models: Towards a Dual-Inference Training Framework	Dec 3, 2025	CodeCode Available
Context-Aware Hierarchical Learning: A Two-Step Paradigm towards Safer LLMs	Dec 3, 2025	CodeCode Available
Principled RL for Diffusion LLMs Emerges from a Sequence-Level Perspective	Dec 3, 2025	CodeCode Available
Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization	Dec 3, 2025	CodeCode Available
DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment	Dec 3, 2025	CodeCode Available
Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation	Dec 3, 2025	CodeCode Available
LoRA Patching: Exposing the Fragility of Proactive Defenses against Deepfakes	Dec 3, 2025	CodeCode Available
Heatmap Pooling Network for Action Recognition from RGB Videos	Dec 3, 2025	CodeCode Available
Score Distillation of Flow Matching Models	Dec 3, 2025	—Unverified
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation	Dec 3, 2025	—Unverified
OneThinker: All-in-one Reasoning Model for Image and Video	Dec 3, 2025	—Unverified
CartoMapQA: A Fundamental Benchmark Dataset Evaluating Vision-Language Models on Cartographic Map Understanding	Dec 3, 2025	CodeCode Available
Different types of syntactic agreement recruit the same units within large language models	Dec 3, 2025	—Unverified