The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7151–7200 of 661570 papers

Title	Date	Status	Hype
When LLM Judge Scores Look Good but Best-of-N Decisions Fail	Mar 12, 2026	—Unverified	0
Multi-Station WiFi CSI Sensing Framework Robust to Station-wise Feature Missingness and Limited Labeled Data	Mar 12, 2026	—Unverified	0
Fair Learning for Bias Mitigation and Quality Optimization in Paper Recommendation	Mar 12, 2026	—Unverified	0
KnowVal: A Knowledge-Augmented and Value-Guided Autonomous Driving System	Mar 12, 2026	—Unverified	0
Geometry-Aware Probabilistic Circuits via Voronoi Tessellations	Mar 12, 2026	—Unverified	0
RF4D:Neural Radar Fields for Novel View Synthesis in Outdoor Dynamic Scenes	Mar 12, 2026	—Unverified	0
Hope Speech Detection in code-mixed Roman Urdu tweets: A Positive Turn in Natural Language Processing	Mar 12, 2026	—Unverified	0
Adaptive Dual-Constrained Line Aggregation for Robust Generic and Wireframe Line Segment Detection	Mar 12, 2026	—Unverified	0
On the Theoretical Limitations of Embedding-Based Retrieval	Mar 12, 2026	—Unverified	4
Disentangling Slow and Fast Temporal Dynamics in Degradation Inference with Hierarchical Differential Models	Mar 12, 2026	—Unverified	0
ManiVID-3D: Generalizable View-Invariant Reinforcement Learning for Robotic Manipulation via Disentangled 3D Representations	Mar 12, 2026	—Unverified	0
NormGenesis: Multicultural Dialogue Generation via Exemplar-Guided Social Norm Modeling and Violation Recovery	Mar 12, 2026	—Unverified	0
Critique-Coder: Enhancing Coder Models by Critique Reinforcement Learning	Mar 12, 2026	—Unverified	1
Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct	Mar 12, 2026	—Unverified	2
Contrastive Diffusion Guidance for Spatial Inverse Problems	Mar 12, 2026	—Unverified	0
Refereed Learning	Mar 12, 2026	—Unverified	0
ReSplat: Learning Recurrent Gaussian Splatting	Mar 12, 2026	—Unverified	0
Understanding and Optimizing Attention-Based Sparse Matching for Diverse Local Features	Mar 12, 2026	—Unverified	0
DriveCritic: Towards Context-Aware, Human-Aligned Evaluation for Autonomous Driving with Vision-Language Models	Mar 12, 2026	—Unverified	0
See4D: Pose-Free 4D Generation via Auto-Regressive Video Inpainting	Mar 12, 2026	—Unverified	0
FrugalPrompt: Reducing Contextual Overhead in Large Language Models via Token Attribution	Mar 12, 2026	—Unverified	0
A Foundational Theory of Quantitative Abstraction: Adjunctions, Duality, and Logic for Probabilistic Systems	Mar 12, 2026	—Unverified	0
More Than Memory Savings: Zeroth-Order Optimization Mitigates Forgetting in Continual Learning	Mar 12, 2026	—Unverified	0
Belief Dynamics Reveal the Dual Nature of In-Context Learning and Activation Steering	Mar 12, 2026	—Unverified	0
Adaptive Hyperbolic Kernels: Modulated Embedding in de Branges-Rovnyak Spaces	Mar 12, 2026	—Unverified	0
Quality Assurance of LLM-generated Code: Addressing Non-Functional Quality Characteristics	Mar 12, 2026	—Unverified	0
Defending Unauthorized Model Merging via Dual-Stage Weight Protection	Mar 12, 2026	—Unverified	0
Mobile-Agent-RAG: Driving Smart Multi-Agent Coordination with Contextual Knowledge Empowerment for Long-Horizon Mobile Automation	Mar 12, 2026	—Unverified	0
ConCISE: A Reference-Free Conciseness Evaluation Metric for LLM-Generated Answers	Mar 12, 2026	—Unverified	0
Radiative-Structured Neural Operator for Continuous Spectral Super-Resolution	Mar 12, 2026	—Unverified	0
Decoupling Perception from Reasoning for Hallucination-Resistant Video Understanding	Mar 12, 2026	—Unverified	0
Beyond Description: Cognitively Benchmarking Fine-Grained Action for Embodied Agents	Mar 12, 2026	—Unverified	0
Contact-Aware Refinement of Human Pose Pseudo-Ground Truth via Bioimpedance Sensing	Mar 12, 2026	—Unverified	0
LoC-Path: Learning to Compress for Pathology Multimodal Large Language Models	Mar 12, 2026	—Unverified	0
Forests of Uncertaint(r)ees: Using tree-based ensembles to estimate probability distributions of future conflict	Mar 12, 2026	—Unverified	0
Value Under Ignorance in Universal Artificial Intelligence	Mar 12, 2026	—Unverified	0
SDUM: A Scalable Deep Unrolled Model for Universal MRI Reconstruction	Mar 12, 2026	—Unverified	0
Resurfacing Paralinguistic Awareness in Large Audio Language Models	Mar 12, 2026	—Unverified	0
Don't Mind the Gaps: Implicit Neural Representations for Resolution-Agnostic Retinal OCT Analysis	Mar 12, 2026	—Unverified	0
Beyond the Black Box: A Survey on the Theory and Mechanism of Large Language Models	Mar 12, 2026	—Unverified	0
Prompting Underestimates LLM Capability for Time Series Classification	Mar 12, 2026	—Unverified	0
Provably Finding a Hidden Dense Submatrix among Many Planted Dense Submatrices via Convex Programming	Mar 12, 2026	—Unverified	0
LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models	Mar 12, 2026	—Unverified	0
Learning Through Dialogue: Engagement and Efficacy Matter More Than Explanations	Mar 12, 2026	—Unverified	0
PosIR: Position-Aware Heterogeneous Information Retrieval Benchmark	Mar 12, 2026	—Unverified	0
Energy-Aware Metaheuristics	Mar 12, 2026	—Unverified	0
A Learnable Wavelet Transformer for Long-Short Equity Trading and Risk-Adjusted Return Optimization	Mar 12, 2026	—Unverified	0
Do LLMs Truly Benefit from Longer Context in Automatic Post-Editing?	Mar 12, 2026	—Unverified	0
Generating a Paracosm for Training-Free Zero-Shot Composed Image Retrieval	Mar 12, 2026	—Unverified	0
BLOCK: An Open-Source Bi-Stage MLLM Character-to-Skin Pipeline for Minecraft	Mar 12, 2026	—Unverified	0