The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers


Showing 8151–8200 of 661,570 papers

Title	Date	Status	Hype
Federated Active Learning Under Extreme Non-IID and Global Class Imbalance	Mar 11, 2026	Code Available	0
On the Learning Dynamics of Two-layer Linear Networks with Label Noise SGD	Mar 11, 2026	Code Available	0
Sparse Task Vector Mixup with Hypernetworks for Efficient Knowledge Transfer in Whole-Slide Image Prognosis	Mar 11, 2026	Code Available	0
Reinforcement Learning with Conditional Expectation Reward	Mar 11, 2026	Code Available	0
CodePercept: Code-Grounded Visual STEM Perception for MLLMs	Mar 11, 2026	Code Available	0
Guiding Diffusion Models with Semantically Degraded Conditions	Mar 11, 2026	Code Available	0
Ranking Reasoning LLMs under Test-Time Scaling	Mar 11, 2026	Code Available	0
DNS-GT: A Graph-based Transformer Approach to Learn Embeddings of Domain Names from DNS Queries	Mar 11, 2026	Code Available	0
Benchmarking Graph Neural Networks in Solving Hard Constraint Satisfaction Problems	Mar 11, 2026	Code Available	0
Bilevel Layer-Positioning LoRA for Real Image Dehazing	Mar 11, 2026	Code Available	0
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation	Mar 11, 2026	Code Available	0
CUPID: A Plug-in Framework for Joint Aleatoric and Epistemic Uncertainty Estimation with a Single Model	Mar 11, 2026	Code Available	0
Protein Counterfactuals via Diffusion-Guided Latent Optimization	Mar 11, 2026	Code Available	0
ZACH-ViT: Regime-Dependent Inductive Bias in Compact Vision Transformers for Medical Imaging	Mar 11, 2026	Code Available	0
Shadow in the Cache: Unveiling and Mitigating Privacy Risks of KV-cache in LLM Inference	Mar 11, 2026	Unverified	1
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance	Mar 11, 2026	Unverified	1
LaTeXTrans: Structured LaTeX Translation with Multi-Agent Coordination	Mar 11, 2026	Unverified	3
Efficient Audio-Visual Speech Separation with Discrete Lip Semantics and Multi-Scale Global-Local Attention	Mar 11, 2026	Unverified	2
CostNav: A Navigation Benchmark for Real-World Economic-Cost Evaluation of Physical AI Agents	Mar 11, 2026	Code Available	0
Sample-and-Search: An Effective Algorithm for Learning-Augmented k-Median Clustering in High dimensions	Mar 11, 2026	Unverified	0
Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge	Mar 11, 2026	Unverified	0
FRIEND: Federated Learning for Joint Optimization of multi-RIS Configuration and Eavesdropper Intelligent Detection in B5G Networks	Mar 11, 2026	Unverified	0
Operationalizing Perceptions of Agent Gender: Foundations and Guidelines	Mar 10, 2026	Unverified	0
LITTA: Late-Interaction and Test-Time Alignment for Visually-Grounded Multimodal Retrieval	Mar 10, 2026	Unverified	0
Decoding the decoder: Contextual sequence-to-sequence modeling for intracortical speech decoding	Mar 10, 2026	Unverified	0
Stability of AI Governance Systems: A Coupled Dynamics Model of Public Trust and Social Disruptions	Mar 10, 2026	Unverified	0
Developing Machine Learning-Based Watch-to-Warning Severe Weather Guidance from the Warn-on-Forecast System	Mar 10, 2026	Unverified	0
A Visualization for Comparative Analysis of Regression Models	Mar 10, 2026	Unverified	0
Automatic Analysis of Collaboration Through Human Conversational Data Resources: A Review	Mar 10, 2026	Unverified	0
Maximizing mutual information between user-contexts and responses improve LLM personalization with no additional data	Mar 10, 2026	Unverified	0
LLM-MRD: LLM-Guided Multi-View Reasoning Distillation for Fake News Detection	Mar 10, 2026	Code Available	0
Semantic Chameleon: Corpus-Dependent Poisoning Attacks and Defenses in RAG Systems	Mar 10, 2026	Unverified	0
Quantizer-Aware Hierarchical Neural Codec Modeling for Speech Deepfake Detection	Mar 10, 2026	Unverified	0
Privacy and Safety Experiences and Concerns of U.S. Women Using Generative AI for Seeking Sexual and Reproductive Health Information	Mar 10, 2026	Unverified	0
HoloByte: Continuous Hyperspherical Distillation for Tokenizer-Free Modeling	Mar 10, 2026	Code Available	0
OrthoAI v2: From Single-Agent Segmentation to Dual-Agent Treatment Planning for Clear Aligners	Mar 10, 2026	Unverified	0
Quantum Amplitude Estimation for Catastrophe Insurance Tail-Risk Pricing: Empirical Convergence and NISQ Noise Analysis	Mar 10, 2026	Unverified	0
OpenClaw-RL: Train Any Agent Simply by Talking	Mar 10, 2026	Code Available	0
Enhancing Reconstruction Capability of Wavelet Transform Amorphous Radial Distribution Function via Machine Learning Assisted Parameter Tuning	Mar 10, 2026	Unverified	0
Geometry-Aware Semantic Reasoning for Training Free Video Anomaly Detection	Mar 10, 2026	Unverified	0
InfiniteDance: Scalable 3D Dance Generation Towards in-the-wild Generalization	Mar 10, 2026	Unverified	0
A Computer-aided Framework for Detecting Osteosarcoma in Computed Tomography Scans	Mar 10, 2026	Unverified	0
Deep Learning for BioImaging: What Are We Learning?	Mar 10, 2026	Unverified	0
Do Large Language Models Get Caught in Hofstadter-Mobius Loops?	Mar 10, 2026	Unverified	0
A Hierarchical End-of-Turn Model with Primary Speaker Segmentation for Real-Time Conversational AI	Mar 10, 2026	Unverified	0
FusionNet: a frame interpolation network for 4D heart models	Mar 10, 2026	Code Available	0
Detecting Miscitation on the Scholarly Web through LLM-Augmented Text-Rich Graph Learning	Mar 10, 2026	Unverified	0
GPU-Accelerated Genetic Programming for Symbolic Regression with Beagle Framework	Mar 10, 2026	Unverified	0
A Causal Graph Approach to Oppositional Narrative Analysis	Mar 10, 2026	Unverified	0
Learning Bayesian and Markov Networks with an Unreliable Oracle	Mar 10, 2026	Unverified	0