The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7951–8000 of 661570 papers

Title	Date	Status	Hype
UniCom: Unified Multimodal Modeling via Compressed Continuous Semantic Representations	Mar 11, 2026	—Unverified	0
WalkGPT: Grounded Vision-Language Conversation with Depth-Aware Segmentation for Pedestrian Navigation	Mar 11, 2026	—Unverified	0
Large Language Models as Annotators for Machine Translation Quality Estimation	Mar 11, 2026	—Unverified	0
eLasmobranc Dataset: An Image Dataset for Elasmobranch Species Recognition and Biodiversity Monitoring	Mar 11, 2026	—Unverified	0
CacheSolidarity: Preventing Prefix Caching Side Channels in Multi-tenant LLM Serving Systems	Mar 11, 2026	—Unverified	0
Event-based Photometric Stereo via Rotating Illumination and Per-Pixel Learning	Mar 11, 2026	—Unverified	0
Deep Randomized Distributed Function Computation (DeepRDFC): Neural Distributed Channel Simulation	Mar 11, 2026	—Unverified	0
A PUF-Based Approach for Copy Protection of Intellectual Property in Neural Network Models	Mar 11, 2026	—Unverified	0
Prioritizing Gradient Sign Over Modulus: An Importance-Aware Framework for Wireless Federated Learning	Mar 11, 2026	—Unverified	0
Phase-Interface Instance Segmentation as a Visual Sensor for Laboratory Process Monitoring	Mar 11, 2026	—Unverified	0
Interpretable Chinese Metaphor Identification via LLM-Assisted MIPVU Rule Script Generation: A Comparative Protocol Study	Mar 11, 2026	—Unverified	0
PolGS++: Physically-Guided Polarimetric Gaussian Splatting for Fast Reflective Surface Reconstruction	Mar 11, 2026	—Unverified	0
Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services	Mar 11, 2026	—Unverified	0
Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization	Mar 11, 2026	—Unverified	0
Evaluating randomized smoothing as a defense against adversarial attacks in trajectory prediction	Mar 11, 2026	—Unverified	0
ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning	Mar 11, 2026	—Unverified	0
A dataset of medication images with instance segmentation masks for preventing adverse drug events	Mar 11, 2026	—Unverified	0
BALD-SAM: Disagreement-based Active Prompting in Interactive Segmentation	Mar 11, 2026	—Unverified	0
PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words	Mar 11, 2026	—Unverified	0
Human Presence Detection via Wi-Fi Range-Filtered Doppler Spectrum on Commodity Laptops	Mar 11, 2026	—Unverified	0
Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis	Mar 11, 2026	—Unverified	0
From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers	Mar 11, 2026	—Unverified	0
Semantic Landmark Particle Filter for Robot Localisation in Vineyards	Mar 11, 2026	—Unverified	0
V_0.5: Generalist Value Model as a Prior for Sparse RL Rollouts	Mar 11, 2026	—Unverified	0
SiDiaC-v.2.0: Sinhala Diachronic Corpus Version 2.0	Mar 11, 2026	—Unverified	0
SNPgen: Phenotype-Supervised Genotype Representation and Synthetic Data Generation via Latent Diffusion	Mar 11, 2026	—Unverified	0
Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models	Mar 11, 2026	—Unverified	0
A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification	Mar 11, 2026	—Unverified	0
When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS	Mar 11, 2026	—Unverified	0
ECoLAD: Deployment-Oriented Evaluation for Automotive Time-Series Anomaly Detection	Mar 11, 2026	—Unverified	0
Bridging the Skill Gap in Clinical CBCT Interpretation with CBCTRepD	Mar 11, 2026	—Unverified	0
LLM2Vec-Gen: Generative Embeddings from Large Language Models	Mar 11, 2026	—Unverified	2
Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control	Mar 11, 2026	—Unverified	0
Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators	Mar 11, 2026	CodeCode Available	0
When should we trust the annotation? Selective prediction for molecular structure retrieval from mass spectra	Mar 11, 2026	—Unverified	0
Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals	Mar 11, 2026	—Unverified	0
Pointy - A Lightweight Transformer for Point Cloud Foundation Models	Mar 11, 2026	CodeCode Available	0
Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation	Mar 11, 2026	—Unverified	0
Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style	Mar 11, 2026	—Unverified	0
GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations	Mar 11, 2026	—Unverified	0
ForwardFlow: Simulation only statistical inference using deep learning	Mar 11, 2026	—Unverified	0
The Discrete Charm of the MLP: Binary Routing of Continuous Signals in Transformer Feed-Forward Layers	Mar 11, 2026	—Unverified	0
Understanding Parents' Desires in Moderating Children's Interactions with GenAI Chatbots through LLM-Generated Probes	Mar 11, 2026	—Unverified	0
MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems	Mar 11, 2026	—Unverified	0
Too Vivid to Be Real? Benchmarking and Calibrating Generative Color Fidelity	Mar 11, 2026	CodeCode Available	0
Artificial Intelligence as a Catalyst for Innovation in Software Engineering	Mar 11, 2026	—Unverified	0
Leech Lattice Vector Quantization for Efficient LLM Compression	Mar 11, 2026	—Unverified	0
Factorized Neural Implicit DMD for Parametric Dynamics	Mar 11, 2026	—Unverified	0
Cross-Species Transfer Learning for Electrophysiology-to-Transcriptomics Mapping in Cortical GABAergic Interneurons	Mar 11, 2026	—Unverified	0
RCTs & Human Uplift Studies: Methodological Challenges and Practical Solutions for Frontier AI Evaluation	Mar 11, 2026	—Unverified	0