The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 4,801–4,825 of 661,570 papers

Title	Date	Status
From Natural Language to Executable Option Strategies via Large Language Models	Mar 17, 2026	Unverified
Tabular LLMs for Interpretable Few-Shot Alzheimer's Disease Prediction with Multimodal Biomedical Data	Mar 17, 2026	Code Available
Ethical Fairness without Demographics in Human-Centered AI	Mar 17, 2026	Unverified
The Cost of Reasoning: Chain-of-Thought Induces Overconfidence in Vision-Language Models	Mar 17, 2026	Unverified
Incongruent Positivity: When Miscalibrated Positivity Undermines Online Supportive Conversations	Mar 17, 2026	Unverified
Do Understanding and Generation Fight? A Diagnostic Study of DPO for Unified Multimodal Models	Mar 17, 2026	Unverified
SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding	Mar 17, 2026	Unverified
LUMINA: A Multi-Vendor Mammography Benchmark with Energy Harmonization Protocol	Mar 17, 2026	Unverified
Anonymous-by-Construction: An LLM-Driven Framework for Privacy-Preserving Text	Mar 17, 2026	Unverified
When the City Teaches the Car: Label-Free 3D Perception from Infrastructure	Mar 17, 2026	Unverified
Automated identification of Ichneumonoidea wasps via YOLO-based deep learning: Integrating HiresCam for Explainable AI	Mar 17, 2026	Unverified
Transformer-Encoder Trees for Efficient Multilingual Machine Translation and Speech Translation	Mar 17, 2026	Unverified
A Scalable Approach to Solving Simulation-Based Network Security Games	Mar 17, 2026	Unverified
Semantic One-Dimensional Tokenizer for Image Reconstruction and Generation	Mar 17, 2026	Unverified
Over-the-air White-box Attack on the Wav2Vec Speech Recognition Neural Network	Mar 17, 2026	Unverified
Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection	Mar 17, 2026	Unverified
CircuitBuilder: From Polynomials to Circuits via Reinforcement Learning	Mar 17, 2026	Unverified
ProgressiveAvatars: Progressive Animatable 3D Gaussian Avatars	Mar 17, 2026	Unverified
Data-driven generalized perimeter control: Zürich case study	Mar 17, 2026	Unverified
Can LLMs Detect Their Confabulations? Estimating Reliability in Uncertainty-Aware Language Models	Mar 17, 2026	Unverified
Transformers can do Bayesian Clustering	Mar 17, 2026	Unverified
Knowing What You Cannot Explain: Learning to Reject Low-Quality Explanations	Mar 17, 2026	Unverified
EdiVal-Agent: An Object-Centric Framework for Automated, Fine-Grained Evaluation of Multi-Turn Editing	Mar 17, 2026	Unverified
Accurate Shift Invariant Convolutional Neural Networks Using Gaussian-Hermite Moments	Mar 17, 2026	Unverified
Patient4D: Temporally Consistent Patient Body Mesh Recovery from Monocular Operating Room Video	Mar 17, 2026	Unverified