The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 7501–7550 of 661,570 papers

Title	Date	Status
DeepHistoViT: An Interpretable Vision Transformer Framework for Histopathological Cancer Classification	Mar 12, 2026	Unverified
GPT4o-Receipt: A Dataset and Human Study for AI-Generated Document Forensics	Mar 12, 2026	Unverified
MDS-VQA: Model-Informed Data Selection for Video Quality Assessment	Mar 12, 2026	Unverified
CFD-HAR: User-controllable Privacy through Conditional Feature Disentanglement	Mar 12, 2026	Unverified
MANSION: Multi-floor lANguage-to-3D Scene generatIOn for loNg-horizon tasks	Mar 12, 2026	Unverified
Streaming Translation and Transcription Through Speech-to-Text Causal Alignment	Mar 12, 2026	Unverified
EnTransformer: A Deep Generative Transformer for Multivariate Probabilistic Forecasting	Mar 12, 2026	Unverified
InSpatio-WorldFM: An Open-Source Real-Time Generative Frame Model	Mar 12, 2026	Unverified
ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models	Mar 12, 2026	Unverified
RDNet: Region Proportion-Aware Dynamic Adaptive Salient Object Detection Network in Optical Remote Sensing Images	Mar 12, 2026	Unverified
Triple X: A LLM-Based Multilingual Speech Recognition System for the INTERSPEECH2025 MLC-SLM Challenge	Mar 12, 2026	Unverified
What do near-optimal learning rate schedules look like?	Mar 12, 2026	Unverified
Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency	Mar 12, 2026	Unverified
A Geometrically-Grounded Drive for MDL-Based Optimization in Deep Learning	Mar 12, 2026	Unverified
VQQA: An Agentic Approach for Video Evaluation and Quality Improvement	Mar 12, 2026	Unverified
Pruning-induced phases in fully-connected neural networks: the eumentia, the dementia, and the amentia	Mar 12, 2026	Unverified
Maximum Entropy Exploration Without the Rollouts	Mar 12, 2026	Unverified
Bridging the Gap Between Security Metrics and Key Risk Indicators: An Empirical Framework for Vulnerability Prioritization	Mar 12, 2026	Unverified
Operationalising Cyber Risk Management Using AI: Connecting Cyber Incidents to MITRE ATT&CK Techniques, Security Controls, and Metrics	Mar 12, 2026	Unverified
Learning Pore-scale Multiphase Flow from 4D Velocimetry	Mar 12, 2026	Unverified
Delayed Backdoor Attacks: Exploring the Temporal Dimension as a New Attack Surface in Pre-Trained Models	Mar 12, 2026	Unverified
FastLSQ: Solving PDEs in One Shot via Fourier Features with Exact Analytical Derivatives	Mar 12, 2026	Code Available
Evaluation and LLM-Guided Learning of ICD Coding Rationales	Mar 12, 2026	Unverified
LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation	Mar 12, 2026	Unverified
Deep Incentive Design with Differentiable Equilibrium Blocks	Mar 12, 2026	Unverified
Gen-Fab: A Variation-Aware Generative Model for Predicting Fabrication Variations in Nanophotonic Devices	Mar 12, 2026	Unverified
FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning	Mar 12, 2026	Unverified
Compiling Temporal Numeric Planning into Discrete PDDL+: Extended Version	Mar 12, 2026	Unverified
Llettuce: An Open Source Natural Language Processing Tool for the Translation of Medical Terms into Uniform Clinical Encoding	Mar 12, 2026	Unverified
Head-wise Adaptive Rotary Positional Encoding for Fine-Grained Image Generation	Mar 12, 2026	Unverified
Thermodynamics of Reinforcement Learning Curricula	Mar 12, 2026	Unverified
Temporal Straightening for Latent Planning	Mar 12, 2026	Unverified
MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning	Mar 12, 2026	Unverified
Generalizing Vision-Language Models with Dedicated Prompt Guidance	Mar 12, 2026	Unverified
Entropy Guided Diversification and Preference Elicitation in Agentic Recommendation Systems	Mar 12, 2026	Unverified
EvoFlows: Evolutionary Edit-Based Flow-Matching for Protein Engineering	Mar 12, 2026	Unverified
Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents	Mar 12, 2026	Unverified
Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models	Mar 12, 2026	Unverified
A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition	Mar 12, 2026	Unverified
Causal Representation Learning with Optimal Compression under Complex Treatments	Mar 12, 2026	Unverified
Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach	Mar 12, 2026	Unverified
CoMMET: To What Extent Can LLMs Perform Theory of Mind Tasks?	Mar 12, 2026	Unverified
Real-World Point Tracking with Verifier-Guided Pseudo-Labeling	Mar 12, 2026	Unverified
PreLoRA: Hybrid Pre-training of Vision Transformers with Full Training and Low-Rank Adapters	Mar 12, 2026	Unverified
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously	Mar 12, 2026	Code Available
Evaluate-as-Action: Self-Evaluated Process Rewards for Retrieval-Augmented Agents	Mar 12, 2026	Unverified
Hidden State Poisoning Attacks against Mamba-based Language Models	Mar 12, 2026	Unverified
DRIFT: Dual-Representation Inter-Fusion Transformer for Automated Driving Perception with 4D Radar Point Clouds	Mar 12, 2026	Unverified
SpectralGuard: Detecting Memory Collapse Attacks in State Space Models	Mar 12, 2026	Unverified
LLM-Augmented Therapy Normalization and Aspect-Based Sentiment Analysis for Treatment-Resistant Depression on Reddit	Mar 12, 2026	Unverified