The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers


Showing 9,801–9,825 of 474,278 papers

Title	Date	Status
Integrating Object Interaction Self-Attention and GAN-Based Debiasing for Visual Question Answering	Sep 25, 2025	Code Available
LM-Searcher: Cross-domain Neural Architecture Search with LLMs via Unified Numerical Encoding	Sep 25, 2025	Code Available
Understanding-in-Generation: Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation	Sep 25, 2025	Code Available
SIM-CoT: Supervised Implicit Chain-of-Thought	Sep 25, 2025	Code Available
Mammo-CLIP Dissect: A Framework for Analysing Mammography Concepts in Vision-Language Models	Sep 25, 2025	Code Available
Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They Say	Sep 25, 2025	Code Available
AutoOEP -- A Multi-modal Framework for Online Exam Proctoring	Sep 24, 2025	Code Available
UI-S1: Advancing GUI Automation via Semi-online Reinforcement Learning	Sep 24, 2025	Unverified
Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs	Sep 24, 2025	Code Available
Intervening in Black Box: Concept Bottleneck Model for Enhancing Human Neural Network Mutual Understanding	Sep 24, 2025	Code Available
Measuring Harmfulness of Computer-Using Agents	Sep 24, 2025	Unverified
MAPEX: A Multi-Agent Pipeline for Keyphrase Extraction	Sep 24, 2025	Code Available
Lavida-O: Elastic Large Masked Diffusion Models for Unified Multimodal Understanding and Generation	Sep 24, 2025	Unverified
From Prompt to Progression: Taming Video Diffusion Models for Seamless Attribute Transition	Sep 24, 2025	Code Available
UserRL: Training Interactive User-Centric Agent via Reinforcement Learning	Sep 24, 2025	Unverified
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition	Sep 24, 2025	Unverified
FreezeVLA: Action-Freezing Attacks against Vision-Language-Action Models	Sep 24, 2025	Unverified
SynchroRaMa : Lip-Synchronized and Emotion-Aware Talking Face Generation via Multi-Modal Emotion Embedding	Sep 24, 2025	Unverified
Every Character Counts: From Vulnerability to Defense in Phishing Detection	Sep 24, 2025	Code Available
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling	Sep 24, 2025	Unverified
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning	Sep 24, 2025	Code Available
Improving Monte Carlo Tree Search for Symbolic Regression	Sep 24, 2025	Code Available
Discrete Diffusion for Reflective Vision-Language-Action Models in Autonomous Driving	Sep 24, 2025	Unverified
Charting a Decade of Computational Linguistics in Italy: The CLiC-it Corpus	Sep 24, 2025	Unverified
PGCLODA: Prompt-Guided Graph Contrastive Learning for Oligopeptide-Infectious Disease Association Prediction	Sep 24, 2025	Code Available