The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7926–7950 of 474278 papers

Title	Date	Status
The End of Manual Decoding: Towards Truly End-to-End Language Models	Oct 31, 2025	—Unverified
DP-FedPGN: Finding Global Flat Minima for Differentially Private Federated Learning via Penalizing Gradient Norm	Oct 31, 2025	CodeCode Available
Eliciting Secret Knowledge from Language Models	Oct 31, 2025	—Unverified
IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction	Oct 31, 2025	—Unverified
E2Rank: Your Text Embedding can Also be an Effective and Efficient Listwise Reranker	Oct 31, 2025	—Unverified
Object-IR: Leveraging Object Consistency and Mesh Deformation for Self-Supervised Image Retargeting	Oct 31, 2025	CodeCode Available
HyperClick: Advancing Reliable GUI Grounding via Uncertainty Calibration	Oct 31, 2025	—Unverified
Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals	Oct 31, 2025	—Unverified
Cycle Consistency as Reward: Learning Image-Text Alignment without Human Preferences	Oct 31, 2025	—Unverified
ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use	Oct 31, 2025	—Unverified
Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications	Oct 31, 2025	CodeCode Available
Multilingual Political Views of Large Language Models: Identification and Steering	Oct 31, 2025	CodeCode Available
Adaptive Stochastic Coefficients for Accelerating Diffusion Sampling	Oct 31, 2025	CodeCode Available
Ready to Translate, Not to Represent? Bias and Performance Gaps in Multilingual LLMs Across Language Families and Domains	Oct 31, 2025	CodeCode Available
Dynamic Gaussian Splatting from Defocused and Motion-blurred Monocular Videos	Oct 31, 2025	CodeCode Available
DiagramEval: Evaluating LLM-Generated Diagrams via Graphs	Oct 31, 2025	CodeCode Available
Aeolus: A Multi-structural Flight Delay Dataset	Oct 31, 2025	CodeCode Available
MLPerf Automotive	Oct 31, 2025	CodeCode Available
ZEBRA: Towards Zero-Shot Cross-Subject Generalization for Universal Brain Visual Decoding	Oct 31, 2025	CodeCode Available
E-MMDiT: Revisiting Multimodal Diffusion Transformer Design for Fast Image Synthesis under Limited Resources	Oct 31, 2025	CodeCode Available
MemeArena: Automating Context-Aware Unbiased Evaluation of Harmfulness Understanding for Multimodal Large Language Models	Oct 31, 2025	CodeCode Available
Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering	Oct 31, 2025	CodeCode Available
DRAMA: Unifying Data Retrieval and Analysis for Open-Domain Analytic Queries	Oct 31, 2025	CodeCode Available
Trans-defense: Transformer-based Denoiser for Adversarial Defense with Spatial-Frequency Domain Representation	Oct 31, 2025	CodeCode Available
A Technical Exploration of Causal Inference with Hybrid LLM Synthetic Data	Oct 31, 2025	CodeCode Available