The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 12501–12550 of 474278 papers

Title	Date	Tasks	Status	Hype
Is Micro-expression Ethnic Leaning?	Jul 14, 2025		Code Available	0
Large Population Models	Jul 14, 2025		Code Available	0
Demonstrating the Octopi-1.5 Visual-Tactile-Language Model	Jul 14, 2025		Code Available	0
Boosting Multimodal Learning via Disentangled Gradient Learning	Jul 14, 2025		Code Available	0
FTCFormer: Fuzzy Token Clustering Transformer for Image Classification	Jul 14, 2025		Code Available	0
DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs	Jul 14, 2025		Code Available	0
CWNet: Causal Wavelet Network for Low-Light Image Enhancement	Jul 14, 2025		Code Available	0
Structure-Guided Diffusion Models for High-Fidelity Portrait Shadow Removal	Jul 14, 2025		Code Available	0
Minimizing the Pretraining Gap: Domain-aligned Text-Based Person Retrieval	Jul 14, 2025		Code Available	0
A Training-Free, Task-Agnostic Framework for Enhancing MLLM Performance on High-Resolution Images	Jul 14, 2025		Code Available	0
ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users	Jul 14, 2025		Code Available	0
RefSTAR: Blind Facial Image Restoration with Reference Selection, Transfer, and Reconstruction	Jul 14, 2025		Code Available	0
BenchReAD: A systematic benchmark for retinal anomaly detection	Jul 14, 2025		Code Available	0
Training Dynamics Underlying Language Model Scaling Laws: Loss Deceleration and Zero-Sum Learning	Jul 14, 2025		Code Available	0
Offline Reinforcement Learning with Wasserstein Regularization via Optimal Transport Maps	Jul 14, 2025		Code Available	0
LaCache: Ladder-Shaped KV Caching for Efficient Long-Context Modeling of Large Language Models	Jul 14, 2025	Long-range modeling	Code Available	0
CodeJudgeBench: Benchmarking LLM-as-a-Judge for Coding Tasks	Jul 14, 2025	Benchmarking, Code Generation	Unverified	0
CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance	Jul 14, 2025	Benchmarking, Code Generation	Unverified	0
Turning the Tide: Repository-based Code Reflection	Jul 14, 2025	Code Generation, Diversity	Unverified	0
A New Dataset and Performance Benchmark for Real-time Spacecraft Segmentation in Onboard Flight Computers	Jul 14, 2025	Image Segmentation, Segmentation	Code Available	0
Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder	Jul 14, 2025	Self-Supervised Learning	Unverified	0
Vision Language Action Models in Robotic Manipulation: A Systematic Review	Jul 14, 2025	Dataset Generation, Natural Language Understanding	Code Available	2
Leveraging RAG-LLMs for Urban Mobility Simulation and Analysis	Jul 14, 2025	Decision Making, RAG	Unverified	0
Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning	Jul 14, 2025	Federated Learning, Knowledge Distillation	Unverified	0
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization	Jul 14, 2025	2k, Image Generation	Code Available	2
Binomial Self-Compensation: Mechanism and Suppression of Motion Error in Phase-Shifting Profilometry	Jul 14, 2025	3D Reconstruction	Unverified	0
Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks	Jul 14, 2025	image-classification, Image Classification	Unverified	0
3DGAA: Realistic and Robust 3D Gaussian-based Adversarial Attack for Autonomous Driving	Jul 14, 2025	3DGS, Adversarial Attack	Unverified	0
Lightweight Model for Poultry Disease Detection from Fecal Images Using Multi-Color Space Feature Optimization and Machine Learning	Jul 14, 2025	Computational Efficiency, Dimensionality Reduction	Unverified	0
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Jul 14, 2025	Scene Understanding, Spatial Reasoning	Unverified	0
Chat with AI: The Surprising Turn of Real-time Video Communication from Human to AI	Jul 14, 2025	Large Language Model, Multimodal Large Language Model	Unverified	0
Privacy-Preserving Multi-Stage Fall Detection Framework with Semi-supervised Federated Learning and Robotic Vision Confirmation	Jul 14, 2025	Federated Learning, Indoor Localization	Unverified	0
Efficient Federated Learning with Heterogeneous Data and Adaptive Dropout	Jul 14, 2025	Federated Learning	Unverified	0
MTF-Grasp: A Multi-tier Federated Learning Approach for Robotic Grasping	Jul 14, 2025	Federated Learning, Robotic Grasping	Unverified	0
Cameras as Relative Positional Encoding	Jul 14, 2025	Depth Estimation, Novel View Synthesis	Unverified	0
A Survey on MLLM-based Visually Rich Document Understanding: Methods, Challenges, and Emerging Trends	Jul 14, 2025	document understanding, Optical Character Recognition	Unverified	0
Domain Borders Are There to Be Crossed With Federated Few-Shot Adaptation	Jul 14, 2025	Domain Adaptation, Federated Learning	Code Available	0
Differentially Private Federated Low Rank Adaptation Beyond Fixed-Matrix	Jul 14, 2025	Privacy Preserving	Code Available	0
Graph World Model	Jul 14, 2025	Graph Learning, model	Code Available	1
Glance-MCMT: A General MCMT Framework with Glance Initialization and Progressive Association	Jul 14, 2025		Code Available	0
DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation	Jul 14, 2025	Decoder, GPU	Code Available	0
Test-Time Canonicalization by Foundation Models for Robust Perception	Jul 14, 2025		Code Available	0
FPC-Net: Revisiting SuperPoint with Descriptor-Free Keypoint Detection via Feature Pyramids and Consistency-Based Implicit Matching	Jul 14, 2025	Keypoint Detection	Unverified	0
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second	Jul 14, 2025	Novel View Synthesis, Point Tracking	Unverified	0
ZClassifier: Temperature Tuning and Manifold Approximation via KL Divergence on Logit Space	Jul 14, 2025	Out of Distribution (OOD) Detection	Code Available	0
Kodezi Chronos: A Debugging-First Language Model for Repository-Scale, Memory-Driven Code Understanding	Jul 14, 2025	Code Generation, Language Modeling	Code Available	9
SentiDrop: A Multi Modal Machine Learning model for Predicting Dropout in Distance Learning	Jul 14, 2025	Feature Importance, Sentiment Analysis	Unverified	0
Convergence of Agnostic Federated Averaging	Jul 14, 2025	Federated Learning	Unverified	0
4D-Animal: Freely Reconstructing Animatable 3D Animals from Videos	Jul 14, 2025		Code Available	1
Benchmarking and Evaluation of AI Models in Biology: Outcomes and Recommendations from the CZI Virtual Cells Workshop	Jul 14, 2025	Benchmarking	Unverified	0