The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers


Showing 16,051–16,100 of 474,278 papers

Title	Date	Tasks	Status	Hype
Must Read: A Systematic Survey of Computational Persuasion	May 12, 2025	Fairness, Marketing	Code Available	1
Ophora: A Large-Scale Data-Driven Text-Guided Ophthalmic Surgical Video Generation Model	May 12, 2025	Video Generation	Code Available	1
ISAC: An Invertible and Stable Auditory Filter Bank with Customizable Kernels for ML Integration	May 12, 2025	ISAC	Code Available	1
Asynchronous Multi-Object Tracking with an Event Camera	May 12, 2025	Multi-Object Tracking, Object	Code Available	1
Overflow Prevention Enhances Long-Context Recurrent LLMs	May 12, 2025	Mamba	Code Available	1
Finite-Sample-Based Reachability for Safe Control with Gaussian Process Dynamics	May 12, 2025	Model Predictive Control	Code Available	1
Guiding Data Collection via Factored Scaling Curves	May 12, 2025	Imitation Learning	Code Available	1
Towards Actionable Pedagogical Feedback: A Multi-Perspective Analysis of Mathematics Teaching and Tutoring Dialogue	May 12, 2025	TAG	Code Available	1
Measuring General Intelligence with Generated Games	May 12, 2025	In-Context Learning, Large Language Model	Code Available	1
Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning	May 12, 2025	Language Modeling, Language Modelling	Code Available	1
A Multi-Dimensional Constraint Framework for Evaluating and Improving Instruction Following in Large Language Models	May 12, 2025	Instruction Following	Code Available	1
FLUXSynID: A Framework for Identity-Controlled Synthetic Face Generation with Document and Live Images	May 12, 2025	Diversity, Face Generation	Code Available	1
Symbolic Regression with Multimodal Large Language Models and Kolmogorov Arnold Networks	May 12, 2025	Kolmogorov-Arnold Networks, Language Modeling	Code Available	1
Chronocept: Instilling a Sense of Time in Machines	May 12, 2025	Fact Checking, RAG	Code Available	1
Codifying Character Logic in Role-Playing	May 12, 2025		Code Available	1
Neural Brain: A Neuroscience-inspired Framework for Embodied Agents	May 12, 2025	Navigate	Code Available	1
DocVXQA: Context-Aware Visual Explanations for Document Question Answering	May 12, 2025	Question Answering	Code Available	1
Can LLM-based Financial Investing Strategies Outperform the Market in Long Run?	May 11, 2025		Code Available	1
Non-Stationary Time Series Forecasting Based on Fourier Analysis and Cross Attention Mechanism	May 11, 2025	Financial Analysis, Time Series	Code Available	1
Unsupervised Learning for Class Distribution Mismatch	May 11, 2025		Code Available	1
Semantic-Guided Diffusion Model for Single-Step Image Super-Resolution	May 11, 2025	Image Super-Resolution, Semantic Segmentation	Code Available	1
MELLM: Exploring LLM-Powered Micro-Expression Understanding Enhanced by Subtle Motion Perception	May 11, 2025	Emotion Classification, Large Language Model	Code Available	1
Learning Soft Sparse Shapes for Efficient Time-Series Classification	May 11, 2025	Classification, Time Series	Code Available	1
BioProBench: Comprehensive Dataset and Benchmark in Biological Protocol Understanding and Reasoning	May 11, 2025	Question Answering	Code Available	1
Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models	May 11, 2025	Descriptive, Diagnostic	Code Available	1
Multimodal Fake News Detection: MFND Dataset and Shallow-Deep Multitask Learning	May 11, 2025	Contrastive Learning, Face Swapping	Code Available	1
JaxRobotarium: Training and Deploying Multi-Robot Policies in 10 Minutes	May 10, 2025	Benchmarking, GPU	Code Available	1
Quadrupedal Robot Skateboard Mounting via Reverse Curriculum Learning	May 10, 2025		Code Available	1
Reproducing and Improving CheXNet: Deep Learning for Chest X-ray Disease Classification	May 10, 2025	Multi-Label Classification	Code Available	1
FNBench: Benchmarking Robust Federated Learning against Noisy Labels	May 10, 2025	Benchmarking, Federated Learning	Code Available	1
TS-SUPERB: A Target Speech Processing Benchmark for Speech Self-Supervised Learning Models	May 10, 2025	Self-Supervised Learning	Code Available	1
Edge-Enabled VIO with Long-Tracked Features for High-Accuracy Low-Altitude IoT Navigation	May 10, 2025	Depth Estimation, Depth Prediction	Code Available	1
M3CAD: Towards Generic Cooperative Autonomous Driving Benchmark	May 10, 2025	Autonomous Driving, Motion Forecasting	Code Available	1
Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation	May 10, 2025		Code Available	1
MacRAG: Compress, Slice, and Scale-up for Multi-Scale Adaptive Context RAG	May 10, 2025	RAG, Retrieval	Code Available	1
SmartPilot: A Multiagent CoPilot for Adaptive and Intelligent Manufacturing	May 10, 2025	Decision Making, Production Forecasting	Code Available	1
Emotion-Qwen: Training Hybrid Experts for Unified Emotion and General Vision-Language Understanding	May 10, 2025	Descriptive, Emotion Recognition	Code Available	1
MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design	May 9, 2025	Mixture-of-Experts, Quantization	Code Available	1
FastDup: a scalable duplicate marking tool using speculation-and-test mechanism	May 9, 2025		Code Available	1
RefRef: A Synthetic Dataset and Benchmark for Reconstructing Refractive and Reflective Objects	May 9, 2025	3D Reconstruction, Neural Rendering	Code Available	1
PYRREGULAR: A Unified Framework for Irregular Time Series, with Classification Benchmarks	May 9, 2025	Irregular Time Series, Missing Values	Code Available	1
A Survey on Bridging VLMs and Synthetic Data	May 9, 2025	Survey	Code Available	1
Fast Differentiable Modal Simulation of Non-linear Strings, Membranes, and Plates	May 9, 2025	Audio Synthesis, CPU	Code Available	1
Accelerating Diffusion Transformer via Increment-Calibrated Caching with Channel-Aware Singular Value Decomposition	May 9, 2025	Image Generation	Code Available	1
Physics-informed Temporal Difference Metric Learning for Robot Motion Planning	May 9, 2025	Metric Learning, Motion Planning	Code Available	1
Cost-Effective, Low Latency Vector Search with Azure Cosmos DB	May 9, 2025		Code Available	1
DiGIT: Multi-Dilated Gated Encoder and Central-Adjacent Region Integrated Decoder for Temporal Action Detection Transformer	May 9, 2025	Action Detection, Decoder	Code Available	1
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks	May 9, 2025	Diagnostic, Instruction Following	Code Available	1
LAPSO: A Unified Optimization View for Learning-Augmented Power System Operations	May 8, 2025		Code Available	1
Building-Guided Pseudo-Label Learning for Cross-Modal Building Damage Mapping	May 8, 2025	Building Damage Assessment, Change Detection	Code Available	1