The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8101–8125 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Towards Interpretable Mental Health Analysis with Large Language Models	Apr 6, 2023	Causal Emotion EntailmentEmotion Recognition	CodeCode Available	2	5
Cross-Modality Safety Alignment	Jun 21, 2024	Safety Alignment	CodeCode Available	2	5
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization	Apr 21, 2024	Anomaly DetectionPosition	CodeCode Available	2	5
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference	Apr 8, 2025	CPUGPU	CodeCode Available	2	5
Target conversation extraction: Source separation using turn-taking dynamics	Jul 15, 2024		CodeCode Available	2	5
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts	Mar 14, 2024	DenoisingMixture-of-Experts	CodeCode Available	2	5
GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models	Sep 6, 2023		CodeCode Available	2	5
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions	Aug 19, 2023	MMEOptical Character Recognition (OCR)	CodeCode Available	2	5
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future	Jul 18, 2023	Knowledge Distillationobject-detection	CodeCode Available	2	5
normflows: A PyTorch Package for Normalizing Flows	Jan 26, 2023	Image GenerationVariational Inference	CodeCode Available	2	5
WidthFormer: Toward Efficient Transformer-based BEV View Transformation	Jan 8, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2	5
Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System	Jun 27, 2023		CodeCode Available	2	5
Deep Incubation: Training Large Models by Divide-and-Conquering	Dec 8, 2022	Image Segmentationobject-detection	CodeCode Available	2	5
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models	Mar 18, 2025	AnatomyAttribute	CodeCode Available	2	5
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark	May 14, 2024		CodeCode Available	2	5
MARLIN: Masked Autoencoder for facial video Representation LearnINg	Nov 12, 2022	Action ClassificationAttribute	CodeCode Available	2	5
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization	Sep 27, 2023	Contrastive Learninggeo-localization	CodeCode Available	2	5
Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey	Sep 3, 2024	Out-of-Distribution Detection	CodeCode Available	2	5
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams	Jun 10, 2025	3DGS3D Reconstruction	CodeCode Available	2	5
eVAE: Evolutionary Variational Autoencoder	Jan 1, 2023	DisentanglementImage Generation	CodeCode Available	2	5
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation	Jun 20, 2025	Scene Generation	CodeCode Available	2	5
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision	Nov 3, 2023	Optical Flow EstimationSemantic Segmentation	CodeCode Available	2	5
Omni-Video: Democratizing Unified Video Understanding and Generation	Jul 8, 2025	Video GenerationVideo Understanding	CodeCode Available	2	5
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Jun 24, 2024	BenchmarkingNeRF	CodeCode Available	2	5
Unwrapping The Black Box of Deep ReLU Networks: Interpretability, Diagnostics, and Simplification	Nov 8, 2020		CodeCode Available	2	5