The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6001–6050 of 661570 papers

Title	Date	Tasks	Status	Hype
EmoFace: Audio-driven Emotional 3D Face Animation	Jul 17, 2024	3D Face Animation	CodeCode Available	2
OmniBench: Towards The Future of Universal Omni-Language Models	Sep 23, 2024	Instruction Following	CodeCode Available	2
ADATIME: A Benchmarking Suite for Domain Adaptation on Time Series Data	Mar 15, 2022	BenchmarkingDomain Adaptation	CodeCode Available	2
ConceptExpress: Harnessing Diffusion Models for Single-image Unsupervised Concept Extraction	Jul 9, 2024	Image GenerationText to Image Generation	CodeCode Available	2
InteractRank: Personalized Web-Scale Search Pre-Ranking with Cross Interaction Features	Apr 9, 2025	Computational Efficiency	CodeCode Available	2
Specializing Smaller Language Models towards Multi-Step Reasoning	Jan 30, 2023	MathModel Selection	CodeCode Available	2
Stitchable Neural Networks	Feb 13, 2023	Image Classification	CodeCode Available	2
Respecting causality is all you need for training physics-informed neural networks	Mar 14, 2022	AllAttribute	CodeCode Available	2
Towards Interpretable Mental Health Analysis with Large Language Models	Apr 6, 2023	Causal Emotion EntailmentEmotion Recognition	CodeCode Available	2
Cross-Modality Safety Alignment	Jun 21, 2024	Safety Alignment	CodeCode Available	2
FiLo: Zero-Shot Anomaly Detection by Fine-Grained Description and High-Quality Localization	Apr 21, 2024	Anomaly DetectionPosition	CodeCode Available	2
HybriMoE: Hybrid CPU-GPU Scheduling and Cache Management for Efficient MoE Inference	Apr 8, 2025	CPUGPU	CodeCode Available	2
Target conversation extraction: Source separation using turn-taking dynamics	Jul 15, 2024		CodeCode Available	2
Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts	Mar 14, 2024	DenoisingMixture-of-Experts	CodeCode Available	2
GPT-InvestAR: Enhancing Stock Investment Strategies through Annual Report Analysis with Large Language Models	Sep 6, 2023		CodeCode Available	2
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions	Aug 19, 2023	MMEOptical Character Recognition (OCR)	CodeCode Available	2
H3T: Efficient Integration of Memory Optimization and Parallelism for Large-scale Transformer Training	Sep 21, 2023		CodeCode Available	2
Beyond Next Token Prediction: Patch-Level Training for Large Language Models	Jul 17, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
A Survey on Open-Vocabulary Detection and Segmentation: Past, Present, and Future	Jul 18, 2023	Knowledge Distillationobject-detection	CodeCode Available	2
normflows: A PyTorch Package for Normalizing Flows	Jan 26, 2023	Image GenerationVariational Inference	CodeCode Available	2
WidthFormer: Toward Efficient Transformer-based BEV View Transformation	Jan 8, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
Evidential Detection and Tracking Collaboration: New Problem, Benchmark and Algorithm for Robust Anti-UAV System	Jun 27, 2023		CodeCode Available	2
Deep Incubation: Training Large Models by Divide-and-Conquering	Dec 8, 2022	Image Segmentationobject-detection	CodeCode Available	2
Fortuna: A Library for Uncertainty Quantification in Deep Learning	Feb 8, 2023	Bayesian InferenceBenchmarking	CodeCode Available	2
BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation	Apr 3, 2022	DecoderDepth Estimation	CodeCode Available	2
TURNA: A Turkish Encoder-Decoder Language Model for Enhanced Understanding and Generation	Jan 25, 2024	DecoderLanguage Modeling	CodeCode Available	2
Med-R1: Reinforcement Learning for Generalizable Medical Reasoning in Vision-Language Models	Mar 18, 2025	AnatomyAttribute	CodeCode Available	2
Etalon: Holistic Performance Evaluation Framework for LLM Inference Systems	Jul 9, 2024		CodeCode Available	2
Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmark	May 14, 2024		CodeCode Available	2
A Diffusion-Based Generative Equalizer for Music Restoration	Mar 27, 2024	Bandwidth ExtensionHallucination	CodeCode Available	2
Omnizart: A General Toolbox for Automatic Music Transcription	Jun 1, 2021	Chord RecognitionDownbeat Tracking	CodeCode Available	2
MARLIN: Masked Autoencoder for facial video Representation LearnINg	Nov 12, 2022	Action ClassificationAttribute	CodeCode Available	2
Thermal half-lives of azobenzene derivatives: virtual screening based on intersystem crossing using a machine learning potential	Jul 23, 2022		CodeCode Available	2
GeoCLIP: Clip-Inspired Alignment between Locations and Images for Effective Worldwide Geo-localization	Sep 27, 2023	Contrastive Learninggeo-localization	CodeCode Available	2
Large Language Models for Anomaly and Out-of-Distribution Detection: A Survey	Sep 3, 2024	Out-of-Distribution Detection	CodeCode Available	2
Towards Scalable Automated Alignment of LLMs: A Survey	Jun 3, 2024	Survey	CodeCode Available	2
ViTime: A Visual Intelligence-Based Foundation Model for Time Series Forecasting	Jul 10, 2024	Time SeriesTime Series Analysis	CodeCode Available	2
StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams	Jun 10, 2025	3DGS3D Reconstruction	CodeCode Available	2
Understanding Performance of Long-Document Ranking Models through Comprehensive Evaluation and Leaderboarding	Jul 4, 2022	BenchmarkingDocument Ranking	CodeCode Available	2
eVAE: Evolutionary Variational Autoencoder	Jan 1, 2023	DisentanglementImage Generation	CodeCode Available	2
Let Images Give You More:Point Cloud Cross-Modal Training for Shape Analysis	Oct 9, 2022	3D Point Cloud ClassificationKnowledge Distillation	CodeCode Available	2
Long-term Traffic Simulation with Interleaved Autoregressive Motion and Scenario Generation	Jun 20, 2025	Scene Generation	CodeCode Available	2
EmerNeRF: Emergent Spatial-Temporal Scene Decomposition via Self-Supervision	Nov 3, 2023	Optical Flow EstimationSemantic Segmentation	CodeCode Available	2
L-AutoDA: Leveraging Large Language Models for Automated Decision-based Adversarial Attacks	Jan 27, 2024	Adversarial AttackComputational Efficiency	CodeCode Available	2
Omni-Video: Democratizing Unified Video Understanding and Generation	Jul 8, 2025	Video GenerationVideo Understanding	CodeCode Available	2
From Perfect to Noisy World Simulation: Customizable Embodied Multi-modal Perturbations for SLAM Robustness Benchmarking	Jun 24, 2024	BenchmarkingNeRF	CodeCode Available	2
ExpeL: LLM Agents Are Experiential Learners	Aug 20, 2023	Decision MakingTransfer Learning	CodeCode Available	2
MuMA-ToM: Multi-modal Multi-Agent Theory of Mind	Aug 22, 2024		CodeCode Available	2
Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient	Nov 26, 2024	GPUImage Generation	CodeCode Available	2
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body Simulation	Jun 24, 2021	MuJoCoOpenAI Gym	CodeCode Available	2