The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 2,401–2,450 of 659,983 papers

Title	Date	Tasks	Status	Hype
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation	Feb 12, 2026		Unverified	3
LLaDA2.1: Speeding Up Text Diffusion via Token Editing	Feb 13, 2026		Unverified	3
ArenaRL: Scaling RL for Open-Ended Agents via Tournament-based Relative Ranking	Jan 22, 2026		Unverified	3
MagicPose: Realistic Human Poses and Facial Expressions Retargeting with Identity-aware Diffusion	Nov 18, 2023	Video Generation	Code Available	3
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries	Jan 27, 2024	Benchmarking, RAG	Code Available	3
NerfAcc: A General NeRF Acceleration Toolbox	Oct 10, 2022	NeRF	Code Available	3
Llemma: An Open Language Model For Mathematics	Oct 16, 2023	Arithmetic Reasoning, Automated Theorem Proving	Code Available	3
Datasets: A Community Library for Natural Language Processing	Sep 7, 2021	Image Classification, Object Recognition	Code Available	3
Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction	Feb 15, 2023	3D Semantic Scene Completion, Autonomous Driving	Code Available	3
ResNeSt: Split-Attention Networks	Apr 19, 2020	image-classification, Image Classification	Code Available	3
MedSegDiff-V2: Diffusion based Medical Image Segmentation with Transformer	Jan 19, 2023	Image Generation, Image Segmentation	Code Available	3
IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus	Feb 22, 2024	Zero-shot Generalization	Code Available	3
StableToolBench-MirrorAPI: Modeling Tool Environments as Mirrors of 7,000+ Real-World APIs	Mar 26, 2025	Benchmarking	Code Available	3
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory	Apr 10, 2025	Math, MMLU	Code Available	3
Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs	Jan 11, 2024	Representation Learning, Self-Supervised Learning	Code Available	3
Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling	Jan 9, 2023	2D Object Detection, Contrastive Learning	Code Available	3
Inferring Articulated Rigid Body Dynamics from RGBD Video	Mar 20, 2022	Contact mechanics, Inverse Rendering	Code Available	3
SEED-Bench-2-Plus: Benchmarking Multimodal Large Language Models with Text-Rich Visual Comprehension	Apr 25, 2024	Benchmarking, Multiple-choice	Code Available	3
Boosting Continual Learning of Vision-Language Models via Mixture-of-Experts Adapters	Mar 18, 2024	Continual Learning, Incremental Learning	Code Available	3
Neural Network Verification with Branch-and-Bound for General Nonlinearities	May 31, 2024		Code Available	3
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation	Apr 4, 2023	Cross-Modal Retrieval, Image-text Retrieval	Code Available	3
DrivAerNet: A Parametric Car Dataset for Data-Driven Aerodynamic Design and Prediction	Mar 12, 2024		Code Available	3
Exploring Intrinsic Normal Prototypes within a Single Image for Universal Anomaly Detection	Mar 4, 2025	Anomaly Detection, Multi-class Anomaly Detection	Code Available	3
Diffusion Model-Based Video Editing: A Survey	Jun 26, 2024	model, Survey	Code Available	3
Tensor Programs V: Tuning Large Neural Networks via Zero-Shot Hyperparameter Transfer	Mar 7, 2022		Code Available	3
BoT-SORT: Robust Associations Multi-Pedestrian Tracking	Jun 29, 2022	Multi-Object Tracking, Object	Code Available	3
TopoBench: A Framework for Benchmarking Topological Deep Learning	Jun 9, 2024	Benchmarking, Deep Learning	Code Available	3
InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation	Sep 12, 2023	GPU, Image Generation	Code Available	3
Impact of architecture on robustness and interpretability of multispectral deep neural networks	Sep 21, 2023	Deep Learning	Code Available	3
Are Language Models Actually Useful for Time Series Forecasting?	Jun 22, 2024	Time Series, Time Series Forecasting	Code Available	3
PDEBENCH: An Extensive Benchmark for Scientific Machine Learning	Oct 13, 2022		Code Available	3
Activating More Pixels in Image Super-Resolution Transformer	May 9, 2022	Image Super-Resolution, Super-Resolution	Code Available	3
The First Competition on Resource-Limited Infrared Small Target Detection Challenge: Methods and Results	Aug 18, 2024		Code Available	3
ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system	Jan 12, 2025	Chatbot	Code Available	3
The Manga Whisperer: Automatically Generating Transcriptions for Comics	Jan 18, 2024		Code Available	3
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Jun 10, 2024	2k, 3DGS	Code Available	3
Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives	Nov 30, 2024	3D Scene Reconstruction, NeRF	Code Available	3
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation	Jun 13, 2024	Multi-agent Reinforcement Learning	Code Available	3
Deep Neural Networks for Rank-Consistent Ordinal Regression Based On Conditional Probabilities	Nov 17, 2021	regression	Code Available	3
Channel Permutations for N:M Sparsity	Dec 1, 2021		Code Available	3
PP-MSVSR: Multi-Stage Video Super-Resolution	Dec 6, 2021	Image Super-Resolution, Super-Resolution	Code Available	3
QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning	Feb 26, 2022	image-classification, Image Classification	Code Available	3
Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer	Mar 24, 2022	Style Transfer, Transfer Learning	Code Available	3
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools Segmentation	Mar 29, 2022	Contrastive Learning, Segmentation	Code Available	3
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia	May 23, 2023	Chatbot, Hallucination	Code Available	3
Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond	Mar 21, 2024	Anomaly Detection, Deep Learning	Code Available	3
DeepCAVE: An Interactive Analysis Tool for Automated Machine Learning	Jun 7, 2022	AutoML, BIG-bench Machine Learning	Code Available	3
Plotly-Resampler: Effective Visual Analytics for Large Time Series	Jun 17, 2022	Data Visualization, Time Series	Code Available	3
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making	Apr 22, 2024	Decision Making, Medical Diagnosis	Code Available	3
The Common Core Ontologies	Apr 27, 2024		Code Available	3