The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10051–10100 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Isaac Gym: High Performance GPU-Based Physics Simulation For Robot Learning	Aug 24, 2021	CPUGPU	CodeCode Available	2	5
LibriSpeech-PC: Benchmark for Evaluation of Punctuation and Capitalization Capabilities of end-to-end ASR Models	Oct 4, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2	5
SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning	Feb 7, 2025		CodeCode Available	2	5
Harder Tasks Need More Experts: Dynamic Routing in MoE Models	Mar 12, 2024	Computational EfficiencyMixture-of-Experts	CodeCode Available	2	5
SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation	Sep 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	2	5
Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language	Jun 28, 2023	DescriptiveLanguage Modeling	CodeCode Available	2	5
PDF-WuKong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling	Oct 8, 2024	document understandingLanguage Modeling	CodeCode Available	2	5
Adaptive Keyframe Sampling for Long Video Understanding	Jan 1, 2025	Video Understanding	CodeCode Available	2	5
A Survey of Deep Learning for Mathematical Reasoning	Dec 20, 2022	Deep LearningMath	CodeCode Available	2	5
SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction	Oct 31, 2023	PredictionSemantic Similarity	CodeCode Available	2	5
Foundation Models for Spatio-Temporal Data Science: A Tutorial and Survey	Mar 12, 2025	Management	CodeCode Available	2	5
ByteTransformer: A High-Performance Transformer Boosted for Variable-Length Inputs	Oct 6, 2022	GPUVocal Bursts Intensity Prediction	CodeCode Available	2	5
DDPM-CD: Denoising Diffusion Probabilistic Models as Feature Extractors for Change Detection	Jun 23, 2022	Change DetectionDecision Making	CodeCode Available	2	5
Data Science with LLMs and Interpretable Models	Feb 22, 2024	Additive modelsQuestion Answering	CodeCode Available	2	5
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D Scenes	Aug 17, 2023	Language ModelingLanguage Modelling	CodeCode Available	2	5
Speech Foundation Model Ensembles for the Controlled Singing Voice Deepfake Detection (CtrSVDD) Challenge 2024	Sep 3, 2024	DeepFake DetectionFace Swapping	CodeCode Available	2	5
Preventing Local Pitfalls in Vector Quantization via Optimal Transport	Dec 19, 2024	Image ReconstructionQuantization	CodeCode Available	2	5
PersFormer: 3D Lane Detection via Perspective Transformer and the OpenLane Benchmark	Mar 21, 2022	3D Lane DetectionAutonomous Driving	CodeCode Available	2	5
A Survey of Financial AI: Architectures, Advances and Open Challenges	Nov 1, 2024	Decision MakingPortfolio Optimization	CodeCode Available	2	5
IMKGA-SM: Interpretable Multimodal Knowledge Graph Answer Prediction via Sequence Modeling	Jan 6, 2023	Link PredictionOptical Character Recognition	CodeCode Available	2	5
Habitat: A Platform for Embodied AI Research	Apr 2, 2019	BenchmarkingGPU	CodeCode Available	2	5
Masked Siamese Networks for Label-Efficient Learning	Apr 14, 2022	image-classificationImage Classification	CodeCode Available	2	5
Scaling Language Models: Methods, Analysis & Insights from Training Gopher	Dec 8, 2021	Abstract AlgebraAnachronisms	CodeCode Available	2	5
Mamba-ST: State Space Model for Efficient Style Transfer	Sep 16, 2024	MambaStyle Transfer	CodeCode Available	2	5
recommenderlab: An R Framework for Developing and Testing Recommendation Algorithms	May 24, 2022	Recommendation Systems	CodeCode Available	2	5
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities	Jun 17, 2024	Audio Question AnsweringInstruction Following	CodeCode Available	2	5
LayerTracer: Cognitive-Aligned Layered SVG Synthesis via Diffusion Transformer	Feb 3, 2025		CodeCode Available	2	5
RET-CLIP: A Retinal Image Foundation Model Pre-trained with Clinical Diagnostic Reports	May 23, 2024	DiagnosticMulti-Label Classification	CodeCode Available	2	5
MaGGIe: Masked Guided Gradual Human Instance Matting	Apr 24, 2024	Image MattingVideo Matting	CodeCode Available	2	5
GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation	Feb 24, 2024		CodeCode Available	2	5
Phi-4 Technical Report	Dec 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	2	5
3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation	Sep 29, 2022	Image SegmentationMedical Image Segmentation	CodeCode Available	2	5
PubTables-1M: Towards comprehensive table extraction from unstructured documents	Sep 30, 2021	Articlesobject-detection	CodeCode Available	2	5
CoqPilot, a plugin for LLM-based generation of proofs	Oct 25, 2024	Benchmarking	CodeCode Available	2	5
Formalizing and Benchmarking Prompt Injection Attacks and Defenses	Oct 19, 2023	Benchmarking	CodeCode Available	2	5
AI-Newton: A Concept-Driven Physical Law Discovery System without Prior Physical Knowledge	Apr 2, 2025	scientific discovery	CodeCode Available	2	5
Wind Noise Reduction with a Diffusion-based Stochastic Regeneration Model	Jun 22, 2023		CodeCode Available	2	5
DeeperHistReg: Robust Whole Slide Images Registration Framework	Apr 19, 2024	whole slide images	CodeCode Available	2	5
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer	Apr 19, 2022	2D Human Pose Estimation3D Human Pose Estimation	CodeCode Available	2	5
Common Diffusion Noise Schedules and Sample Steps are Flawed	May 15, 2023		CodeCode Available	2	5
Multi-Target XGBoostLSS Regression	Oct 13, 2022	regression	CodeCode Available	2	5
Recent advances in the Self-Referencing Embedding Strings (SELFIES) library	Feb 7, 2023		CodeCode Available	2	5
RETVec: Resilient and Efficient Text Vectorizer	Feb 18, 2023	Adversarial TextMetric Learning	CodeCode Available	2	5
Document Expansion by Query Prediction	Apr 17, 2019	Passage Re-RankingPrediction	CodeCode Available	2	5
Benchmarking Synthetic Tabular Data: A Multi-Dimensional Evaluation Framework	Apr 2, 2025	BenchmarkingSynthetic Data Generation	CodeCode Available	2	5
EdgeGaussians -- 3D Edge Mapping via Gaussian Splatting	Sep 19, 2024		CodeCode Available	2	5
OR-LLM-Agent: Automating Modeling and Solving of Operations Research Optimization Problem with Reasoning Large Language Model	Mar 13, 2025	AI AgentLanguage Modeling	CodeCode Available	2	5
RobustNeRF: Ignoring Distractors with Robust Losses	Feb 2, 2023	NeRF	CodeCode Available	2	5
Building Normalizing Flows with Stochastic Interpolants	Sep 30, 2022	BenchmarkingDensity Estimation	CodeCode Available	2	5
Efficient World Models with Context-Aware Tokenization	Jun 27, 2024	Deep Reinforcement LearningReinforcement Learning (RL)	CodeCode Available	2	5