The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers


Showing 2,151–2,200 of 177,339 papers

Title	Date	Tasks	Status	Hype	Score
Navigation World Models	Dec 4, 2024	Robot Navigation, Video Generation	Code Available	4	5
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models	Apr 21, 2025	MME, Video MME	Code Available	4	5
Diffusion-Based Planning for Autonomous Driving with Flexible Guidance	Jan 26, 2025	Autonomous Driving, Imitation Learning	Code Available	4	5
Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like Speed	Mar 7, 2024	3D Reconstruction, Image Retrieval	Code Available	4	5
VideoChat: Chat-Centric Video Understanding	May 10, 2023	Question Answering, Video-based Generative Performance Benchmarking	Code Available	4	5
HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture Recognition	Dec 2, 2024	Gesture Recognition, Hand Detection	Code Available	4	5
Contextual Multilingual Spellchecker for User Queries	May 1, 2023		Code Available	4	5
Panoptic Feature Pyramid Networks	Jan 8, 2019	Instance Segmentation, Panoptic Segmentation	Code Available	4	5
Evolution Transformer: In-Context Evolutionary Optimization	Mar 5, 2024		Code Available	4	5
Segment and Track Anything	May 11, 2023	Autonomous Driving, multimodal interaction	Code Available	4	5
SmoothGrad: removing noise by adding noise	Jun 12, 2017	Interpretable Machine Learning, Sensitivity	Code Available	4	5
A Comprehensive Survey on 3D Content Generation	Feb 2, 2024	Survey	Code Available	4	5
Autoregressive Models in Vision: A Survey	Nov 8, 2024	3D Generation, Image Generation	Code Available	4	5
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints	Dec 10, 2024	4D reconstruction, Video Generation	Code Available	4	5
Ray: A Distributed Framework for Emerging AI Applications	Dec 16, 2017	reinforcement-learning, Reinforcement Learning	Code Available	4	5
RegNet: Self-Regulated Network for Image Classification	Jan 3, 2021	Classification, General Classification	Code Available	4	5
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View Stereo	May 20, 2024	NeRF, Novel View Synthesis	Code Available	4	5
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue Dataset	Feb 27, 2020	Dialogue State Tracking, Task-Oriented Dialogue Systems	Code Available	4	5
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than English	Sep 1, 2021	Language Modelling	Code Available	4	5
Dive into Deep Learning	Jun 21, 2021	Deep Learning, Math	Code Available	4	5
RLlib Flow: Distributed Reinforcement Learning is a Dataflow Problem	Nov 25, 2020	reinforcement-learning, Reinforcement Learning	Code Available	4	5
Kolmogorov-Arnold Transformer	Sep 16, 2024	Image Classification	Code Available	4	5
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery	Jun 16, 2024	scientific discovery, Survey	Code Available	4	5
Sonata: Self-Supervised Learning of Reliable Point Representations	Mar 20, 2025	3D Semantic Segmentation, Self-Supervised Learning	Code Available	4	5
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models	Feb 27, 2024	Marketing, Video Generation	Code Available	4	5
fastai: A Layered API for Deep Learning	Feb 11, 2020	Deep Learning, GPU	Code Available	4	5
Learning Important Features Through Propagating Activation Differences	Apr 10, 2017	Interpretable Machine Learning	Code Available	4	5
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents	Jun 13, 2025	Information Retrieval, Retrieval	Code Available	4	5
Orion-14B: Open-source Multilingual Large Language Models	Jan 20, 2024	Scheduling	Code Available	4	5
iText2KG: Incremental Knowledge Graphs Construction Using Large Language Models	Sep 5, 2024	Few-Shot Learning, Information Retrieval	Code Available	4	5
Acoustic modeling for Overlapping Speech Recognition: JHU Chime-5 Challenge System	May 17, 2024	Data Augmentation, Speech Dereverberation	Code Available	4	5
KernelBench: Can LLMs Write Efficient GPU Kernels?	Feb 14, 2025	GPU	Code Available	4	5
Image Segmentation Keras : Implementation of Segnet, FCN, UNet, PSPNet and other models in Keras	Jul 25, 2023	Image Segmentation, Segmentation	Code Available	4	5
MaskNet: Introducing Feature-Wise Multiplication to CTR Ranking Models by Instance-Guided Mask	Feb 9, 2021	Click-Through Rate Prediction, Recommendation Systems	Code Available	4	5
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning	Nov 4, 2024		Code Available	4	5
Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale	Sep 12, 2024		Code Available	4	5
A Framework For Contrastive Self-Supervised Learning And Designing A New Approach	Aug 31, 2020	Data Augmentation, Image Classification	Code Available	4	5
Brain-inspired Multilayer Perceptron with Spiking Neurons	Mar 28, 2022	Inductive Bias	Code Available	4	5
V3D: Video Diffusion Models are Effective 3D Generators	Mar 11, 2024	3D Generation, Novel View Synthesis	Code Available	4	5
LLM4AD: A Platform for Algorithm Design with Large Language Model	Dec 23, 2024	Language Modeling, Language Modelling	Code Available	4	5
An Aggregated Multicolumn Dilated Convolution Network for Perspective-Free Counting	Apr 20, 2018		Code Available	4	5
An Extended Sequence Tagging Vocabulary for Grammatical Error Correction	Feb 12, 2023	Grammatical Error Correction, Morphological Inflection	Code Available	4	5
GPUTreeShap: Massively Parallel Exact Calculation of SHAP Scores for Tree Ensembles	Oct 27, 2020	BIG-bench Machine Learning, CPU	Code Available	4	5
TransPixeler: Advancing Text-to-Video Generation with Transparency	Jan 6, 2025	Text-to-Video Generation, Video Generation	Code Available	4	5
BlazePose: On-device Real-time Body Pose tracking	Jun 17, 2020	2D Human Pose Estimation, 3D Human Pose Estimation	Code Available	4	5
FitDiT: Advancing the Authentic Garment Details for High-fidelity Virtual Try-on	Nov 15, 2024	Virtual Try-on	Code Available	4	5
EvoX: A Distributed GPU-accelerated Framework for Scalable Evolutionary Computation	Jan 29, 2023	GPU, Navigate	Code Available	4	5
Amortized Planning with Large-Scale Transformers: A Case Study on Chess	Feb 7, 2024	Memorization	Code Available	4	5
LISA: Reasoning Segmentation via Large Language Model	Aug 1, 2023	Language Modeling, Language Modelling	Code Available	4	5
Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding	Sep 22, 2024	Anomaly Detection, GPU	Code Available	4	5