The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9576–9600 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt	Jun 6, 2024	Language Modelling, Large Language Model	Code Available	2	5
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor	Jun 10, 2024	RAG, Retrieval	Code Available	2	5
Exploring Orthogonality in Open World Object Detection	Jan 1, 2024	Incremental Learning, Object	Code Available	2	5
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects	Dec 13, 2024	Large Language Model	Code Available	2	5
Equinox: neural networks in JAX via callable PyTrees and filtered transformations	Oct 30, 2021		Code Available	2	5
Deep Architectures for Content Moderation and Movie Content Rating	Dec 8, 2022	Action Recognition, Genre classification	Code Available	2	5
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models	Jun 24, 2024	Referring Expression, Referring Expression Comprehension	Code Available	2	5
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration	Jun 26, 2024	Contrastive Learning, Deblurring	Code Available	2	5
Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction	Nov 22, 2021	GPU, NeRF	Code Available	2	5
Investigating Tradeoffs in Real-World Video Super-Resolution	Nov 24, 2021	Benchmarking, Super-Resolution	Code Available	2	5
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search	May 21, 2021		Code Available	2	5
Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI	Dec 30, 2021		Code Available	2	5
POCO: Point Convolution for Surface Reconstruction	Jan 5, 2022	3D Reconstruction, Surface Reconstruction	Code Available	2	5
MoCapAct: A Multi-Task Dataset for Simulated Humanoid Control	Aug 15, 2022	Humanoid Control	Code Available	2	5
Speech Denoising in the Waveform Domain with Self-Attention	Feb 15, 2022	Decoder, Denoising	Code Available	2	5
Rethinking Network Design and Local Geometry in Point Cloud: A Simple Residual MLP Framework	Feb 15, 2022	3D Point Cloud Classification, Point Cloud Segmentation	Code Available	2	5
Differentiable and Learnable Robot Models	Feb 22, 2022		Code Available	2	5
OpenDR: An Open Toolkit for Enabling High Performance, Low Footprint Deep Learning for Robotics	Mar 1, 2022		Code Available	2	5
Recovering 3D Human Mesh from Monocular Images: A Survey	Mar 3, 2022	3D human pose and shape estimation, Human Mesh Recovery	Code Available	2	5
SoftGroup for 3D Instance Segmentation on Point Clouds	Mar 3, 2022	3D Instance Segmentation, 3D Object Detection	Code Available	2	5
Freeform Body Motion Generation from Speech	Mar 4, 2022	Diversity, Motion Generation	Code Available	2	5
Class-incremental Learning for Time Series: Benchmark and Evaluation	Feb 19, 2024	Activity Recognition, Benchmarking	Code Available	2	5
MotionCLIP: Exposing Human Motion Generation to CLIP Space	Mar 15, 2022	Disentanglement, Motion Generation	Code Available	2	5
Real-time Object Detection for Streaming Perception	Mar 23, 2022	Autonomous Driving, Object	Code Available	2	5
Touchstone Benchmark: Are We on the Right Way for Evaluating AI Algorithms for Medical Segmentation?	Nov 6, 2024		Code Available	2	5