The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 9201–9250 of 661570 papers

Title	Date	Tasks	Status	Hype
DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers	Feb 25, 2024	In-Context LearningSafety Alignment	CodeCode Available	2
Numerical Association Rule Mining: A Systematic Literature Review	Jul 2, 2023	ArticlesSystematic Literature Review	CodeCode Available	2
SciAssess: Benchmarking LLM Proficiency in Scientific Literature Analysis	Mar 4, 2024	BenchmarkingDrug Discovery	CodeCode Available	2
Deep Constrained Least Squares for Blind Image Super-Resolution	Feb 15, 2022	Blind Super-ResolutionDeblurring	CodeCode Available	2
Beyond Accuracy: Behavioral Testing of NLP models with CheckList	May 8, 2020	Question AnsweringSentiment Analysis	CodeCode Available	2
Unified Contrastive Learning in Image-Text-Label Space	Apr 7, 2022	Contrastive Learningimage-classification	CodeCode Available	2
Monitoring and explainability of models in production	Jul 13, 2020	BIG-bench Machine Learning	CodeCode Available	2
Dragonfly: Multi-Resolution Zoom-In Encoding Enhances Vision-Language Models	Jun 3, 2024	Image CaptioningLanguage Modelling	CodeCode Available	2
Shift-ConvNets: Small Convolutional Kernel with Large Kernel Effects	Jan 23, 2024		CodeCode Available	2
ReWOO: Decoupling Reasoning from Observations for Efficient Augmented Language Models	May 23, 2023	Retrieval	CodeCode Available	2
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model	Mar 10, 2025	Image DescriptionImage Generation	CodeCode Available	2
M^3-20M: A Large-Scale Multi-Modal Molecule Dataset for AI-driven Drug Design and Discovery	Dec 8, 2024	Drug DesignMolecular Property Prediction	CodeCode Available	2
JoJoGAN: One Shot Face Stylization	Dec 22, 2021	Image StylizationOne-Shot Face Stylization	CodeCode Available	2
MambaFusion: Height-Fidelity Dense Global Fusion for Multi-modal 3D Object Detection	Jul 6, 2025	3D Object DetectionAttribute	CodeCode Available	2
Reinforcing General Reasoning without Verifiers	May 27, 2025	MathMathematical Reasoning	CodeCode Available	2
pyRDF2Vec: A Python Implementation and Extension of RDF2Vec	May 4, 2022		CodeCode Available	2
Search Arena: Analyzing Search-Augmented LLMs	Jun 5, 2025	Fact Checking	CodeCode Available	2
R3M: A Universal Visual Representation for Robot Manipulation	Mar 23, 2022	Contrastive LearningRobot Manipulation	CodeCode Available	2
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments	Aug 28, 2024	Autonomous DrivingAutonomous Navigation	CodeCode Available	2
UV-free Texture Generation with Denoising and Geodesic Heat Diffusions	Aug 29, 2024	DenoisingTexture Synthesis	CodeCode Available	2
From Tiny Machine Learning to Tiny Deep Learning: A Survey	Jun 21, 2025	AutoMLModel Optimization	CodeCode Available	2
CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation	Oct 30, 2024	BenchmarkingPassage Retrieval	CodeCode Available	2
Unishox: A hybrid encoder for Short Unicode Strings	Jan 18, 2022		CodeCode Available	2
Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users	May 6, 2022	Transliteration	CodeCode Available	2
Apply Hierarchical-Chain-of-Generation to Complex Attributes Text-to-3D Generation	May 7, 2025	3D GenerationAttribute	CodeCode Available	2
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies	Feb 6, 2024	Decision MakingDiversity	CodeCode Available	2
A Novel Plug-in Module for Fine-Grained Visual Classification	Feb 8, 2022	ClassificationFine-Grained Image Classification	CodeCode Available	2
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval	Mar 11, 2022	GPURetrieval	CodeCode Available	2
SAMM (Segment Any Medical Model): A 3D Slicer Integration to SAM	Apr 12, 2023	Image SegmentationSegmentation	CodeCode Available	2
CLRNet: Cross Layer Refinement Network for Lane Detection	Mar 19, 2022	Lane Detection	CodeCode Available	2
REGTR: End-to-end Point Cloud Correspondences with Transformers	Mar 28, 2022	Point Cloud RegistrationPose Estimation	CodeCode Available	2
Can LLMs Follow Simple Rules?	Nov 6, 2023		CodeCode Available	2
GIT: A Generative Image-to-text Transformer for Vision and Language	May 27, 2022	DecoderImage Captioning	CodeCode Available	2
URetinex-Net: Retinex-Based Deep Unfolding Network for Low-Light Image Enhancement	Jan 1, 2022	Image EnhancementLow-Light Image Enhancement	CodeCode Available	2
WanJuan: A Comprehensive Multimodal Dataset for Advancing English and Chinese Large Models	Aug 21, 2023		CodeCode Available	2
AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos	Jun 14, 2022	Super-ResolutionVideo Super-Resolution	CodeCode Available	2
Scale-Aware Trident Networks for Object Detection	Jan 7, 2019	Objectobject-detection	CodeCode Available	2
PrefixQuant: Eliminating Outliers by Prefixed Tokens for Large Language Models Quantization	Oct 7, 2024	Common Sense ReasoningQuantization	CodeCode Available	2
Protein-to-genome alignment with miniprot	Oct 14, 2022		CodeCode Available	2
Dialogue Learning With Human-In-The-Loop	Nov 29, 2016	Question Answeringreinforcement-learning	CodeCode Available	2
SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints	May 2, 2024	DiversityDrug Design	CodeCode Available	2
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation	Nov 17, 2022	3D Generation3D Reconstruction	CodeCode Available	2
SEED: A Simple and Effective 3D DETR in Point Clouds	Jul 15, 2024		CodeCode Available	2
ExBEHRT: Extended Transformer for Electronic Health Records to Predict Disease Subtypes & Progressions	Mar 22, 2023		CodeCode Available	2
AutoTrust: Benchmarking Trustworthiness in Large Vision Language Models for Autonomous Driving	Dec 19, 2024	Autonomous DrivingBenchmarking	CodeCode Available	2
Enhancing Multi-Camera People Tracking with Anchor-Guided Clustering and Spatio-Temporal Consistency ID Re-Assignment	Apr 19, 2023	Multiple People Tracking	CodeCode Available	2
TSMixer: Lightweight MLP-Mixer Model for Multivariate Time Series Forecasting	Jun 14, 2023	Multivariate Time Series ForecastingRepresentation Learning	CodeCode Available	2
Listen, Denoise, Action! Audio-Driven Motion Synthesis with Diffusion Models	Nov 17, 2022	Gesture GenerationMotion Synthesis	CodeCode Available	2
Generalized Portrait Quality Assessment	Feb 14, 2024	Face Image Quality Assessment	CodeCode Available	2
Benchmarking Potential Based Rewards for Learning Humanoid Locomotion	Jul 19, 2023	BenchmarkingReinforcement Learning (RL)	CodeCode Available	2