The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11001–11025 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models	Feb 21, 2024		CodeCode Available	2	5
Saturn: Sample-efficient Generative Molecular Design using Memory Manipulation	May 27, 2024	Data AugmentationDrug Discovery	CodeCode Available	2	5
An Intelligent Agentic System for Complex Image Restoration Problems	Oct 23, 2024	Image Restoration	CodeCode Available	2	5
Multivariate Probabilistic Regression with Natural Gradient Boosting	Jun 7, 2021	regression	CodeCode Available	2	5
Brain-Computer-Interface controlled robot via RaspberryPi and PiEEG	Feb 4, 2022	Brain Computer InterfaceEEG	CodeCode Available	2	5
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models	May 30, 2025	ClassificationDisaster Response	CodeCode Available	2	5
Learning to Generalize Provably in Learning to Optimize	Feb 22, 2023		CodeCode Available	2	5
Next-ViT: Next Generation Vision Transformer for Efficient Deployment in Realistic Industrial Scenarios	Jul 12, 2022	Image Classification	CodeCode Available	2	5
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts	Oct 29, 2024		CodeCode Available	2	5
DoTAT: A Domain-oriented Text Annotation Tool	May 1, 2022	text annotation	CodeCode Available	2	5
A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models	Sep 20, 2023	Language ModellingMachine Translation	CodeCode Available	2	5
How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning	Feb 5, 2024	In-Context LearningMetric Learning	CodeCode Available	2	5
A Replication Study of Dense Passage Retriever	Apr 12, 2021	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	2	5
A simple way to make neural networks robust against diverse image corruptions	Jan 16, 2020		CodeCode Available	2	5
Drive Like a Human: Rethinking Autonomous Driving with Large Language Models	Jul 14, 2023	Autonomous DrivingCommon Sense Reasoning	CodeCode Available	2	5
Automatic Depression Detection: An Emotional Audio-Textual Corpus and a GRU/BiLSTM-based Model	Feb 15, 2022	Depression DetectionDiagnostic	CodeCode Available	2	5
XGen-7B Technical Report	Sep 7, 2023	2k8k	CodeCode Available	2	5
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs	Jul 8, 2025	GPUreinforcement-learning	CodeCode Available	2	5
InPars Toolkit: A Unified and Reproducible Synthetic Data Generation Pipeline for Neural Information Retrieval	Jul 10, 2023	GPUInformation Retrieval	CodeCode Available	2	5
Advancing Real-time Pandemic Forecasting Using Large Language Models: A COVID-19 Case Study	Apr 10, 2024	Representation LearningTime Series	CodeCode Available	2	5
Image Super-Resolution using Efficient Striped Window Transformer	Jan 24, 2023	Image Super-Resolution	CodeCode Available	2	5
Eureka: Evaluating and Understanding Large Foundation Models	Sep 13, 2024	Information Retrieval	CodeCode Available	2	5
CLIMB: Data Foundations for Large Scale Multimodal Clinical Foundation Models	Mar 9, 2025		CodeCode Available	2	5
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning	Sep 14, 2023	HallucinationIn-Context Learning	CodeCode Available	2	5
Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation	Apr 4, 2024	Contrastive LearningReferring Expression	CodeCode Available	2	5