The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7801–7850 of 661570 papers

Title	Date	Tasks	Status	Hype
Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level	Jun 22, 2024	Machine TranslationTranslation	CodeCode Available	2
WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing	Jan 24, 2024	Activity Recognition	CodeCode Available	2
Pretrained LLM Adapted with LoRA as a Decision Transformer for Offline RL in Quantitative Trading	Nov 26, 2024	Offline RLparameter-efficient fine-tuning	CodeCode Available	2
RankVicuna: Zero-Shot Listwise Document Reranking with Open-Source Large Language Models	Sep 26, 2023	Information RetrievalReranking	CodeCode Available	2
SynJax: Structured Probability Distributions for JAX	Aug 7, 2023		CodeCode Available	2
GPTopic: Dynamic and Interactive Topic Representations	Mar 6, 2024		CodeCode Available	2
A Length-Extrapolatable Transformer	Dec 20, 2022	Language ModelingLanguage Modelling	CodeCode Available	2
Scaling Up Your Kernels to 31x31: Revisiting Large Kernel Design in CNNs	Mar 13, 2022	Image Classification	CodeCode Available	2
Democratizing Neural Machine Translation with OPUS-MT	Dec 4, 2022	Machine TranslationTranslation	CodeCode Available	2
Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training	Jan 10, 2024		CodeCode Available	2
YOLOv8-AM: YOLOv8 Based on Effective Attention Mechanisms for Pediatric Wrist Fracture Detection	Feb 14, 2024	Fracture detectionmedical image detection	CodeCode Available	2
PPLLaVA: Varied Video Sequence Understanding With Prompt Guidance	Nov 4, 2024	Caption GenerationMultiple-choice	CodeCode Available	2
EEG-Deformer: A Dense Convolutional Transformer for Brain-computer Interfaces	Apr 25, 2024	EEGElectroencephalogram (EEG)	CodeCode Available	2
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records	Jun 13, 2024	Adversarial RobustnessExplainable Artificial Intelligence (XAI)	CodeCode Available	2
CHGNet: Pretrained universal neural network potential for charge-informed atomistic modeling	Feb 28, 2023	Atomic ForcesGraph Neural Network	CodeCode Available	2
LogAI: A Library for Log Analytics and Intelligence	Jan 31, 2023	Anomaly DetectionLog Parsing	CodeCode Available	2
ReMoDiffuse: Retrieval-Augmented Motion Diffusion Model	Apr 3, 2023	DenoisingDiversity	CodeCode Available	2
ConceptLab: Creative Concept Generation using VLM-Guided Diffusion Prior Constraints	Aug 3, 2023	Image GenerationLanguage Modelling	CodeCode Available	2
Geometric Latent Diffusion Models for 3D Molecule Generation	May 2, 2023	3D Molecule GenerationUnconditional Molecule Generation	CodeCode Available	2
Accelerating Self-Play Learning in Go	Feb 27, 2019	Game of Go	CodeCode Available	2
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models	May 23, 2023	Common Sense ReasoningImage Generation	CodeCode Available	2
MoEUT: Mixture-of-Experts Universal Transformers	May 25, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms	May 27, 2025	Bayesian OptimizationBenchmarking	CodeCode Available	2
LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction	Jul 16, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs	Apr 25, 2024	Visual GroundingVisual Question Answering	CodeCode Available	2
Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization	Sep 9, 2023	Language ModellingLarge Language Model	CodeCode Available	2
Cross-Image Relational Knowledge Distillation for Semantic Segmentation	Apr 14, 2022	Knowledge DistillationSegmentation	CodeCode Available	2
MLAgentBench: Evaluating Language Agents on Machine Learning Experimentation	Oct 5, 2023	BenchmarkingDecision Making	CodeCode Available	2
Glimpse: Enabling White-Box Methods to Use Proprietary Models for Zero-Shot LLM-Generated Text Detection	Dec 16, 2024	LLM-generated Text DetectionText Detection	CodeCode Available	2
R3LIVE: A Robust, Real-time, RGB-colored, LiDAR-Inertial-Visual tightly-coupled state Estimation and mapping package	Sep 10, 2021	Sensor FusionState Estimation	CodeCode Available	2
ActiveRAG: Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents	Feb 21, 2024	Active LearningPosition	CodeCode Available	2
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval	Jul 21, 2024	General KnowledgeHighlight Detection	CodeCode Available	2
SMAC3: A Versatile Bayesian Optimization Package for Hyperparameter Optimization	Sep 20, 2021	AutoMLBayesian Optimization	CodeCode Available	2
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device	Sep 27, 2021	Video RecognitionVideo Understanding	CodeCode Available	2
MultiOOD: Scaling Out-of-Distribution Detection for Multiple Modalities	May 27, 2024	Autonomous DrivingOut-of-Distribution Detection	CodeCode Available	2
Self-Exploring Language Models: Active Preference Elicitation for Online Alignment	May 29, 2024	Instruction Following	CodeCode Available	2
FEC: Fast Euclidean Clustering for Point Cloud Segmentation	Aug 16, 2022	ClusteringInstance Segmentation	CodeCode Available	2
PeFoMed: Parameter Efficient Fine-tuning of Multimodal Large Language Models for Medical Imaging	Jan 5, 2024	Medical Report GenerationMedical Visual Question Answering	CodeCode Available	2
Jailbreak Vision Language Models via Bi-Modal Adversarial Prompt	Jun 6, 2024	Language ModellingLarge Language Model	CodeCode Available	2
UMBRELA: UMbrela is the (Open-Source Reproduction of the) Bing RELevance Assessor	Jun 10, 2024	RAGRetrieval	CodeCode Available	2
Exploring Orthogonality in Open World Object Detection	Jan 1, 2024	Incremental LearningObject	CodeCode Available	2
You Name It, I Run It: An LLM Agent to Execute Tests of Arbitrary Projects	Dec 13, 2024	Large Language Model	CodeCode Available	2
Equinox: neural networks in JAX via callable PyTrees and filtered transformations	Oct 30, 2021		CodeCode Available	2
Deep Architectures for Content Moderation and Movie Content Rating	Dec 8, 2022	Action RecognitionGenre classification	CodeCode Available	2
Revisiting Referring Expression Comprehension Evaluation in the Era of Large Multimodal Models	Jun 24, 2024	Referring ExpressionReferring Expression Comprehension	CodeCode Available	2
Denoising as Adaptation: Noise-Space Domain Adaptation for Image Restoration	Jun 26, 2024	Contrastive LearningDeblurring	CodeCode Available	2
Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction	Nov 22, 2021	GPUNeRF	CodeCode Available	2
Investigating Tradeoffs in Real-World Video Super-Resolution	Nov 24, 2021	BenchmarkingSuper-Resolution	CodeCode Available	2
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighborhood Search	May 21, 2021		CodeCode Available	2
Deep Learning Interviews: Hundreds of fully solved job interview questions from a wide range of key topics in AI	Dec 30, 2021		CodeCode Available	2