The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 2751–2800 of 659,983 papers

Title	Date	Tasks	Status	Hype
syftr: Pareto-Optimal Generative AI	May 26, 2025	Bayesian Optimization, RAG	Code Available	3
Towards Urban General Intelligence: A Review and Outlook of Urban Foundation Models	Jan 30, 2024		Code Available	3
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning	May 22, 2025	Reinforcement Learning (RL)	Code Available	3
Taskflow: A Lightweight Parallel and Heterogeneous Task Graph Computing System	Apr 23, 2020	Scheduling	Code Available	3
Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling	Mar 2, 2024		Code Available	3
Unbiased Estimator for Distorted Conics in Camera Calibration	Mar 7, 2024	Camera Calibration	Code Available	3
360Zhinao Technical Report	May 22, 2024	4k	Code Available	3
LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training	Mar 3, 2025	3DGS, GPU	Code Available	3
E5-V: Universal Embeddings with Multimodal Large Language Models	Jul 17, 2024		Code Available	3
Affordance-based Robot Manipulation with Flow Matching	Sep 2, 2024	Action Generation, Robot Manipulation	Code Available	3
Harnessing the Universal Geometry of Embeddings	May 18, 2025	Attribute	Code Available	3
FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance	Nov 19, 2020	Deep Reinforcement Learning, reinforcement-learning	Code Available	3
On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning	Nov 10, 2021	Multi-agent Reinforcement Learning, reinforcement-learning	Code Available	3
RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework	Aug 2, 2024	Benchmarking, Dataset Generation	Code Available	3
Patches Are All You Need?	Jan 24, 2022	All, Image Classification	Code Available	3
Cascade Prompt Learning for Vision-Language Model Adaptation	Sep 26, 2024	General Knowledge, image-classification	Code Available	3
Relational Multi-Task Learning: Modeling Relations between Data and Tasks	Mar 14, 2023	Multi-Task Learning, Transfer Learning	Code Available	3
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning	Feb 15, 2024	Data Augmentation, Instruction Following	Code Available	3
Deciphering Oracle Bone Language with Diffusion Models	Jun 2, 2024	Decipherment, Image Generation	Code Available	3
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout Guidance	Jun 11, 2024	Image Generation, Text to Image Generation	Code Available	3
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to Advances	Oct 24, 2024	Benchmarking, Image to Video Generation	Code Available	3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge Distillation	Aug 28, 2024	Computational Efficiency, Hallucination	Code Available	3
Deep Photo Style Transfer	Mar 22, 2017	Style Transfer	Code Available	3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion Models	Feb 7, 2024	counterfactual, Image Generation	Code Available	3
Generalized Decoding for Pixel, Image, and Language	Dec 21, 2022	Decoder, Image Segmentation	Code Available	3
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks	Apr 2, 2024	In-Context Learning	Code Available	3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More	Apr 18, 2023	General Knowledge, Image Segmentation	Code Available	3
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning	Jul 9, 2019	Speech Synthesis, text-to-speech	Code Available	3
StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting	Mar 12, 2024	3DGS, Decoder	Code Available	3
GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting	Feb 2, 2024		Code Available	3
REPLUG: Retrieval-Augmented Black-Box Language Models	Jan 30, 2023	Language Modeling, Language Modelling	Code Available	3
Query-Based Adversarial Prompt Generation	Feb 19, 2024	Language Modeling, Language Modelling	Code Available	3
GRAG: Graph Retrieval-Augmented Generation	May 26, 2024	Entity Retrieval, Knowledge Graphs	Code Available	3
Conformer: Convolution-augmented Transformer for Speech Recognition	May 16, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	3
Producing and Leveraging Online Map Uncertainty in Trajectory Prediction	Mar 25, 2024	Autonomous Driving, Prediction	Code Available	3
Efficient Inference for Large Reasoning Models: A Survey	Mar 29, 2025	Survey	Code Available	3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases	Jan 6, 2025	Fairness, Language Modeling	Code Available	3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns	Sep 27, 2024	Time Series, Time Series Forecasting	Code Available	3
RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion	Apr 14, 2024	Time Series	Code Available	3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation	Feb 19, 2024	Image Generation	Code Available	3
EXP-Bench: Can AI Conduct AI Research Experiments?	May 30, 2025		Code Available	3
CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments	May 10, 2025	Pose Estimation	Code Available	3
Neural Ordinary Differential Equations	Jun 19, 2018	Multivariate Time Series Forecasting, Multivariate Time Series Imputation	Code Available	3
LEADS: Lightweight Embedded Assisted Driving System	Oct 23, 2024		Code Available	3
Fine-Tuning Language Models with Just Forward Passes	May 27, 2023	GPU, In-Context Learning	Code Available	3
USB: A Unified Semi-supervised Learning Benchmark for Classification	Aug 12, 2022	General Classification, GPU	Code Available	3
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks	Feb 7, 2025	Benchmarking	Code Available	3
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning	Feb 19, 2024		Code Available	3
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities	Jan 23, 2025	General Knowledge, Instruction Following	Code Available	3
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives	Sep 5, 2024	Management, Transfer Learning	Code Available	3