The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 2776–2800 of 661,570 papers

Title	Date	Tasks	Status	Hype
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks	Apr 2, 2024	In-Context Learning	Code Available	3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and More	Apr 18, 2023	General Knowledge, Image Segmentation	Code Available	3
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice Cloning	Jul 9, 2019	Speech Synthesis, text-to-speech	Code Available	3
StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting	Mar 12, 2024	3DGS, Decoder	Code Available	3
GaMeS: Mesh-Based Adapting and Modification of Gaussian Splatting	Feb 2, 2024		Code Available	3
REPLUG: Retrieval-Augmented Black-Box Language Models	Jan 30, 2023	Language Modeling, Language Modelling	Code Available	3
Query-Based Adversarial Prompt Generation	Feb 19, 2024	Language Modeling, Language Modelling	Code Available	3
GRAG: Graph Retrieval-Augmented Generation	May 26, 2024	Entity Retrieval, Knowledge Graphs	Code Available	3
Conformer: Convolution-augmented Transformer for Speech Recognition	May 16, 2020	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Code Available	3
Producing and Leveraging Online Map Uncertainty in Trajectory Prediction	Mar 25, 2024	Autonomous Driving, Prediction	Code Available	3
Efficient Inference for Large Reasoning Models: A Survey	Mar 29, 2025	Survey	Code Available	3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use Cases	Jan 6, 2025	Fairness, Language Modeling	Code Available	3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns	Sep 27, 2024	Time Series, Time Series Forecasting	Code Available	3
RF-Diffusion: Radio Signal Generation via Time-Frequency Diffusion	Apr 14, 2024	Time Series	Code Available	3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image Generation	Feb 19, 2024	Image Generation	Code Available	3
EXP-Bench: Can AI Conduct AI Research Experiments?	May 30, 2025		Code Available	3
CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments	May 10, 2025	Pose Estimation	Code Available	3
Neural Ordinary Differential Equations	Jun 19, 2018	Multivariate Time Series Forecasting, Multivariate Time Series Imputation	Code Available	3
LEADS: Lightweight Embedded Assisted Driving System	Oct 23, 2024		Code Available	3
Fine-Tuning Language Models with Just Forward Passes	May 27, 2023	GPU, In-Context Learning	Code Available	3
USB: A Unified Semi-supervised Learning Benchmark for Classification	Aug 12, 2022	General Classification, GPU	Code Available	3
ITBench: Evaluating AI Agents across Diverse Real-World IT Automation Tasks	Feb 7, 2025	Benchmarking	Code Available	3
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning	Feb 19, 2024		Code Available	3
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities	Jan 23, 2025	General Knowledge, Instruction Following	Code Available	3
The Role of Generative Systems in Historical Photography Management: A Case Study on Catalan Archives	Sep 5, 2024	Management, Transfer Learning	Code Available	3