The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers


Showing 6,976–7,000 of 474,278 papers

Title	Date	Tasks	Status	Hype
InjecGuard: Benchmarking and Mitigating Over-defense in Prompt Injection Guardrail Models	Oct 30, 2024	Benchmarking	Code Available	2
Controlling Language and Diffusion Models by Transporting Activations	Oct 30, 2024	Negation	Code Available	2
A Survey on RGB, 3D, and Multimodal Approaches for Unsupervised Industrial Anomaly Detection	Oct 29, 2024	Anomaly Detection	Code Available	2
ET-Flow: Equivariant Flow-Matching for Molecular Conformer Generation	Oct 29, 2024	Drug Discovery	Code Available	2
CHORDONOMICON: A Dataset of 666,000 Songs and their Chord Progressions	Oct 29, 2024		Code Available	2
PC-Gym: Benchmark Environments For Process Control Problems	Oct 29, 2024	Benchmarking, Chemical Process	Code Available	2
Beyond Text: Optimizing RAG with Multimodal Inputs for Industrial Applications	Oct 29, 2024	Image Retrieval, RAG	Code Available	2
Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench	Oct 29, 2024	Language Modeling, Language Modelling	Code Available	2
Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Oct 29, 2024	Few-shot 3D Point Cloud Semantic Segmentation, Point Cloud Segmentation	Code Available	2
Rare-to-Frequent: Unlocking Compositional Generation Power of Diffusion Models on Rare Concepts with LLM Guidance	Oct 29, 2024	Language Modeling, Language Modelling	Code Available	2
ActiveSplat: High-Fidelity Scene Reconstruction through Active Gaussian Splatting	Oct 29, 2024	Active 3D Reconstruction, Decision Making	Code Available	2
AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts	Oct 29, 2024		Code Available	2
Semantic Editing Increment Benefits Zero-Shot Composed Image Retrieval	Oct 28, 2024	Image Retrieval, Image to text	Code Available	2
LongReward: Improving Long-context Large Language Models with AI Feedback	Oct 28, 2024	Offline RL, Reinforcement Learning (RL)	Code Available	2
Hacking Back the AI-Hacker: Prompt Injection as a Defense Against LLM-driven Cyberattacks	Oct 28, 2024		Code Available	2
ODRL: A Benchmark for Off-Dynamics Reinforcement Learning	Oct 28, 2024	Benchmarking, reinforcement-learning	Code Available	2
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive Learning	Oct 28, 2024	Binary Classification, Contrastive Learning	Code Available	2
BSD: a Bayesian framework for parametric models of neural spectra	Oct 28, 2024	Bayesian Inference, EEG	Code Available	2
Fast Calibrated Explanations: Efficient and Uncertainty-Aware Explanations for Machine Learning Models	Oct 28, 2024	Computational Efficiency, Feature Importance	Code Available	2
RecFlow: An Industrial Full Flow Recommendation Dataset	Oct 28, 2024	Recommendation Systems, Selection bias	Code Available	2
Skinned Motion Retargeting with Dense Geometric Interaction Perception	Oct 28, 2024	motion retargeting	Code Available	2
NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks	Oct 28, 2024	Quantization	Code Available	2
Flaming-hot Initiation with Regular Execution Sampling for Large Language Models	Oct 28, 2024	Diversity, Math	Code Available	2
Domain Adaptation with a Single Vision-Language Embedding	Oct 28, 2024	Domain Adaptation, One-shot Unsupervised Domain Adaptation	Code Available	2
Unpacking SDXL Turbo: Interpreting Text-to-Image Models with Sparse Autoencoders	Oct 28, 2024	Denoising	Code Available	2