The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 7,201–7,225 of 474,278 papers

Title	Date	Tasks	Status	Hype
Merging in a Bottle: Differentiable Adaptive Merging (DAM) and the Path from Averaging to Automation	Oct 10, 2024		Code Available	2
Thought2Text: Text Generation from EEG Signal using Large Language Models (LLMs)	Oct 10, 2024	EEG, Text Generation	Code Available	2
Reversible Decoupling Network for Single Image Reflection Removal	Oct 10, 2024	Reflection Removal	Code Available	2
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text	Oct 10, 2024	Language Modeling, Language Modelling	Code Available	2
PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection	Oct 10, 2024	object-detection, Object Detection	Code Available	2
Interactive4D: Interactive 4D LiDAR Segmentation	Oct 10, 2024	Interactive Segmentation, Segmentation	Code Available	2
Benchmarking Agentic Workflow Generation	Oct 10, 2024	Benchmarking	Code Available	2
Enhancing Soccer Camera Calibration Through Keypoint Exploitation	Oct 9, 2024	Camera Calibration, Camera Pose Estimation	Code Available	2
Rodimus*: Breaking the Accuracy-Efficiency Trade-Off with Efficient Attentions	Oct 9, 2024	Semantic Compression	Code Available	2
Compositional Entailment Learning for Hyperbolic Vision-Language Models	Oct 9, 2024	Language Modelling, Representation Learning	Code Available	2
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate	Oct 9, 2024	cross-modal alignment, Visual Question Answering	Code Available	2
LS-EEND: Long-Form Streaming End-to-End Neural Diarization with Online Attractor Extraction	Oct 9, 2024	Decoder, Form	Code Available	2
Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates	Oct 9, 2024		Code Available	2
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses	Oct 9, 2024	scientific discovery, valid	Code Available	2
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training	Oct 9, 2024	Caption Generation, Contrastive Learning	Code Available	2
MatMamba: A Matryoshka State Space Model	Oct 9, 2024	model, Representation Learning	Code Available	2
Pair-VPR: Place-Aware Pre-training and Contrastive Pair Classification for Visual Place Recognition with Vision Transformers	Oct 9, 2024	Decoder, Re-Ranking	Code Available	2
Towards Natural Image Matting in the Wild via Real-Scenario Prior	Oct 9, 2024	Decoder, Image Matting	Code Available	2
An Undetectable Watermark for Generative Image Models	Oct 9, 2024		Code Available	2
SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration	Oct 9, 2024		Code Available	2
CursorCore: Assist Programming through Aligning Anything	Oct 9, 2024	Code Completion	Code Available	2
Sylber: Syllabic Embedding Representation of Speech from Raw Audio	Oct 9, 2024	Language Modeling, Language Modelling	Code Available	2
Spiking GS: Towards High-Accuracy and Low-Cost Surface Reconstruction via Spiking Neuron-based Gaussian Splatting	Oct 9, 2024	Surface Reconstruction	Code Available	2
Quanda: An Interpretability Toolkit for Training Data Attribution Evaluation and Beyond	Oct 9, 2024	Benchmarking	Code Available	2
Towards Interpreting Visual Information Processing in Vision-Language Models	Oct 9, 2024	Language Modeling, Language Modelling	Code Available	2