The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7226–7250 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
FanOutQA: A Multi-Hop, Multi-Document Question Answering Benchmark for Large Language Models	Feb 21, 2024	Question Answering	CodeCode Available	2	5
VCoder: Versatile Vision Encoders for Multimodal Large Language Models	Dec 21, 2023	Image CaptioningImage Generation	CodeCode Available	2	5
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance	May 23, 2024	Image GenerationPersonalized Image Generation	CodeCode Available	2	5
3D Gaussian Splatting with Deferred Reflection	Apr 29, 2024	Novel View Synthesis	CodeCode Available	2	5
Centroid-Based Efficient Minimum Bayes Risk Decoding	Feb 17, 2024	de-enTranslation	CodeCode Available	2	5
VectorMapNet: End-to-end Vectorized HD Map Learning	Jun 17, 2022	3D Lane DetectionAutonomous Driving	CodeCode Available	2	5
SCTransNet: Spatial-channel Cross Transformer Network for Infrared Small Target Detection	Jan 28, 2024		CodeCode Available	2	5
Jetfire: Efficient and Accurate Transformer Pretraining with INT8 Data Flow and Per-Block Quantization	Mar 19, 2024	Quantization	CodeCode Available	2	5
TinyLVLM-eHub: Towards Comprehensive and Efficient Evaluation for Large Vision-Language Models	Aug 7, 2023	HallucinationObject Hallucination	CodeCode Available	2	5
Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance	Sep 2, 2024		CodeCode Available	2	5
Measuring Re-identification Risk	Apr 12, 2023		CodeCode Available	2	5
DiffuseVAE: Efficient, Controllable and High-Fidelity Generation from Low-Dimensional Latents	Jan 2, 2022	Image GenerationVocal Bursts Intensity Prediction	CodeCode Available	2	5
RingFormer: A Neural Vocoder with Ring Attention and Convolution-Augmented Transformer	Jan 2, 2025	Audio Generationtext-to-speech	CodeCode Available	2	5
Transformer-Based Visual Segmentation: A Survey	Apr 19, 2023	Autonomous DrivingPoint Cloud Segmentation	CodeCode Available	2	5
Scalable Multi-Temporal Remote Sensing Change Data Generation via Simulating Stochastic Change Process	Sep 29, 2023	Change Data GenerationChange Detection	CodeCode Available	2	5
MOMAland: A Set of Benchmarks for Multi-Objective Multi-Agent Reinforcement Learning	Jul 23, 2024	BenchmarkingDecision Making	CodeCode Available	2	5
CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention	Mar 13, 2023	image-classificationImage Classification	CodeCode Available	2	5
YOLOPoint Joint Keypoint and Object Detection	Feb 6, 2024	Objectobject-detection	CodeCode Available	2	5
chemtrain: Learning Deep Potential Models via Automatic Differentiation and Statistical Physics	Aug 28, 2024		CodeCode Available	2	5
VeriThinker: Learning to Verify Makes Reasoning Model Efficient	May 23, 2025	model	CodeCode Available	2	5
Colar: Effective and Efficient Online Action Detection by Consulting Exemplars	Mar 2, 2022	Action DetectionOnline Action Detection	CodeCode Available	2	5
InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models	Dec 18, 2024	Reasoning SegmentationSegmentation	CodeCode Available	2	5
MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking	Jul 28, 2023	Multi-Object TrackingMultiple Object Tracking	CodeCode Available	2	5
GaussianWorld: Gaussian World Model for Streaming 3D Occupancy Prediction	Dec 13, 2024	Autonomous DrivingPrediction	CodeCode Available	2	5
ZAPBench: A Benchmark for Whole-Brain Activity Prediction in Zebrafish	Mar 4, 2025	Activity PredictionMultivariate Time Series Forecasting	CodeCode Available	2	5