The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 6,051–6,075 of 474,278 papers

Title	Date	Tasks	Status	Hype
voc2vec: A Foundation Model for Non-Verbal Vocalization	Feb 22, 2025	model	Code Available	2
SalM2: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver Attention	Feb 22, 2025	Mamba	Code Available	2
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification	Feb 21, 2025	Contrastive Learning, Time Series	Code Available	2
A Training-free LLM-based Approach to General Chinese Character Error Correction	Feb 21, 2025	Language Modeling, Language Modelling	Code Available	2
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework	Feb 21, 2025	Autonomous Driving	Code Available	2
Protein Large Language Models: A Comprehensive Survey	Feb 21, 2025	Articles, Protein Structure Prediction	Code Available	2
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind	Feb 21, 2025	Model Discovery	Code Available	2
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection	Feb 21, 2025	3D Anomaly Detection, 3D Anomaly Detection and Segmentation	Code Available	2
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Feb 21, 2025	Autonomous Driving, Imitation Learning	Code Available	2
PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning	Feb 21, 2025	Hallucination	Code Available	2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation	Feb 21, 2025	Audio Generation, FAD	Code Available	2
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms	Feb 21, 2025	Scheduling	Code Available	2
HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States	Feb 20, 2025		Code Available	2
ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation	Feb 20, 2025	3D Molecule Generation, Protein Design	Code Available	2
GiGL: Large-Scale Graph Neural Networks at Snapchat	Feb 20, 2025	Graph Learning	Code Available	2
Optimizing Model Selection for Compound AI Systems	Feb 20, 2025	model, Model Selection	Code Available	2
dtaianomaly: A Python library for time series anomaly detection	Feb 20, 2025	Anomaly Detection, Time Series	Code Available	2
Risk-mediated dynamic regulation of effective contacts de-synchronizes outbreaks in metapopulation epidemic models	Feb 20, 2025		Code Available	2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO	Feb 20, 2025	Autonomous Navigation, Navigate	Code Available	2
A Survey on Data Contamination for Large Language Models	Feb 20, 2025	Survey, Text Generation	Code Available	2
MAGO-SP: Detection and Correction of Water-Fat Swaps in Magnitude-Only VIBE MRI	Feb 20, 2025	Denoising	Code Available	2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators	Feb 20, 2025	Benchmarking, Code Generation	Code Available	2
Fast and Accurate Blind Flexible Docking	Feb 20, 2025	Blind Docking, Computational Efficiency	Code Available	2
MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders	Feb 20, 2025	Computational Efficiency	Code Available	2
Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models	Feb 20, 2025	Question Answering, Visual Question Answering	Code Available	2