The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 6051–6100 of 661570 papers

Title	Date	Tasks	Status	Hype
voc2vec: A Foundation Model for Non-Verbal Vocalization	Feb 22, 2025	model	CodeCode Available	2
Robust Dynamic Facial Expression Recognition	Feb 22, 2025	Dynamic Facial Expression RecognitionFacial Expression Recognition	CodeCode Available	2
AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind	Feb 21, 2025	Model Discovery	CodeCode Available	2
Protein Large Language Models: A Comprehensive Survey	Feb 21, 2025	ArticlesProtein Structure Prediction	CodeCode Available	2
OccProphet: Pushing Efficiency Frontier of Camera-Only 4D Occupancy Forecasting with Observer-Forecaster-Refiner Framework	Feb 21, 2025	Autonomous Driving	CodeCode Available	2
KAD: No More FAD! An Effective and Efficient Evaluation Metric for Audio Generation	Feb 21, 2025	Audio GenerationFAD	CodeCode Available	2
PIP-KAG: Mitigating Knowledge Conflicts in Knowledge-Augmented Generation via Parametric Pruning	Feb 21, 2025	Hallucination	CodeCode Available	2
VaViM and VaVAM: Autonomous Driving through Video Generative Modeling	Feb 21, 2025	Autonomous DrivingImitation Learning	CodeCode Available	2
Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification	Feb 21, 2025	Contrastive LearningTime Series	CodeCode Available	2
A Training-free LLM-based Approach to General Chinese Character Error Correction	Feb 21, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
Boosting Global-Local Feature Matching via Anomaly Synthesis for Multi-Class Point Cloud Anomaly Detection	Feb 21, 2025	3D Anomaly Detection3D Anomaly Detection and Segmentation	CodeCode Available	2
AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms	Feb 21, 2025	Scheduling	CodeCode Available	2
ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation	Feb 20, 2025	3D Molecule GenerationProtein Design	CodeCode Available	2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators	Feb 20, 2025	BenchmarkingCode Generation	CodeCode Available	2
MAGO-SP: Detection and Correction of Water-Fat Swaps in Magnitude-Only VIBE MRI	Feb 20, 2025	Denoising	CodeCode Available	2
Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models	Feb 20, 2025	Question AnsweringVisual Question Answering	CodeCode Available	2
MedVAE: Efficient Automated Interpretation of Medical Images with Large-Scale Generalizable Autoencoders	Feb 20, 2025	Computational Efficiency	CodeCode Available	2
dtaianomaly: A Python library for time series anomaly detection	Feb 20, 2025	Anomaly DetectionTime Series	CodeCode Available	2
HiddenDetect: Detecting Jailbreak Attacks against Large Vision-Language Models via Monitoring Hidden States	Feb 20, 2025		CodeCode Available	2
GiGL: Large-Scale Graph Neural Networks at Snapchat	Feb 20, 2025	Graph Learning	CodeCode Available	2
FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis	Feb 20, 2025	Age EstimationBenchmarking	CodeCode Available	2
AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO	Feb 20, 2025	Autonomous NavigationNavigate	CodeCode Available	2
Fast and Accurate Blind Flexible Docking	Feb 20, 2025	Blind DockingComputational Efficiency	CodeCode Available	2
Optimizing Model Selection for Compound AI Systems	Feb 20, 2025	modelModel Selection	CodeCode Available	2
OBELiX: A Curated Dataset of Crystal Structures and Experimentally Measured Ionic Conductivities for Lithium Solid-State Electrolytes	Feb 20, 2025		CodeCode Available	2
A Survey on Data Contamination for Large Language Models	Feb 20, 2025	SurveyText Generation	CodeCode Available	2
Risk-mediated dynamic regulation of effective contacts de-synchronizes outbreaks in metapopulation epidemic models	Feb 20, 2025		CodeCode Available	2
Medical Image Classification with KAN-Integrated Transformers and Dilated Neighborhood Attention	Feb 19, 2025	image-classificationImage Classification	CodeCode Available	2
Calibration and Option Pricing with Stochastic Volatility and Double Exponential Jumps	Feb 19, 2025	ArticlesEconometrics	CodeCode Available	2
Repo2Run: Automated Building Executable Environment for Code Repository at Scale	Feb 19, 2025		CodeCode Available	2
Smaller But Better: Unifying Layout Generation with Smaller Large Language Models	Feb 19, 2025	Layout Generation	CodeCode Available	2
SIFT: Grounding LLM Reasoning in Contexts via Stickers	Feb 19, 2025	GSM8KMath	CodeCode Available	2
MoM: Linear Sequence Modeling with Mixture-of-Memories	Feb 19, 2025		CodeCode Available	2
TESS 2: A Large-Scale Generalist Diffusion Language Model	Feb 19, 2025	Instruction FollowingLanguage Modeling	CodeCode Available	2
Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework	Feb 19, 2025		CodeCode Available	2
Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models	Feb 19, 2025	Contrastive LearningSentence	CodeCode Available	2
Event-Based Video Frame Interpolation With Cross-Modal Asymmetric Bidirectional Motion Fields	Feb 19, 2025	Video Frame Interpolation	CodeCode Available	2
DataSciBench: An LLM Agent Benchmark for Data Science	Feb 19, 2025	Code GenerationLarge Language Model	CodeCode Available	2
Train Small, Infer Large: Memory-Efficient LoRA Training for Large Language Models	Feb 19, 2025	GPUQuantization	CodeCode Available	2
JL1-CD: A New Benchmark for Remote Sensing Change Detection and a Robust Multi-Teacher Knowledge Distillation Framework	Feb 19, 2025	Change DetectionEarth Observation	CodeCode Available	2
Helix-mRNA: A Hybrid Foundation Model For Full Sequence mRNA Therapeutics	Feb 19, 2025		CodeCode Available	2
DAMamba: Vision State Space Model with Dynamic Adaptive Scan	Feb 18, 2025	image-classificationImage Classification	CodeCode Available	2
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading	Feb 18, 2025	Computational EfficiencyCPU	CodeCode Available	2
NExT-Mol: 3D Diffusion Meets 1D Language Modeling for 3D Molecule Generation	Feb 18, 2025	3D Generation3D Molecule Generation	CodeCode Available	2
Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Feb 18, 2025	DecoderGPU	CodeCode Available	2
A Machine Learning Approach That Beats Large Rubik's Cubes	Feb 18, 2025	Rubik's Cube	CodeCode Available	2
Electron flow matching for generative reaction mechanism prediction obeying conservation laws	Feb 18, 2025	Prediction	CodeCode Available	2
CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation	Feb 18, 2025	Image GenerationText to Image Generation	CodeCode Available	2
VUS: Effective and Efficient Accuracy Measures for Time-Series Anomaly Detection	Feb 18, 2025	Anomaly DetectionInformation Retrieval	CodeCode Available	2
MotifBench: A standardized protein design benchmark for motif-scaffolding problems	Feb 18, 2025	Protein DesignProtein Structure Prediction	CodeCode Available	2