The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10376–10400 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
Context is Key: A Benchmark for Forecasting with Essential Textual Information	Oct 24, 2024	Decision MakingTime Series	CodeCode Available	2	5
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks	Jan 10, 2024	Benchmarking	CodeCode Available	2	5
A Survey on 3D Gaussian Splatting	Jan 8, 2024	3D ReconstructionSurvey	CodeCode Available	2	5
SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents	Dec 17, 2024	Task Planning	CodeCode Available	2	5
Efficient Parallel Genetic Algorithm for Perturbed Substructure Optimization in Complex Network	Dec 30, 2024	Combinatorial OptimizationGraph Mining	CodeCode Available	2	5
A Survey on Hardware Accelerators for Large Language Models	Jan 18, 2024	Survey	CodeCode Available	2	5
PSAvatar: A Point-based Shape Model for Real-Time Head Avatar Animation with 3D Gaussian Splatting	Jan 23, 2024		CodeCode Available	2	5
EQ-Bench: An Emotional Intelligence Benchmark for Large Language Models	Dec 11, 2023	BenchmarkingEmotional Intelligence	CodeCode Available	2	5
Object Detection using Event Camera: A MoE Heat Conduction based Detector and A New Benchmark Dataset	Dec 9, 2024	Computational EfficiencyMixture-of-Experts	CodeCode Available	2	5
Machine Unlearning of Pre-trained Large Language Models	Feb 23, 2024	Machine Unlearning	CodeCode Available	2	5
Segment Any Anomaly without Training via Hybrid Prompt Regularization	May 18, 2023	Anomaly DetectionAnomaly Localization	CodeCode Available	2	5
Nexus: A Lightweight and Scalable Multi-Agent Framework for Complex Tasks Automation	Feb 26, 2025	Code GenerationHumanEval	CodeCode Available	2	5
Advanced Millimeter-Wave Radar System for Real-Time Multiple-Human Tracking and Fall Detection	Mar 8, 2024	Clustering	CodeCode Available	2	5
DISTFLASHATTN: Distributed Memory-efficient Attention for Long-context LLMs Training	Oct 5, 2023	GPU	CodeCode Available	2	5
VideoSAGE: Video Summarization with Graph Representation Learning	Apr 14, 2024	Graph Representation LearningNode Classification	CodeCode Available	2	5
DISC-LawLLM: Fine-tuning Large Language Models for Intelligent Legal Services	Sep 20, 2023	Language ModellingLarge Language Model	CodeCode Available	2	5
Free-T2M: Frequency Enhanced Text-to-Motion Diffusion Model With Consistency Loss	Jan 30, 2025	DenoisingMotion Generation	CodeCode Available	2	5
Dual-path Mamba: Short and Long-term Bidirectional Selective Structured State Space Models for Speech Separation	Mar 27, 2024	MambaSpeech Separation	CodeCode Available	2	5
Magic-Boost: Boost 3D Generation with Multi-View Conditioned Diffusion	Apr 9, 2024	3D Generation	CodeCode Available	2	5
Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation	Apr 8, 2025	Domain AdaptationDomain Generalization	CodeCode Available	2	5
LaSagnA: Language-based Segmentation Assistant for Complex Queries	Apr 12, 2024	SegmentationSemantic Segmentation	CodeCode Available	2	5
Piccolo2: General Text Embedding with Multi-task Hybrid Loss Training	May 11, 2024		CodeCode Available	2	5
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios	Jun 13, 2024	Language IdentificationSelf-Supervised Learning	CodeCode Available	2	5
Recipe for a General, Powerful, Scalable Graph Transformer	May 25, 2022	Graph ClassificationGraph Property Prediction	CodeCode Available	2	5
WATT: Weight Average Test-Time Adaptation of CLIP	Jun 19, 2024	image-classificationImage Classification	CodeCode Available	2	5