The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 10026–10050 of 177340 papers

Title	Date	Tasks	Status	Hype	Score
LLM-FP4: 4-Bit Floating-Point Quantized Transformers	Oct 25, 2023	Common Sense ReasoningQuantization	CodeCode Available	2	5
OVTR: End-to-End Open-Vocabulary Multiple Object Tracking with Transformer	Mar 13, 2025	Decodermultimodal interaction	CodeCode Available	2	5
RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs	Mar 8, 2025	Instruction FollowingMathematical Reasoning	CodeCode Available	2	5
A Comprehensive Survey on Knowledge Distillation	Mar 15, 2025	Knowledge DistillationSurvey	CodeCode Available	2	5
TimberTrek: Exploring and Curating Sparse Decision Trees with Interactive Visualization	Sep 19, 2022		CodeCode Available	2	5
LeanVAE: An Ultra-Efficient Reconstruction VAE for Video Diffusion Models	Mar 18, 2025	compressed sensingVideo Generation	CodeCode Available	2	5
MambaIC: State Space Models for High-Performance Learned Image Compression	Mar 16, 2025	Image CompressionState Space Models	CodeCode Available	2	5
Single Image Iterative Subject-driven Generation and Editing	Mar 20, 2025	Image Generation	CodeCode Available	2	5
NuiScene: Exploring Efficient Generation of Unbounded Outdoor Scenes	Mar 20, 2025	Scene Generation	CodeCode Available	2	5
SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer	Mar 20, 2025	DecoderMamba	CodeCode Available	2	5
Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping	Mar 21, 2025	GPUMotion Estimation	CodeCode Available	2	5
Correcting Deviations from Normality: A Reformulated Diffusion Model for Multi-Class Unsupervised Anomaly Detection	Mar 25, 2025	Anomaly DetectionUnsupervised Anomaly Detection	CodeCode Available	2	5
Datasets for Depression Modeling in Social Media: An Overview	Mar 27, 2025		CodeCode Available	2	5
AutoEval: Autonomous Evaluation of Generalist Robot Manipulation Policies in the Real World	Mar 31, 2025	Robot ManipulationScheduling	CodeCode Available	2	5
On-device Sora: Enabling Training-Free Diffusion-based Text-to-Video Generation for Mobile Devices	Mar 31, 2025	DenoisingModel Optimization	CodeCode Available	2	5
Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction	Apr 2, 2025	Federated Learning	CodeCode Available	2	5
An Illusion of Progress? Assessing the Current State of Web Agents	Apr 2, 2025		CodeCode Available	2	5
Re-thinking Temporal Search for Long-Form Video Understanding	Apr 3, 2025	Computational EfficiencyForm	CodeCode Available	2	5
A Decade of Deep Learning for Remote Sensing Spatiotemporal Fusion: Advances, Challenges, and Opportunities	Apr 1, 2025		CodeCode Available	2	5
Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting	Apr 7, 2025	Boundary DetectionObject	CodeCode Available	2	5
VocalNet: Speech LLM with Multi-Token Prediction for Faster and High-Quality Generation	Apr 5, 2025		CodeCode Available	2	5
Chain-of-Tools: Utilizing Massive Unseen Tools in the CoT Reasoning of Frozen Language Models	Mar 21, 2025	GSM8KQuestion Answering	CodeCode Available	2	5
LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models	Apr 14, 2025	Equation DiscoveryMemorization	CodeCode Available	2	5
Tokenize Image Patches: Global Context Fusion for Effective Haze Removal in Large Images	Apr 13, 2025	GPU	CodeCode Available	2	5
GaussCtrl: Multi-View Consistent Text-Driven 3D Gaussian Splatting Editing	Mar 13, 2024	3DGS	CodeCode Available	2	5