The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 5,926–5,950 of 474,278 papers

Title	Date	Tasks	Status	Hype
Next Token Is Enough: Realistic Image Quality and Aesthetic Scoring with Multimodal Large Language Model	Mar 8, 2025	Image Quality Assessment, Language Modeling	Code Available	2
D2GV: Deformable 2D Gaussian Splatting for Video Representation in 400FPS	Mar 7, 2025	Denoising, Quantization	Code Available	2
WritingBench: A Comprehensive Benchmark for Generative Writing	Mar 7, 2025	Text Generation	Code Available	2
DriveTransformer: Unified Transformer for Scalable End-to-End Autonomous Driving	Mar 7, 2025	Autonomous Driving, Bench2Drive	Code Available	2
Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching	Mar 7, 2025		Code Available	2
Linear-MoE: Linear Sequence Modeling Meets Mixture-of-Experts	Mar 7, 2025	Mixture-of-Experts, State Space Models	Code Available	2
EDM: Efficient Deep Feature Matching	Mar 7, 2025		Code Available	2
Encrypted Vector Similarity Computations Using Partially Homomorphic Encryption: Applications and Performance Analysis	Mar 7, 2025	Image Retrieval, Privacy Preserving	Code Available	2
CoMoGaussian: Continuous Motion-Aware Gaussian Splatting from Motion-Blurred Images	Mar 7, 2025	3DGS, 3D Scene Reconstruction	Code Available	2
PromptPex: Automatic Test Generation for Language Model Prompts	Mar 7, 2025	Language Modeling, Language Modelling	Code Available	2
A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval	Mar 7, 2025	Information Retrieval, Language Modeling	Code Available	2
Slim attention: cut your context memory in half without loss of accuracy -- K-cache is all you need for MHA	Mar 7, 2025	All, Decoder	Code Available	2
Generalized Interpolating Discrete Diffusion	Mar 6, 2025	Language Modeling, Language Modelling	Code Available	2
Omnidirectional Multi-Object Tracking	Mar 6, 2025	Multi-Object Tracking, Object	Code Available	2
Bridging the Vision-Brain Gap with an Uncertainty-Aware Blur Prior	Mar 6, 2025	Image Retrieval	Code Available	2
Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model	Mar 6, 2025	General Knowledge, Image Captioning	Code Available	2
ProtComposer: Compositional Protein Structure Generation with 3D Ellipsoids	Mar 6, 2025	Diversity	Code Available	2
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant	Mar 6, 2025	Language Modeling, Language Modelling	Code Available	2
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM	Mar 6, 2025	Anomaly Detection, Language Modeling	Code Available	2
Scaling Rich Style-Prompted Text-to-Speech Datasets	Mar 6, 2025	Language Modeling, Language Modelling	Code Available	2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process	Mar 6, 2025	Autonomous Navigation, Computational Efficiency	Code Available	2
Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities	Mar 6, 2025	Language Modeling, Language Modelling	Code Available	2
PDX: A Data Layout for Vector Similarity Search	Mar 6, 2025	Avg	Code Available	2
Collaborative Expert LLMs Guided Multi-Objective Molecular Optimization	Mar 5, 2025	Language Modeling, Language Modelling	Code Available	2
Universal Narrative Model: an Author-centric Storytelling Framework for Generative AI	Mar 5, 2025		Code Available	2