The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8701–8725 of 474278 papers

Title	Date	Tasks	Status	Hype
Mamba-R: Vision Mamba ALSO Needs Registers	May 23, 2024	MambaSemantic Segmentation	CodeCode Available	2
Large language models can be zero-shot anomaly detectors for time series?	May 23, 2024	Anomaly DetectionLanguage Modeling	CodeCode Available	2
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing	May 23, 2024	Instruction Following	CodeCode Available	2
EMR-Merging: Tuning-Free High-Performance Model Merging	May 23, 2024	Image ClassificationImage Retrieval	CodeCode Available	2
RectifID: Personalizing Rectified Flow with Anchored Classifier Guidance	May 23, 2024	Image GenerationPersonalized Image Generation	CodeCode Available	2
TopoLogic: An Interpretable Pipeline for Lane Topology Reasoning on Driving Scenes	May 23, 2024	Autonomous DrivingLane Detection	CodeCode Available	2
Extracting Prompts by Inverting LLM Outputs	May 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Metric Flow Matching for Smooth Interpolations on the Data Manifold	May 23, 2024	Trajectory Prediction	CodeCode Available	2
Flatten Anything: Unsupervised Neural Surface Parameterization	May 23, 2024		CodeCode Available	2
S-Eval: Towards Automated and Comprehensive Safety Evaluation for Large Language Models	May 23, 2024	Benchmarking	CodeCode Available	2
AnalogCoder: Analog Circuit Design via Training-Free Code Generation	May 23, 2024	Code Generation	CodeCode Available	2
DreamText: High Fidelity Scene Text Synthesis	May 23, 2024		CodeCode Available	2
Dynamic Mixture of Experts: An Auto-Tuning Approach for Efficient Transformer Models	May 23, 2024	Mixture-of-ExpertsVisual Question Answering	CodeCode Available	2
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models	May 23, 2024	Natural Language UnderstandingQuantization	CodeCode Available	2
EHRMamba: Towards Generalizable and Scalable Foundation Models for Electronic Health Records	May 23, 2024	Mamba	CodeCode Available	2
Vikhr: Constructing a State-of-the-art Bilingual Open-Source Instruction-Following Large Language Model for Russian	May 22, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	2
CViT: Continuous Vision Transformer for Operator Learning	May 22, 2024	Operator learning	CodeCode Available	2
Generalizing Weather Forecast to Fine-grained Temporal Scales via Physics-AI Hybrid Modeling	May 22, 2024	Weather Forecasting	CodeCode Available	2
BrainMorph: A Foundational Keypoint Model for Robust and Flexible Brain MRI Registration	May 22, 2024		CodeCode Available	2
Learning Diffusion Priors from Observations by Expectation Maximization	May 22, 2024		CodeCode Available	2
Dense Connector for MLLMs	May 22, 2024	Video Understanding	CodeCode Available	2
Context and Geometry Aware Voxel Transformer for Semantic Scene Completion	May 22, 2024	3D Semantic Scene Completion from a single RGB image	CodeCode Available	2
A General Framework for Jersey Number Recognition in Sports Video	May 22, 2024	Jersey Number RecognitionScene Text Recognition	CodeCode Available	2
Model Editing as a Robust and Denoised variant of DPO: A Case Study on Toxicity	May 22, 2024	Language ModellingModel Editing	CodeCode Available	2
FedCache 2.0: Federated Edge Learning with Knowledge Caching and Dataset Distillation	May 22, 2024	Dataset DistillationFederated Learning	CodeCode Available	2