The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 4,276–4,300 of 661,570 papers

Title	Date	Tasks	Status	Hype
InstructIE: A Bilingual Instruction-based Information Extraction Dataset	May 19, 2023		Code Available	3
Quantifying the robustness of deep multispectral segmentation models against natural perturbations and data poisoning	May 18, 2023	Adversarial Robustness; Data Poisoning	Code Available	3
SpeechGPT: Empowering Large Language Models with Intrinsic Cross-Modal Conversational Abilities	May 18, 2023	Language Modeling; Language Modelling	Code Available	3
ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities	May 18, 2023	1 Image, 2*2 Stitchi; Action Classification	Code Available	3
Accelerating Transformer Inference for Translation via Parallel Decoding	May 17, 2023	Machine Translation; Translation	Code Available	3
OmniSafe: An Infrastructure for Accelerating Safe Reinforcement Learning Research	May 16, 2023	Philosophy; reinforcement-learning	Code Available	3
SpecInfer: Accelerating Generative Large Language Model Serving with Tree-based Speculative Inference and Verification	May 16, 2023	Decoder; Language Modeling	Code Available	3
NLG Evaluation Metrics Beyond Correlation Analysis: An Empirical Metric Preference Checklist	May 15, 2023	Controllable Language Modelling; Dialogue Generation	Code Available	3
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models	May 15, 2023	Multiple-choice	Code Available	3
A Comprehensive Survey on Segment Anything Model for Vision and Beyond	May 14, 2023		Code Available	3
WikiWeb2M: A Page-Level Multimodal Wikipedia Dataset	May 9, 2023	Articles; Image Captioning	Code Available	3
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans	May 8, 2023	Instruction Following; Language Modeling	Code Available	3
X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages	May 7, 2023	Attribute; Instruction Following	Code Available	3
Visual Causal Scene Refinement for Video Question Answering	May 7, 2023	Contrastive Learning; Question Answering	Code Available	3
PiML Toolbox for Interpretable Machine Learning Model Development and Diagnostics	May 7, 2023	Fairness; Interpretable Machine Learning	Code Available	3
Caption Anything: Interactive Image Description with Diverse Multimodal Controls	May 4, 2023	controllable image captioning; Image Captioning	Code Available	3
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision	May 4, 2023	Diversity; In-Context Learning	Code Available	3
Personalize Segment Anything Model with One Shot	May 4, 2023	Image Generation; model	Code Available	3
Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models	May 4, 2023	Instruction Following	Code Available	3
Unlimiformer: Long-Range Transformers with Unlimited Length Input	May 2, 2023	Book summarization; CPU	Code Available	3
Is Your Code Generated by ChatGPT Really Correct? Rigorous Evaluation of Large Language Models for Code Generation	May 2, 2023	Code Generation; HumanEval	Code Available	3
UCF: Uncovering Common Features for Generalizable Deepfake Detection	Apr 27, 2023	Binary Classification; Decoder	Code Available	3
LibCity: A Unified Library Towards Efficient and Comprehensive Urban Spatial-Temporal Prediction	Apr 27, 2023	Prediction	Code Available	3
TorchBench: Benchmarking PyTorch with High API Surface Coverage	Apr 27, 2023	Benchmarking; GPU	Code Available	3
Learning Neural PDE Solvers with Parameter-Guided Channel Attention	Apr 27, 2023	PDE Surrogate Modeling; Weather Forecasting	Code Available	3