The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 626–650 of 177,339 papers

Title	Date	Tasks	Status	Hype	Score
Gorilla: Large Language Model Connected with Massive APIs	May 24, 2023	Hallucination, Language Modeling	Code Available	6	5
HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in Hugging Face	Mar 30, 2023	Automatic Machine Learning Model Selection, Model Selection	Code Available	6	5
U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation	Nov 29, 2023	Computational Efficiency, Decoder	Code Available	6	5
FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning	Nov 6, 2022	Deep Reinforcement Learning, reinforcement-learning	Code Available	6	5
AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration	Jun 1, 2023	Autonomous Driving, Cloud Computing	Code Available	6	5
OxfordVGG Submission to the EGO4D AV Transcription Challenge	Jul 18, 2023	Automatic Speech Recognition, speech-recognition	Code Available	6	5
Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca	Apr 17, 2023		Code Available	6	5
Training language models to follow instructions with human feedback	Mar 4, 2022	Question Answering	Code Available	6	5
MoVQ: Modulating Quantized Vectors for High-Fidelity Image Generation	Sep 19, 2022	Decoder, Image Generation	Code Available	5	5
Unified Training of Universal Time Series Forecasting Transformers	Feb 4, 2024	Time Series, Time Series Forecasting	Code Available	5	5
InstantCharacter: Personalize Any Characters with a Scalable Diffusion Transformer Framework	Apr 16, 2025	Image Generation	Code Available	5	5
TimeMixer++: A General Time Series Pattern Machine for Universal Predictive Analysis	Oct 21, 2024	Anomaly Detection, Imputation	Code Available	5	5
Learning Flow Fields in Attention for Controllable Person Image Generation	Dec 11, 2024	Attribute, Image Generation	Code Available	5	5
MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts	Apr 13, 2024	Diversity, Language Modeling	Code Available	5	5
Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively	Jan 5, 2024	image-classification, Image Classification	Code Available	5	5
Common 7B Language Models Already Possess Strong Math Capabilities	Mar 7, 2024	GSM8K, Math	Code Available	5	5
Fast On-device LLM Inference with NPUs	Jul 8, 2024	CPU, GPU	Code Available	5	5
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation	Oct 30, 2023	Text-to-Video Generation, Video Generation	Code Available	5	5
Efficient Multimodal Learning from Data-centric Perspective	Feb 18, 2024	Image Classification, Referring Expression Comprehension	Code Available	5	5
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation	Aug 15, 2024	Diagnostic, RAG	Code Available	5	5
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference	Dec 18, 2024	Decoder, Retrieval	Code Available	5	5
StreamSpeech: Simultaneous Speech-to-Speech Translation with Multi-task Learning	Jun 5, 2024	Automatic Speech Recognition (ASR), de-en	Code Available	5	5
A ConvNet for the 2020s	Jan 10, 2022	Classification, Domain Generalization	Code Available	5	5
A Time Series is Worth 64 Words: Long-term Forecasting with Transformers	Nov 27, 2022	Multivariate Time Series Forecasting, Representation Learning	Code Available	5	5
Tango 2: Aligning Diffusion-based Text-to-Audio Generations through Direct Preference Optimization	Apr 15, 2024	Audio Generation	Code Available	5	5