The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 551–575 of 659983 papers

Title	Date	Tasks	Status	Hype
TaskBench: Benchmarking Large Language Models for Task Automation	Nov 30, 2023	BenchmarkingParameter Prediction	CodeCode Available	6
U-Net v2: Rethinking the Skip Connections of U-Net for Medical Image Segmentation	Nov 29, 2023	Computational EfficiencyDecoder	CodeCode Available	6
SparseCtrl: Adding Sparse Controls to Text-to-Video Diffusion Models	Nov 28, 2023	Video Generation	CodeCode Available	6
Adversarial Diffusion Distillation	Nov 28, 2023	Image Generation	CodeCode Available	6
TabRepo: A Large Scale Repository of Tabular Model Evaluations and its AutoML Applications	Nov 6, 2023	AutoMLHyperparameter Optimization	CodeCode Available	6
Res-Tuning: A Flexible and Efficient Tuning Paradigm via Unbinding Tuner from Backbone	Oct 30, 2023	Disentanglement	CodeCode Available	6
H2O Open Ecosystem for State-of-the-art Large Language Models	Oct 17, 2023		CodeCode Available	6
A decoder-only foundation model for time-series forecasting	Oct 14, 2023	DecoderTime Series	CodeCode Available	6
MemGPT: Towards LLMs as Operating Systems	Oct 12, 2023	Management	CodeCode Available	6
Mistral 7B	Oct 10, 2023	answerability predictionArithmetic Reasoning	CodeCode Available	6
iTransformer: Inverted Transformers Are Effective for Time Series Forecasting	Oct 10, 2023	Time SeriesTime Series Forecasting	CodeCode Available	6
NEFTune: Noisy Embeddings Improve Instruction Finetuning	Oct 9, 2023	Language ModelingLanguage Modelling	CodeCode Available	6
Enhancing Financial Sentiment Analysis via Retrieval Augmented Large Language Models	Oct 6, 2023	Decision MakingRetrieval	CodeCode Available	6
Improved Baselines with Visual Instruction Tuning	Oct 5, 2023	Factual Inconsistency Detection in Chart CaptioningImage Classification	CodeCode Available	6
Qwen Technical Report	Sep 28, 2023	Language ModelingLanguage Modelling	CodeCode Available	6
Vision Transformers Need Registers	Sep 28, 2023	Object DiscoverySelf-Supervised Image Classification	CodeCode Available	6
RAGAS: Automated Evaluation of Retrieval Augmented Generation	Sep 26, 2023	RAGRetrieval	CodeCode Available	6
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models	Sep 21, 2023	4kGPU	CodeCode Available	6
Data Formulator: AI-powered Concept-driven Visualization Authoring	Sep 18, 2023	AI Agent	CodeCode Available	6
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models	Sep 18, 2023	Visual Question Answering	CodeCode Available	6
Efficient Memory Management for Large Language Model Serving with PagedAttention	Sep 12, 2023	Language ModelingLanguage Modelling	CodeCode Available	6
ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models	Sep 2, 2023		CodeCode Available	6
YaRN: Efficient Context Window Extension of Large Language Models	Aug 31, 2023	Position	CodeCode Available	6
Code Llama: Open Foundation Models for Code	Aug 24, 2023	16kCode Generation	CodeCode Available	6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages	Aug 23, 2023	Image GenerationImage to text	CodeCode Available	6