The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2101–2150 of 659983 papers

Title	Date	Tasks	Status	Hype
Exploiting Diffusion Prior for Real-World Image Super-Resolution	May 11, 2023	Blind Super-ResolutionImage Super-Resolution	CodeCode Available	4
VideoChat: Chat-Centric Video Understanding	May 10, 2023	Question AnsweringVideo-based Generative Performance Benchmarking	CodeCode Available	4
InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language	May 9, 2023	Language Modelling	CodeCode Available	4
Otter: A Multi-Modal Model with In-Context Instruction Tuning	May 5, 2023	GPUIn-Context Learning	CodeCode Available	4
Contextual Multilingual Spellchecker for User Queries	May 1, 2023		CodeCode Available	4
The Ideal Continual Learner: An Agent That Never Forgets	Apr 29, 2023	Continual LearningGeneralization Bounds	CodeCode Available	4
Towards Automated Circuit Discovery for Mechanistic Interpretability	Apr 28, 2023		CodeCode Available	4
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality	Apr 27, 2023	Visual Question Answering (VQA)Zero-Shot Video Question Answer	CodeCode Available	4
Segment Anything in Medical Images	Apr 24, 2023	DiagnosticImage Segmentation	CodeCode Available	4
Phoenix: Democratizing ChatGPT across Languages	Apr 20, 2023	Language ModelingLanguage Modelling	CodeCode Available	4
Enhancing Suno's Bark Text-to-Speech Model: Addressing Limitations Through Meta's Encodec and Pre-Trained Hubert	Apr 18, 2023	Audio GenerationExpressive Speech Synthesis	CodeCode Available	4
pgmpy: A Python Toolkit for Bayesian Networks	Apr 17, 2023	Causal DiscoveryCausal Identification	CodeCode Available	4
HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge	Apr 14, 2023	model	CodeCode Available	4
OpenAGI: When LLM Meets Domain Experts	Apr 10, 2023	BenchmarkingNatural Language Queries	CodeCode Available	4
Instruction Tuning with GPT-4	Apr 6, 2023	Instruction Following	CodeCode Available	4
SegGPT: Segmenting Everything In Context	Apr 6, 2023	Few-Shot Semantic SegmentationIn-Context Learning	CodeCode Available	4
Cerebras-GPT: Open Compute-Optimal Language Models Trained on the Cerebras Wafer-Scale Cluster	Apr 6, 2023		CodeCode Available	4
Vision-Language Models for Vision Tasks: A Survey	Apr 3, 2023	BenchmarkingKnowledge Distillation	CodeCode Available	4
Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data	Apr 3, 2023	ChatbotLanguage Modeling	CodeCode Available	4
Token Merging for Fast Stable Diffusion	Mar 30, 2023	Image Generation	CodeCode Available	4
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators	Mar 29, 2023	Information RetrievalRetrieval	CodeCode Available	4
InceptionNeXt: When Inception Meets ConvNeXt	Mar 29, 2023	Image ClassificationSemantic Segmentation	CodeCode Available	4
ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks	Mar 27, 2023	text annotationText Classification	CodeCode Available	4
ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge	Mar 24, 2023	Information RetrievalLanguage Modeling	CodeCode Available	4
Text2Video-Zero: Text-to-Image Diffusion Models are Zero-Shot Video Generators	Mar 23, 2023	Image GenerationText-to-Video Generation	CodeCode Available	4
Real-time volumetric rendering of dynamic humans	Mar 21, 2023	3D ReconstructionGPU	CodeCode Available	4
FedML-HE: An Efficient Homomorphic-Encryption-Based Privacy-Preserving Federated Learning System	Mar 20, 2023	Federated LearningPrivacy Preserving	CodeCode Available	4
Reflexion: Language Agents with Verbal Reinforcement Learning	Mar 20, 2023	Decision MakingHumanEval	CodeCode Available	4
Zero-1-to-3: Zero-shot One Image to 3D Object	Mar 20, 2023	3D ReconstructionImage to 3D	CodeCode Available	4
Data-centric Artificial Intelligence: A Survey	Mar 17, 2023	Survey	CodeCode Available	4
VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation	Mar 15, 2023	Code GenerationDenoising	CodeCode Available	4
Eliciting Latent Predictions from Transformers with the Tuned Lens	Mar 14, 2023	Language Modelling	CodeCode Available	4
A Convergent Single-Loop Algorithm for Relaxation of Gromov-Wasserstein in Graph Data	Mar 12, 2023	Computational Efficiency	CodeCode Available	4
Tag2Text: Guiding Vision-Language Model via Image Tagging	Mar 10, 2023	Language ModelingLanguage Modelling	CodeCode Available	4
Cost-Effective Hyperparameter Optimization for Large Language Model Generation Inference	Mar 8, 2023	Hyperparameter OptimizationLanguage Modeling	CodeCode Available	4
FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training	Mar 3, 2023	Federated LearningGPU	CodeCode Available	4
Aligning benchmark datasets for table structure recognition	Mar 1, 2023	Table DetectionTable Recognition	CodeCode Available	4
Structured Pruning for Deep Convolutional Neural Networks: A survey	Mar 1, 2023	Network PruningNeural Architecture Search	CodeCode Available	4
Memory-aided Contrastive Consensus Learning for Co-salient Object Detection	Feb 28, 2023	Co-Salient Object Detectionobject-detection	CodeCode Available	4
Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection	Feb 23, 2023	Code CompletionComputer Security	CodeCode Available	4
AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving	Feb 22, 2023	Deep Learning	CodeCode Available	4
ChatGPT for Robotics: Design Principles and Model Abilities	Feb 20, 2023	Mathematical ReasoningPrompt Engineering	CodeCode Available	4
Improving Training Stability for Multitask Ranking Models in Recommender Systems	Feb 17, 2023	Recommendation Systems	CodeCode Available	4
T2I-Adapter: Learning Adapters to Dig out More Controllable Ability for Text-to-Image Diffusion Models	Feb 16, 2023	Image GenerationStyle Transfer	CodeCode Available	4
3D-aware Conditional Image Synthesis	Feb 16, 2023	Image Generation	CodeCode Available	4
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes	Feb 13, 2023	Information RetrievalRetrieval	CodeCode Available	4
An Extended Sequence Tagging Vocabulary for Grammatical Error Correction	Feb 12, 2023	Grammatical Error CorrectionMorphological Inflection	CodeCode Available	4
Multimodal Chain-of-Thought Reasoning in Language Models	Feb 2, 2023	HallucinationLanguage Modelling	CodeCode Available	4
Improving and generalizing flow-based generative models with minibatch optimal transport	Feb 1, 2023		CodeCode Available	4
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video	Feb 1, 2023	Action ClassificationImage Classification	CodeCode Available	4