The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 659983 papers

Title	Date	Tasks	Status	Hype
LongLLMLingua: Accelerating and Enhancing LLMs in Long Context Scenarios via Prompt Compression	Oct 10, 2023	Code CompletionFew-Shot Learning	CodeCode Available	5
LLMLingua: Compressing Prompts for Accelerated Inference of Large Language Models	Oct 9, 2023	GSM8KIn-Context Learning	CodeCode Available	5
EasyPhoto: Your Smart AI Photo Generator	Oct 7, 2023		CodeCode Available	5
ReLU Strikes Back: Exploiting Activation Sparsity in Large Language Models	Oct 6, 2023		CodeCode Available	5
Efficient Streaming Language Models with Attention Sinks	Sep 29, 2023	Language ModelingLanguage Modelling	CodeCode Available	5
YOLOR-Based Multi-Task Learning	Sep 29, 2023	Image CaptioningInstance Segmentation	CodeCode Available	5
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models	Sep 26, 2023	Quantization	CodeCode Available	5
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention	Sep 25, 2023	Language ModelingLanguage Modelling	CodeCode Available	5
ChatGPT MT: Competitive for High- (but not Low-) Resource Languages	Sep 14, 2023	Machine Translation	CodeCode Available	5
The Rise and Potential of Large Language Model Based Agents: A Survey	Sep 14, 2023	Language ModelingLanguage Modelling	CodeCode Available	5
Agents: An Open-source Framework for Autonomous Language Agents	Sep 14, 2023		CodeCode Available	5
ImageBind-LLM: Multi-modality Instruction Tuning	Sep 7, 2023	Instruction FollowingText Generation	CodeCode Available	5
ProPainter: Improving Propagation and Transformer for Video Inpainting	Sep 7, 2023	Optical Flow EstimationVideo Inpainting	CodeCode Available	5
Data-Juicer: A One-Stop Data Processing System for Large Language Models	Sep 5, 2023	Distributed Computing	CodeCode Available	5
Nougat: Neural Optical Understanding for Academic Documents	Aug 25, 2023	Optical Character RecognitionOptical Character Recognition (OCR)	CodeCode Available	5
Qwen-VL: A Versatile Vision-Language Model for Understanding, Localization, Text Reading, and Beyond	Aug 24, 2023	Chart Question AnsweringFS-MEVQA	CodeCode Available	5
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct	Aug 18, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	5
IP-Adapter: Text Compatible Image Prompt Adapter for Text-to-Image Diffusion Models	Aug 13, 2023	Diffusion Personalization Tuning FreeImage Generation	CodeCode Available	5
ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs	Jul 31, 2023	Trajectory PlanningZero-shot Generalization	CodeCode Available	5
MMBench: Is Your Multi-modal Model an All-around Player?	Jul 12, 2023	AllInstruction Following	CodeCode Available	5
ReLoRA: High-Rank Training Through Low-Rank Updates	Jul 11, 2023	GPU	CodeCode Available	5
Chatlaw: A Multi-Agent Collaborative Legal Assistant with Knowledge Graph Enhanced Mixture-of-Experts Large Language Model	Jun 28, 2023	HallucinationKnowledge Graphs	CodeCode Available	5
Faster Segment Anything: Towards Lightweight SAM for Mobile Applications	Jun 25, 2023	CPUDecoder	CodeCode Available	5
LMFlow: An Extensible Toolkit for Finetuning and Inference of Large Foundation Models	Jun 21, 2023		CodeCode Available	5
Infinite Photorealistic Worlds using Procedural Generation	Jun 15, 2023	3D Reconstructionobject-detection	CodeCode Available	5
WizardCoder: Empowering Code Large Language Models with Evol-Instruct	Jun 14, 2023	Code GenerationHumanEval	CodeCode Available	5
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models	Jun 13, 2023	Speech Synthesistext-to-speech	CodeCode Available	5
Image Vectorization: a Review	Jun 10, 2023	Image GenerationVector Graphics	CodeCode Available	5
XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech	May 31, 2023	text-to-speechText to Speech	CodeCode Available	5
Voyager: An Open-Ended Embodied Agent with Large Language Models	May 25, 2023	Lifelong learningMinecraft	CodeCode Available	5
Know Your Self-supervised Learning: A Survey on Image-based Generative and Discriminative Training	May 23, 2023	Contrastive LearningSelf-Supervised Learning	CodeCode Available	5
Tree of Thoughts: Deliberate Problem Solving with Large Language Models	May 17, 2023	Arithmetic ReasoningDecision Making	CodeCode Available	5
ImageBind: One Embedding Space To Bind Them All	May 9, 2023	AllCross-Modal Retrieval	CodeCode Available	5
StarCoder: may the source be with you!	May 9, 2023	8kCode Generation	CodeCode Available	5
CodeGen2: Lessons for Training LLMs on Programming and Natural Languages	May 3, 2023	Causal Language ModelingDecoder	CodeCode Available	5
LLaMA-Adapter V2: Parameter-Efficient Visual Instruction Model	Apr 28, 2023	Instruction Followingmodel	CodeCode Available	5
WizardLM: Empowering Large Language Models to Follow Complex Instructions	Apr 24, 2023	Instruction Following	CodeCode Available	5
Track Anything: Segment Anything Meets Videos	Apr 24, 2023	Image SegmentationObject Tracking	CodeCode Available	5
Long-term Forecasting with TiDE: Time-series Dense Encoder	Apr 17, 2023	Anomaly DetectionDecoder	CodeCode Available	5
Tool Learning with Foundation Models	Apr 17, 2023		CodeCode Available	5
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation	Apr 16, 2023	Instruction Following	CodeCode Available	5
Inpaint Anything: Segment Anything Meets Image Inpainting	Apr 13, 2023	Image Inpainting	CodeCode Available	5
RAFT: Reward rAnked FineTuning for Generative Foundation Model Alignment	Apr 13, 2023	Ethics	CodeCode Available	5
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond	Apr 11, 2023	Text to 3D	CodeCode Available	5
How to Design Translation Prompts for ChatGPT: An Empirical Study	Apr 5, 2023	Machine TranslationNatural Language Understanding	CodeCode Available	5
Segment Anything	Apr 5, 2023	Event-based Object SegmentationImage Segmentation	CodeCode Available	5
Assessing Language Model Deployment with Risk Cards	Mar 31, 2023	Language ModelingLanguage Modelling	CodeCode Available	5
CodeGeeX: A Pre-Trained Model for Code Generation with Multilingual Benchmarking on HumanEval-X	Mar 30, 2023	BenchmarkingCode Generation	CodeCode Available	5
LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention	Mar 28, 2023	Instruction FollowingLanguage Modelling	CodeCode Available	5
Does `Deep Learning on a Data Diet' reproduce? Overall yes, but GraNd at Initialization does not	Mar 26, 2023		CodeCode Available	5