Instruction Following

Instruction following is the basic task of the model. This task is dedicated to evaluating the ability of the large model to follow human instructions. It is hoped that the model can generate controllable and safe answers.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 976–1000 of 1135 papers

Title	Date	Tasks	Status	Hype
OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue	Jun 21, 2023	Instruction FollowingLanguage Modeling	CodeCode Available	1
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models	Jun 19, 2023	Instruction FollowingText Generation	CodeCode Available	2
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation	Jun 17, 2023	Decision MakingInstruction Following	—Unverified	0
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models	Jun 15, 2023	HallucinationImage Captioning	CodeCode Available	2
MiniLLM: Knowledge Distillation of Large Language Models	Jun 14, 2023	Instruction FollowingKnowledge Distillation	CodeCode Available	2
Valley: Video Assistant with Large Language model Enhanced abilitY	Jun 12, 2023	Action RecognitionInstruction Following	CodeCode Available	2
How Can Recommender Systems Benefit from Large Language Models: A Survey	Jun 9, 2023	EthicsFeature Engineering	CodeCode Available	3
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources	Jun 7, 2023	Instruction Following	CodeCode Available	4
"Are you telling me to put glasses on the dog?'' Content-Grounded Annotation of Instruction Clarification Requests in the CoDraw Dataset	Jun 4, 2023	Instruction Following	—Unverified	0
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day	Jun 1, 2023	Image ClassificationInstruction Following	CodeCode Available	4
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft	Jun 1, 2023	Decision MakingImage Generation	CodeCode Available	2
From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces	May 31, 2023	Instruction Following	CodeCode Available	1
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction	May 30, 2023	Image GenerationInstruction Following	CodeCode Available	2
Controllable Text-to-Image Generation with GPT-4	May 29, 2023	Image GenerationInstruction Following	—Unverified	0
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents	May 26, 2023	Instruction FollowingReinforcement Learning (RL)	CodeCode Available	0
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models	May 26, 2023	Instruction FollowingVision and Language Navigation	CodeCode Available	2
PandaGPT: One Model To Instruction-Follow Them All	May 25, 2023	AllImage Description	CodeCode Available	2
TOAST: Transfer Learning via Attention Steering	May 24, 2023	Fine-Grained Image ClassificationInstruction Following	CodeCode Available	1
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology	May 24, 2023	DiagnosticInstruction Following	CodeCode Available	1
PIVOINE: Instruction Tuning for Open-world Information Extraction	May 24, 2023	Instruction FollowingLanguage Modeling	CodeCode Available	1
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts	May 24, 2023	In-Context LearningInstruction Following	CodeCode Available	2
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation	May 24, 2023	Instruction Following	CodeCode Available	1
SAIL: Search-Augmented Instruction Learning	May 24, 2023	DenoisingFact Checking	—Unverified	0
A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction	May 24, 2023	Computational EfficiencyEvent Extraction	—Unverified	0
QLoRA: Efficient Finetuning of Quantized LLMs	May 23, 2023	ChatbotGPU	CodeCode Available	6

Show:10 25 50

← PrevPage 40 of 46Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified