Instruction Following

Instruction following is a core capability of large language models. This task evaluates how well a model follows human instructions, with the aim that its answers are controllable and safe.

Papers

Showing 951–1000 of 1135 papers

Title	Date	Tasks	Status	Hype
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions	Aug 8, 2023	Caption Generation, Image Captioning	Code Available	2
Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue	Aug 7, 2023	Instruction Following, Language Modeling	Code Available	2
AgentBench: Evaluating LLMs as Agents	Aug 7, 2023	Decision Making, Instruction Following	Code Available	4
Toward Zero-Shot Instruction Following	Aug 4, 2023	Instruction Following	Code Available	0
Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering	Jul 31, 2023	Instruction Following, Question Answering	Code Available	1
ETHER: Aligning Emergent Communication for Hindsight Experience Replay	Jul 28, 2023	Inductive Bias, Instruction Following	Unverified	0
L-Eval: Instituting Standardized Evaluation for Long Context Language Models	Jul 20, 2023	Instruction Following	Code Available	6
Instruction-following Evaluation through Verbalizer Manipulation	Jul 20, 2023	Instruction Following	Unverified	0
FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets	Jul 20, 2023	Instruction Following, Language Model Evaluation	Code Available	2
LLM Censorship: A Machine Learning Challenge or a Computer Security Problem?	Jul 20, 2023	Computer Security, Instruction Following	Unverified	0
ChatSpot: Bootstrapping Multimodal LLMs via Precise Referring Instruction Tuning	Jul 18, 2023	Instruction Following, Language Modeling	Unverified	0
AlpaGasus: Training A Better Alpaca with Fewer Data	Jul 17, 2023	Instruction Following	Code Available	1
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs	Jul 17, 2023	Instruction Following, Sentence	Code Available	2
Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study	Jul 16, 2023	In-Context Learning, Instruction Following	Code Available	1
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study	Jul 13, 2023	Automatic Speech Recognition, Automatic Speech Recognition (ASR)	Unverified	0
MMBench: Is Your Multi-modal Model an All-around Player?	Jul 12, 2023	All, Instruction Following	Code Available	5
Instruction Mining: Instruction Data Selection for Tuning Large Language Models	Jul 12, 2023	Instruction Following, Language Modeling	Unverified	0
Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators	Jul 8, 2023	Fairness, Instruction Following	Code Available	1
Becoming self-instruct: introducing early stopping criteria for minimal instruct tuning	Jul 5, 2023	Instruction Following	Unverified	0
What Matters in Training a GPT4-Style Language Model with Multimodal Inputs?	Jul 5, 2023	Instruction Following, Language Modeling	Code Available	2
Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models	Jul 3, 2023	Form, Instruction Following	Code Available	1
Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control	Jun 30, 2023	Instruction Following	Unverified	0
KITE: Keypoint-Conditioned Policies for Semantic Manipulation	Jun 29, 2023	Instruction Following, Object	Unverified	0
LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding	Jun 29, 2023	16k, Image Captioning	Code Available	2
On the Exploitability of Instruction Tuning	Jun 28, 2023	Data Poisoning, Instruction Following	Code Available	1
OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue	Jun 21, 2023	Instruction Following, Language Modeling	Code Available	1
BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models	Jun 19, 2023	Instruction Following, Text Generation	Code Available	2
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation	Jun 17, 2023	Decision Making, Instruction Following	Unverified	0
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models	Jun 15, 2023	Hallucination, Image Captioning	Code Available	2
MiniLLM: Knowledge Distillation of Large Language Models	Jun 14, 2023	Instruction Following, Knowledge Distillation	Code Available	2
Valley: Video Assistant with Large Language model Enhanced abilitY	Jun 12, 2023	Action Recognition, Instruction Following	Code Available	2
How Can Recommender Systems Benefit from Large Language Models: A Survey	Jun 9, 2023	Ethics, Feature Engineering	Code Available	3
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources	Jun 7, 2023	Instruction Following	Code Available	4
"Are you telling me to put glasses on the dog?" Content-Grounded Annotation of Instruction Clarification Requests in the CoDraw Dataset	Jun 4, 2023	Instruction Following	Unverified	0
LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day	Jun 1, 2023	Image Classification, Instruction Following	Code Available	4
STEVE-1: A Generative Model for Text-to-Behavior in Minecraft	Jun 1, 2023	Decision Making, Image Generation	Code Available	2
From Pixels to UI Actions: Learning to Follow Instructions via Graphical User Interfaces	May 31, 2023	Instruction Following	Code Available	1
GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction	May 30, 2023	Image Generation, Instruction Following	Code Available	2
Controllable Text-to-Image Generation with GPT-4	May 29, 2023	Image Generation, Instruction Following	Unverified	0
A Reminder of its Brittleness: Language Reward Shaping May Hinder Learning for Instruction Following Agents	May 26, 2023	Instruction Following, Reinforcement Learning (RL)	Code Available	0
NavGPT: Explicit Reasoning in Vision-and-Language Navigation with Large Language Models	May 26, 2023	Instruction Following, Vision and Language Navigation	Code Available	2
PandaGPT: One Model To Instruction-Follow Them All	May 25, 2023	All, Image Description	Code Available	2
TOAST: Transfer Learning via Attention Steering	May 24, 2023	Fine-Grained Image Classification, Instruction Following	Code Available	1
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology	May 24, 2023	Diagnostic, Instruction Following	Code Available	1
PIVOINE: Instruction Tuning for Open-world Information Extraction	May 24, 2023	Instruction Following, Language Modeling	Code Available	1
ExpertPrompting: Instructing Large Language Models to be Distinguished Experts	May 24, 2023	In-Context Learning, Instruction Following	Code Available	2
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation	May 24, 2023	Instruction Following	Code Available	1
SAIL: Search-Augmented Instruction Learning	May 24, 2023	Denoising, Fact Checking	Unverified	0
A Monte Carlo Language Model Pipeline for Zero-Shot Sociopolitical Event Extraction	May 24, 2023	Computational Efficiency, Event Extraction	Unverified	0
QLoRA: Efficient Finetuning of Quantized LLMs	May 23, 2023	Chatbot, GPU	Code Available	6

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified