Instruction Following

Instruction following is the basic task of the model. This task is dedicated to evaluating the ability of the large model to follow human instructions. It is hoped that the model can generate controllable and safe answers.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–525 of 1135 papers

Title	Date	Tasks	Status
Instruction-Following Pruning for Large Language Models	Jan 3, 2025	Instruction FollowingMath	—Unverified
Instruction-Following Speech Recognition	Sep 18, 2023	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	—Unverified
ICCO: Learning an Instruction-conditioned Coordinator for Language-guided Task-aligned Multi-robot Control	Mar 15, 2025	Instruction FollowingMulti-agent Reinforcement Learning	—Unverified
Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors	Nov 3, 2024	Instruction FollowingRAG	—Unverified
VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought	Jun 20, 2024	Action AnticipationContinual Learning	—Unverified
HyperCLOVA X Technical Report	Apr 2, 2024	Instruction FollowingMachine Translation	—Unverified
Data Diversity Matters for Robust Instruction Tuning	Nov 21, 2023	DiversityInstruction Following	—Unverified
Hypencoder: Hypernetworks for Information Retrieval	Feb 7, 2025	Information RetrievalInstruction Following	—Unverified
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought	May 21, 2025	ChatbotInstruction Following	—Unverified
Human-Instruction-Free LLM Self-Alignment with Limited Samples	Jan 6, 2024	In-Context LearningInstruction Following	—Unverified
DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering	Mar 11, 2025	FormInstruction Following	—Unverified
LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction	Jun 16, 2025	Instruction FollowingVision-Language-Action	—Unverified
LEWIS (LayEr WIse Sparsity) -- A Training Free Guided Model Merging Approach	Mar 5, 2025	Instruction FollowingMath	—Unverified
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text	May 19, 2020	Deep Reinforcement LearningInstruction Following	—Unverified
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction	Jan 1, 2025	DescriptiveInstruction Following	—Unverified
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning	Mar 14, 2025	DiversityInstruction Following	—Unverified
BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation	Feb 3, 2025	DiversityGSM8K	—Unverified
CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment	Oct 25, 2023	In-Context LearningInstruction Following	—Unverified
Learning to Navigate the Web	Dec 21, 2018	Deep Reinforcement LearningInstruction Following	—Unverified
LearnLM: Improving Gemini for Learning	Dec 21, 2024	Instruction Following	—Unverified
How well can LLMs Grade Essays in Arabic?	Jan 27, 2025	Automated Essay ScoringIn-Context Learning	—Unverified
BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues	Jan 18, 2025	Instruction FollowingMinecraft	—Unverified
How Many Instructions Can LLMs Follow at Once?	Jul 15, 2025	Instruction Following	—Unverified
Length Controlled Generation for Black-box LLMs	Dec 19, 2024	Abstractive Text SummarizationInstruction Following	—Unverified
How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment	Jun 17, 2024	In-Context LearningInstruction Following	—Unverified

Show:10 25 50

← PrevPage 21 of 46Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified