Instruction Following

Instruction following is a core capability of large language models. This task evaluates how well a model follows human instructions, with the goal of producing controllable and safe responses.

Papers


Showing 511–520 of 1135 papers

Title	Date	Tasks	Status
Data Diversity Matters for Robust Instruction Tuning	Nov 21, 2023	Diversity, Instruction Following	Unverified
Hypencoder: Hypernetworks for Information Retrieval	Feb 7, 2025	Information Retrieval, Instruction Following	Unverified
Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought	May 21, 2025	Chatbot, Instruction Following	Unverified
Human-Instruction-Free LLM Self-Alignment with Limited Samples	Jan 6, 2024	In-Context Learning, Instruction Following	Unverified
DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering	Mar 11, 2025	Form, Instruction Following	Unverified
Less is More: Generating Grounded Navigation Instructions from Landmarks	Nov 25, 2021	Decoder, Instruction Following	Unverified
LIMIT: Less Is More for Instruction Tuning Across Evaluation Paradigms	Nov 22, 2023	Instruction Following	Unverified
Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text	May 19, 2020	Deep Reinforcement Learning, Instruction Following	Unverified
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction	Jan 1, 2025	Descriptive, Instruction Following	Unverified
D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning	Mar 14, 2025	Diversity, Instruction Following	Unverified


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified