Instruction Following

Instruction following is a core capability of large language models. This task evaluates how well a model follows human instructions, with the goal of producing controllable and safe responses.

Papers

Showing 401–450 of 1135 papers

Title	Date	Tasks	Status	Hype	Score
RMM: A Recursive Mental Model for Dialogue Navigation	Nov 1, 2020	Answer Generation, Instruction Following	Code Available	1	5
InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4	Aug 23, 2023	Instruction Following, Question Answering	Code Available	1	5
Instruction-Guided Visual Masking	May 30, 2024	Instruction Following, Visual Grounding	Code Available	1	5
Can Language Models Follow Multiple Turns of Entangled Instructions?	Mar 17, 2025	Instruction Following, Memorization	Code Available	1	5
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environments	Jul 26, 2024	Instruction Following	Code Available	1	5
CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection	Oct 10, 2024	Instruction Following	Code Available	1	5
RealWebAssist: A Benchmark for Long-Horizon Web Assistance with Real-World Users	Apr 14, 2025	Instruction Following	Code Available	1	5
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing	Jul 18, 2024	Instruction Following, Language Modeling	Code Available	1	5
Instruction-Following Agents with Multimodal Transformer	Oct 24, 2022	Instruction Following, Visual Grounding	Code Available	1	5
Instruction Position Matters in Sequence Generation with Large Language Models	Aug 23, 2023	Instruction Following, Position	Code Available	1	5
RecGPT: Generative Pre-training for Text-based Recommendation	May 21, 2024	Instruction Following, Language Modeling	Code Available	1	5
Instruct and Extract: Instruction Tuning for On-Demand Information Extraction	Oct 24, 2023	Instruction Following	Code Available	1	5
Answer is All You Need: Instruction-following Text Embedding via Answering the Question	Feb 15, 2024	abstractive question answering, All	Code Available	1	5
Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction	Apr 22, 2025	Diversity, Domain Adaptation	Code Available	1	5
Inferring Rewards from Language in Context	Apr 5, 2022	Instruction Following, Reinforcement Learning (RL)	Code Available	1	5
InfMLLM: A Unified Framework for Visual-Language Tasks	Nov 12, 2023	GPU, Image Captioning	Code Available	1	5
An In-depth Look at Gemini's Language Abilities	Dec 18, 2023	Instruction Following, Math	Code Available	1	5
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models	Jun 2, 2025	Instruction Following, Reinforcement Learning (RL)	Code Available	1	5
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization	Aug 14, 2024	Informativeness, Instruction Following	Code Available	1	5
Creative Agents: Empowering Agents with Imagination for Creative Tasks	Dec 5, 2023	Instruction Following, Language Modelling	Code Available	1	5
Don't Reinvent the Wheel: Efficient Instruction-Following Text Embedding based on Guided Space Transformation	May 30, 2025	Instruction Following	Code Available	1	5
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning	Feb 9, 2024	Instruction Following, Language Modeling	Code Available	1	5
Improving Translation Faithfulness of Large Language Models via Augmenting Instructions	Aug 24, 2023	Instruction Following, Machine Translation	Code Available	1	5
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following	Feb 28, 2023	Instruction Following, Zero-shot Generalization	Code Available	1	5
BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues	Oct 20, 2023	Instruction Following	Code Available	1	5
Zero-Shot Compositional Policy Learning via Language Grounding	Apr 15, 2020	Descriptive, Domain Adaptation	Code Available	1	5
Do LLMs "know" internally when they follow instructions?	Oct 18, 2024	Instruction Following, Prompt Engineering	Code Available	1	5
A Dual-Space Framework for General Knowledge Distillation of Large Language Models	Apr 15, 2025	Code Generation, General Knowledge	Code Available	1	5
PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation	Dec 4, 2024	Instruction Following	Code Available	1	5
Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design	Oct 22, 2023	Computational chemistry, Instruction Following	Code Available	1	5
PromptKD: Distilling Student-Friendly Knowledge for Generative Language Models via Prompt Tuning	Feb 20, 2024	Instruction Following, Knowledge Distillation	Code Available	1	5
An Emulator for Fine-Tuning Large Language Models using Small Language Models	Oct 19, 2023	Instruction Following	Code Available	1	5
Language-Conditioned Reinforcement Learning to Solve Misunderstandings with Action Corrections	Nov 18, 2022	Instruction Following, reinforcement-learning	Code Available	1	5
Do Emergent Abilities Exist in Quantized Large Language Models: An Empirical Study	Jul 16, 2023	In-Context Learning, Instruction Following	Code Available	1	5
Making Large Language Models Better Data Creators	Oct 31, 2023	Instruction Following, Prompt Engineering	Code Available	1	5
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning	Oct 23, 2024	Image Captioning, Instruction Following	Code Available	1	5
NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models	Mar 4, 2024	Instruction Following	Code Available	1	5
Diversify and Conquer: Diversity-Centric Data Selection with Iterative Refinement	Sep 17, 2024	Active Learning, Diversity	Code Available	1	5
IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis	May 23, 2025	Instruction Following	Code Available	1	5
Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions	Nov 1, 2023	Few-Shot NLI, Instruction Following	Code Available	1	5
IHEval: Evaluating Language Models on Following the Instruction Hierarchy	Feb 12, 2025	Instruction Following	Code Available	1	5
Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models	Feb 16, 2024	Diversity, Instruction Following	Code Available	1	5
Infer Human's Intentions Before Following Natural Language Instructions	Sep 26, 2024	Instruction Following	Code Available	1	5
TOAST: Transfer Learning via Attention Steering	May 24, 2023	Fine-Grained Image Classification, Instruction Following	Code Available	1	5
Scaling Reasoning, Losing Control: Evaluating Instruction Following in Large Reasoning Models	May 20, 2025	Instruction Following, Mathematical Reasoning	Code Available	1	5
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection	Apr 3, 2025	Instruction Following, Language Modeling	Code Available	1	5
Playpen: An Environment for Exploring Learning Through Conversational Interaction	Apr 11, 2025	Instruction Following, Large Language Model	Code Available	0	5
Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models	Oct 31, 2024	Instruction Following, Reranking	Code Available	0	5
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction	May 22, 2024	Instruction Following	Code Available	0	5
Analysis of Language Change in Collaborative Instruction Following	Sep 9, 2021	Instruction Following	Code Available	0	5

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified
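
The page does not define "Inst-level loose-accuracy". The sketch below assumes the definition commonly used in IFEval-style evaluation: every individual instruction across all prompts counts once, and an instruction counts as followed if any relaxed variant of the response (first line dropped, last line dropped, markdown asterisks removed) passes its check. The names `loose_variants`, `followed_loosely`, and `inst_level_loose_accuracy`, and the two example checkers, are hypothetical and used only for illustration.

```python
from typing import Callable, List, Tuple

# A checker returns True if a response satisfies one verifiable instruction.
Checker = Callable[[str], bool]


def loose_variants(response: str) -> List[str]:
    """Relaxed versions of a response (assumed relaxation set, see lead-in)."""
    lines = response.split("\n")
    base = [
        response,
        "\n".join(lines[1:]).strip(),    # drop first line (e.g. "Sure, here it is:")
        "\n".join(lines[:-1]).strip(),   # drop last line (e.g. a closing remark)
        "\n".join(lines[1:-1]).strip(),  # drop both
    ]
    # Also try every variant with markdown asterisks removed.
    return base + [v.replace("*", "") for v in base]


def followed_loosely(response: str, check: Checker) -> bool:
    """True if any non-empty relaxed variant passes the checker."""
    return any(v.strip() and check(v) for v in loose_variants(response))


def inst_level_loose_accuracy(samples: List[Tuple[str, List[Checker]]]) -> float:
    """Percentage of individual instructions followed, pooled over all prompts."""
    total = sum(len(checks) for _, checks in samples)
    if total == 0:
        return 0.0
    passed = sum(
        followed_loosely(resp, check) for resp, checks in samples for check in checks
    )
    return 100.0 * passed / total


# Hypothetical checkers for two simple verifiable instructions.
def is_all_caps(text: str) -> bool:
    return text == text.upper()


def under_five_words(text: str) -> bool:
    return len(text.split()) < 5


samples = [
    ("Sure, here it is:\nHELLO WORLD", [is_all_caps, under_five_words]),
    ("hello", [is_all_caps]),
]
accuracy = inst_level_loose_accuracy(samples)
print(f"{accuracy:.2f}")
```

In this toy input the first response fails both checks as written but passes them once its first line is dropped, so two of the three instructions count as followed. A strict variant of the metric would check only the unmodified response.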