SOTAVerified

Instruction Following

Instruction following is a foundational capability of large language models. This task evaluates how well a model follows human instructions, with the goal of producing controllable and safe responses.

Papers

Showing 851-900 of 1135 papers

Title | Status | Hype
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM | - | 0
DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model | - | 0
Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting | - | 0
Effectively Controlling Reasoning Models through Thinking Intervention | - | 0
Efficient Finetuning Large Language Models For Vietnamese Chatbot | - | 0
Sparse Activation Editing for Reliable Instruction Following in Narratives | - | 0
Efficient Telecom Specific LLM: TSLAM-Mini with QLoRA and Digital Twin Data | - | 0
Eliciting Instruction-tuned Code Language Models' Capabilities to Utilize Auxiliary Function for Code Generation | - | 0
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following | - | 0
Embodied Instruction Following in Unknown Environments | - | 0
Emo-DPO: Controllable Emotional Speech Synthesis through Direct Preference Optimization | - | 0
D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions | - | 0
Empirical Analysis of Large Vision-Language Models against Goal Hijacking via Visual Prompt Injection | - | 0
Empowering LLMs to Understand and Generate Complex Vector Graphics | - | 0
Draw Me a Flower: Processing and Grounding Abstraction in Natural Language | - | 0
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment | - | 0
VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models | - | 0
Enhancing Complex Instruction Following for Large Language Models with Mixture-of-Contexts Fine-tuning | - | 0
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation | - | 0
Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy | - | 0
Enhancing Low-Resource Language and Instruction Following Capabilities of Audio Language Models | - | 0
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling | - | 0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | - | 0
ETHER: Aligning Emergent Communication for Hindsight Experience Replay | - | 0
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models | - | 0
Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking | - | 0
SpeechVerse: A Large-scale Generalizable Audio Language Model | - | 0
Evaluating Robustness of Large Audio Language Models to Audio Injection: An Empirical Study | - | 0
Evaluating the Robustness to Instructions of Large Language Models | - | 0
Evaluation of Instruction-Following Ability for Large Language Models on Story-Ending Generation | - | 0
Evolutionary Contrastive Distillation for Language Model Alignment | - | 0
Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation | - | 0
SSP: A Simple and Safe automatic Prompt engineering method towards realistic image synthesis on LVM | - | 0
EXAONE 3.0 7.8B Instruction Tuned Language Model | - | 0
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases | - | 0
Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding | - | 0
Do we Really Need Visual Instructions? Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models | - | 0
Explicit Object Relation Alignment for Vision and Language Navigation | - | 0
Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks | - | 0
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study | - | 0
Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning | - | 0
FaceGPT: Self-supervised Learning to Chat about 3D Human Faces | - | 0
Stay on the Path: Instruction Fidelity in Vision-and-Language Navigation | - | 0
FactLLaMA: Optimizing Instruction-Following Language Models with External Knowledge for Automated Fact-Checking | - | 0
AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM | - | 0
Ferret-UI: Grounded Mobile UI Understanding with Multimodal LLMs | - | 0
Zero-shot cross-lingual transfer in instruction tuning of large language models | - | 0
Few-shot Dialogue Strategy Learning for Motivational Interviewing via Inductive Reasoning | - | 0
StressPrompt: Does Stress Impact Large Language Models and Human Performance Similarly? | - | 0
Stronger Models are NOT Stronger Teachers for Instruction Tuning | - | 0
Page 18 of 23

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | AutoIF (Llama3 70B) | Inst-level loose-accuracy | 90.4 | - | Unverified
2 | AutoIF (Qwen2 72B) | Inst-level loose-accuracy | 88 | - | Unverified
3 | GPT-4 | Inst-level loose-accuracy | 85.37 | - | Unverified
4 | PaLM 2 S | Inst-level loose-accuracy | 59.11 | - | Unverified
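The metric in the table, instruction-level loose accuracy, comes from the IFEval family of benchmarks: each prompt carries one or more verifiable instructions, every instruction counts once across the whole set, and "loose" means a response passes an instruction if any relaxed variant of the response (e.g., with markdown emphasis stripped, or with a leading/trailing boilerplate line removed) satisfies it. The sketch below is a minimal illustration of that scoring scheme, not the official implementation; the exact set of relaxations and the checker functions are assumptions for demonstration.

```python
def loose_pass(response: str, check) -> bool:
    """A response passes 'loosely' if any relaxed variant satisfies the checker.
    The relaxations here (strip asterisks, drop the first or last line) follow
    the spirit of loose evaluation; official variants may differ."""
    lines = response.splitlines()
    variants = [
        response,
        response.replace("*", ""),   # remove markdown emphasis markers
        "\n".join(lines[1:]),        # drop a possible leading boilerplate line
        "\n".join(lines[:-1]),       # drop a possible trailing boilerplate line
    ]
    return any(check(v) for v in variants)

def inst_level_loose_accuracy(examples) -> float:
    """examples: list of (response, [checker, ...]) pairs.
    Instruction-level: every individual instruction counts once,
    pooled across all prompts."""
    passed = total = 0
    for response, checkers in examples:
        for check in checkers:
            total += 1
            passed += loose_pass(response, check)
    return passed / total if total else 0.0
```

A checker is any boolean predicate on the response text, e.g. `lambda r: len(r.split()) >= 50` for a minimum-length instruction, so a response wrapped in `*...*` can still pass an exact-match check once the emphasis markers are stripped.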