Instruction Following

Instruction following is a core capability of large language models. This task evaluates how well a model follows human instructions, with the goal of producing controllable and safe responses.

Papers

Showing 851–900 of 1135 papers

Title	Date	Tasks	Status
Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags	Jun 16, 2024	Image to text, Instruction Following	Unverified
DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding	Jun 13, 2024	Instruction Following, Language Modeling	Unverified
Comparison Visual Instruction Tuning	Jun 13, 2024	Instruction Following, Novelty Detection	Unverified
Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models	Jun 12, 2024	Instruction Following, Safety Alignment	Unverified
FaceGPT: Self-supervised Learning to Chat about 3D Human Faces	Jun 11, 2024	3D Face Reconstruction, Face Model	Unverified
CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation	Jun 11, 2024	Instruction Following	Code Available
3D-Properties: Identifying Challenges in DPO and Charting a Path Forward	Jun 11, 2024	Instruction Following, Mathematical Problem-Solving	Unverified
OPTune: Efficient Online Preference Tuning	Jun 11, 2024	Instruction Following	Unverified
Attend and Enrich: Enhanced Visual Prompt for Zero-Shot Learning	Jun 5, 2024	Attribute, Domain Generalization	Unverified
Synthetic Programming Elicitation for Text-to-Code in Very Low-Resource Programming and Formal Languages	Jun 5, 2024	Instruction Following, Retrieval	Code Available
Adversarial Moment-Matching Distillation of Large Language Models	Jun 5, 2024	Imitation Learning, Instruction Following	Code Available
Scalable Ensembling For Mitigating Reward Overoptimisation	Jun 3, 2024	Instruction Following, Language Modeling	Unverified
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models	Jun 1, 2024	Fairness, Instruction Following	Unverified
Phased Instruction Fine-Tuning for Large Language Models	Jun 1, 2024	Instruction Following	Code Available
clembench-2024: A Challenging, Dynamic, Complementary, Multilingual Benchmark and Underlying Flexible Framework for LLMs as Multi-Action Agents	May 31, 2024	Instruction Following	Unverified
Improving Reward Models with Synthetic Critiques	May 31, 2024	Instruction Following	Unverified
Joint Embeddings for Graph Instruction Tuning	May 31, 2024	Instruction Following, visual instruction following	Unverified
TS-Align: A Teacher-Student Collaborative Framework for Scalable Iterative Finetuning of Large Language Models	May 30, 2024	Instruction Following	Unverified
Nadine: An LLM-driven Intelligent Social Robot with Affective Capabilities and Human-like Memory	May 30, 2024	Instruction Following	Unverified
InstructionCP: A fast approach to transfer Large Language Models into target language	May 30, 2024	Instruction Following	Unverified
SAM-E: Leveraging Visual Foundation Model with Sequence Imitation for Embodied Manipulation	May 30, 2024	Instruction Following, parameter-efficient fine-tuning	Unverified
X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions	May 30, 2024	Instruction Following, Language Modeling	Code Available
BLSP-KD: Bootstrapping Language-Speech Pre-training via Knowledge Distillation	May 29, 2024	Instruction Following, Knowledge Distillation	Unverified
X-VILA: Cross-Modality Alignment for Large Language Model	May 29, 2024	Instruction Following, Language Modeling	Unverified
Empowering Source-Free Domain Adaptation with MLLM-driven Curriculum Learning	May 28, 2024	Domain Adaptation, Instruction Following	Code Available
Self-Corrected Multimodal Large Language Model for End-to-End Robot Manipulation	May 27, 2024	Instruction Following, Language Modeling	Unverified
RE-Adapt: Reverse Engineered Adaptation of Large Language Models	May 23, 2024	Instruction Following, Retrieval	Unverified
From Role-Play to Drama-Interaction: An LLM Solution	May 23, 2024	Instruction Following	Unverified
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction	May 22, 2024	Instruction Following	Code Available
Distilling Instruction-following Abilities of Large Language Models with Task-aware Curriculum Planning	May 22, 2024	Code Generation, Instruction Following	Unverified
Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning	May 16, 2024	Decision Making, Instruction Following	Unverified
A safety realignment framework via subspace-oriented model fusion for large language models	May 15, 2024	Instruction Following, Math	Code Available
SpeechGuard: Exploring the Adversarial Robustness of Multimodal Large Language Models	May 14, 2024	Adversarial Robustness, Instruction Following	Unverified
SpeechVerse: A Large-scale Generalizable Audio Language Model	May 14, 2024	Automatic Speech Recognition, Benchmarking	Unverified
Improving Instruction Following in Language Models through Proxy-Based Uncertainty Estimation	May 10, 2024	Instruction Following, Language Modeling	Code Available
Zero-shot LLM-guided Counterfactual Generation: A Case Study on NLP Model Evaluation	May 8, 2024	counterfactual, Instruction Following	Code Available
Long Context Alignment with Short Instructions and Synthesized Positions	May 7, 2024	16k, Instruction Following	Unverified
Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment	May 6, 2024	Arithmetic Reasoning, Code Generation	Unverified
LLM as Dataset Analyst: Subpopulation Structure Discovery with Large Language Model	May 3, 2024	Image Captioning, Instruction Following	Code Available
WildChat: 1M ChatGPT Interaction Logs in the Wild	May 2, 2024	Chatbot, Instruction Following	Unverified
LLM-AD: Large Language Model based Audio Description System	May 2, 2024	Instruction Following, Language Modeling	Unverified
FLAME: Factuality-Aware Alignment for Large Language Models	May 2, 2024	Hallucination, Instruction Following	Unverified
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation	Apr 30, 2024	Caption Generation, Hallucination	Unverified
HELPER-X: A Unified Instructable Embodied Agent to Tackle Four Interactive Vision-Language Domains with Memory-Augmented Language Models	Apr 29, 2024	Instruction Following	Unverified
From Persona to Personalization: A Survey on Role-Playing Language Agents	Apr 28, 2024	In-Context Learning, Instruction Following	Unverified
URL: Universal Referential Knowledge Linking via Task-instructed Representation Compression	Apr 24, 2024	Information Retrieval, Instruction Following	Code Available
Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models	Apr 23, 2024	Instruction Following	Unverified
Socratic Planner: Self-QA-Based Zero-Shot Planning for Embodied Instruction Following	Apr 21, 2024	In-Context Learning, Instruction Following	Unverified
Look Before You Decide: Prompting Active Deduction of MLLMs for Assumptive Reasoning	Apr 19, 2024	Benchmarking, counterfactual	Unverified
Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V	Apr 16, 2024	Instruction Following, Multimodal Reasoning	Unverified

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified
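
The table does not define its metric. Assuming "Inst-level loose-accuracy" follows the convention used by the IFEval benchmark, it is the percentage of individual instructions (not whole prompts) that a response satisfies, where "loose" means an instruction counts as followed if any lightly relaxed variant of the response passes its check. The sketch below illustrates that reading only; the function names, the exact set of relaxations, and the toy checkers are hypothetical and are not taken from this page.

```python
from typing import Callable, List, Tuple

# A checker is a hypothetical stand-in for one verifiable instruction,
# e.g. "answer in lowercase" or "use at most two words".
Checker = Callable[[str], bool]


def loose_variants(response: str) -> List[str]:
    """Return relaxed variants of a response (assumed relaxations):
    drop the first line, drop the last line, drop both, and for each of
    those also strip markdown asterisks."""
    lines = response.split("\n")
    bases = [
        response,
        "\n".join(lines[1:]),
        "\n".join(lines[:-1]),
        "\n".join(lines[1:-1]),
    ]
    variants = []
    for base in bases:
        variants.append(base)
        variants.append(base.replace("*", ""))
    # Ignore variants that became empty after trimming lines.
    return [v for v in variants if v.strip()]


def followed_loose(response: str, checker: Checker) -> bool:
    """An instruction counts as followed if any relaxed variant passes."""
    return any(checker(v) for v in loose_variants(response))


def inst_level_loose_accuracy(examples: List[Tuple[str, List[Checker]]]) -> float:
    """Percentage of individual instructions followed, pooled over all prompts."""
    total = 0
    followed = 0
    for response, checkers in examples:
        for checker in checkers:
            total += 1
            followed += int(followed_loose(response, checker))
    return 100.0 * followed / total if total else 0.0


if __name__ == "__main__":
    examples = [
        # Two instructions; both pass once the preamble line is dropped.
        ("Sure, here it is:\nhello world",
         [lambda r: r == r.lower(), lambda r: len(r.split()) <= 2]),
        # Passes once markdown asterisks are stripped.
        ("**DONE**", [lambda r: r.startswith("DONE")]),
        # Fails under every variant.
        ("HELLO", [lambda r: r.islower()]),
    ]
    print(inst_level_loose_accuracy(examples))
```

Under this reading, a score such as 85.37 would mean that roughly 85 of every 100 individual instructions were judged as followed under the relaxed check; a prompt-level score would instead require every instruction in a prompt to pass.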