Instruction Following

Instruction following is the basic task of the model. This task is dedicated to evaluating the ability of the large model to follow human instructions. It is hoped that the model can generate controllable and safe answers.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 651–700 of 1135 papers

Title	Date	Tasks	Status
SLADE: Shielding against Dual Exploits in Large Vision-Language Models	Jan 1, 2025	Contrastive LearningInstruction Following	—Unverified
HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction	Jan 1, 2025	DescriptiveInstruction Following	—Unverified
Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following	Dec 27, 2024	Instruction Following	—Unverified
Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models	Dec 27, 2024	Instruction Following	CodeCode Available
Internalized Self-Correction for Large Language Models	Dec 21, 2024	Instruction Following	—Unverified
LearnLM: Improving Gemini for Learning	Dec 21, 2024	Instruction Following	—Unverified
HREF: Human Response-Guided Evaluation of Instruction Following in Language Models	Dec 20, 2024	Instruction Following	CodeCode Available
Systematic Evaluation of Long-Context LLMs on Financial Concepts	Dec 19, 2024	Instruction Following	—Unverified
Length Controlled Generation for Black-box LLMs	Dec 19, 2024	Abstractive Text SummarizationInstruction Following	—Unverified
Pipeline Analysis for Developing Instruct LLMs in Low-Resource Languages: A Case Study on Basque	Dec 18, 2024	Instruction FollowingNatural Language Understanding	—Unverified
MetaMorph: Multimodal Understanding and Generation via Instruction Tuning	Dec 18, 2024	Instruction FollowingMORPH	—Unverified
A Systematic Examination of Preference Learning through the Lens of Instruction-Following	Dec 18, 2024	Instruction FollowingSynthetic Data Generation	—Unverified
Question: How do Large Language Models perform on the Question Answering tasks? Answer:	Dec 17, 2024	ArticlesInstruction Following	—Unverified
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering	Dec 16, 2024	In-Context LearningInstruction Following	CodeCode Available
Empowering LLMs to Understand and Generate Complex Vector Graphics	Dec 15, 2024	Instruction FollowingVector Graphics	—Unverified
ChipAlign: Instruction Alignment in Large Language Models for Chip Design via Geodesic Interpolation	Dec 15, 2024	Instruction Following	—Unverified
Leveraging Large Vision-Language Model as User Intent-aware Encoder for Composed Image Retrieval	Dec 15, 2024	Image RetrievalInstruction Following	—Unverified
VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation	Dec 13, 2024	Instruction FollowingQuestion Answering	—Unverified
EasyRef: Omni-Generalized Group Image Reference for Diffusion Models via Multimodal LLM	Dec 12, 2024	Image ComprehensionImage Generation	—Unverified
LLaVA-Zip: Adaptive Visual Token Compression with Intrinsic Image Information	Dec 11, 2024	Data AugmentationInstruction Following	—Unverified
SmolTulu: Higher Learning Rate to Batch Size Ratios Can Lead to Better Reasoning in SLMs	Dec 11, 2024	ARCGSM8K	—Unverified
Sloth: scaling laws for LLM skills to predict multi-benchmark performance across families	Dec 9, 2024	Emotional IntelligenceInstruction Following	CodeCode Available
PediaBench: A Comprehensive Chinese Pediatric Dataset for Benchmarking Large Language Models	Dec 9, 2024	BenchmarkingInstruction Following	CodeCode Available
LLMs for Generalizable Language-Conditioned Policy Learning under Minimal Data Requirements	Dec 9, 2024	Decision MakingInstruction Following	—Unverified
GROOT-2: Weakly Supervised Multi-Modal Instruction Following Agents	Dec 7, 2024	Instruction Following	—Unverified
Compositional Image Retrieval via Instruction-Aware Contrastive Learning	Dec 7, 2024	Contrastive LearningImage Retrieval	CodeCode Available
LLM-Align: Utilizing Large Language Models for Entity Alignment in Knowledge Graphs	Dec 6, 2024	Entity AlignmentEntity Embeddings	—Unverified
EXAONE 3.5: Series of Large Language Models for Real-world Use Cases	Dec 6, 2024	Instruction Following	—Unverified
If You Can't Use Them, Recycle Them: Optimizing Merging at Scale Mitigates Performance Tradeoffs	Dec 5, 2024	Code GenerationInstruction Following	—Unverified
VidHalluc: Evaluating Temporal Hallucinations in Multimodal Large Language Models for Video Understanding	Dec 4, 2024	HallucinationInstruction Following	—Unverified
From Words to Workflows: Automating Business Processes	Dec 4, 2024	Decision MakingInstruction Following	—Unverified
Optimizing Latent Goal by Learning from Trajectory Preference	Dec 3, 2024	Continual LearningInstruction Following	—Unverified
T-REG: Preference Optimization with Token-Level Reward Regularization	Dec 3, 2024	Instruction Following	CodeCode Available
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation	Dec 2, 2024	Data IntegrationInstruction Following	—Unverified
AlignFormer: Modality Matching Can Achieve Better Zero-shot Instruction-Following Speech-LLM	Dec 2, 2024	Instruction FollowingQuestion Answering	—Unverified
MiningGPT -- A Domain-Specific Large Language Model for the Mining Industry	Dec 2, 2024	Instruction FollowingLanguage Modeling	—Unverified
VISTA: Enhancing Long-Duration and High-Resolution Video Understanding by Video Spatiotemporal Augmentation	Dec 1, 2024	Instruction FollowingVideo Understanding	—Unverified
InsightEdit: Towards Better Instruction Following for Image Editing	Nov 26, 2024	Instruction Following	—Unverified
Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors	Nov 24, 2024	Depth EstimationInstruction Following	—Unverified
From MTEB to MTOB: Retrieval-Augmented Classification for Descriptive Grammars	Nov 23, 2024	DescriptiveIn-Context Learning	CodeCode Available
Enhancing Instruction-Following Capability of Visual-Language Models by Reducing Image Redundancy	Nov 23, 2024	Instruction FollowingMME	—Unverified
Separable Mixture of Low-Rank Adaptation for Continual Visual Instruction Tuning	Nov 21, 2024	Continual LearningInstruction Following	—Unverified
MpoxVLM: A Vision-Language Model for Diagnosing Skin Lesions from Mpox Virus Infection	Nov 16, 2024	DiagnosticInstruction Following	CodeCode Available
MLAN: Language-Based Instruction Tuning Improves Zero-Shot Generalization of Multimodal Large Language Models	Nov 15, 2024	Instruction FollowingZero-shot Generalization	CodeCode Available
Adaptive Decoding via Latent Preference Optimization	Nov 14, 2024	GSM8KInstruction Following	—Unverified
Zero-shot Object-Centric Instruction Following: Integrating Foundation Models with Traditional Navigation	Nov 12, 2024	Instruction FollowingObject	—Unverified
Stronger Models are NOT Stronger Teachers for Instruction Tuning	Nov 11, 2024	Instruction Following	—Unverified
MrSteve: Instruction-Following Agents in Minecraft with What-Where-When Memory	Nov 11, 2024	Instruction FollowingMinecraft	—Unverified
LIFBench: Evaluating the Instruction Following Performance and Stability of Large Language Models in Long-Context Scenarios	Nov 11, 2024	Instruction Following	CodeCode Available
IOPO: Empowering LLMs with Complex Instruction Following via Input-Output Preference Optimization	Nov 9, 2024	Instruction Following	—Unverified

Show:10 25 50

← PrevPage 14 of 23Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified