SOTAVerified

Instruction Following

Instruction following is a fundamental capability of large language models. This task evaluates how well a model follows human instructions, with the goal of generating controllable and safe responses.

Papers

Showing 1076–1100 of 1135 papers

Title | Status | Hype
Improving the Robustness to Variations of Objects and Instructions with a Neuro-Symbolic Approach for Interactive Instruction Following | — | 0
FILM: Following Instructions in Language with Modular Methods | Code | 1
Waypoint Models for Instruction-guided Navigation in Continuous Environments | Code | 1
Hierarchical Modular Framework for Long Horizon Instruction Following | Code | 0
Procedures as Programs: Hierarchical Control of Situated Agents through Natural Language | — | 0
Analysis of Language Change in Collaborative Instruction Following | Code | 0
Modular Framework for Visuomotor Language Grounding | — | 0
Lexicon Learning for Few Shot Sequence Modeling | Code | 1
Improving Coherence and Consistency in Neural Sequence Models with Dual-System, Neuro-Symbolic Reasoning | — | 0
Draw Me a Flower: Processing and Grounding Abstraction in Natural Language | — | 0
Room-and-Object Aware Knowledge Reasoning for Remote Embodied Referring Expression | Code | 1
Lexicon Learning for Few-Shot Neural Sequence Modeling | Code | 1
Zero-shot Task Adaptation using Natural Language | — | 0
Generalization in Instruction Following Systems | — | 0
Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks | Code | 0
PanGEA: The Panoramic Graph Environment Annotation Toolkit | — | 0
A modular vision language navigation and manipulation framework for long horizon compositional tasks in indoor environment | Code | 1
Are We There Yet? Learning to Localize in Embodied Instruction Following | — | 0
Factorizing Perception and Policy for Interactive Instruction Following | Code | 1
Spatial Language Understanding for Object Search in Partially Observed City-scale Environments | Code | 0
From “Before” to “After”: Generating Natural Language Instructions from Image Pairs in a Simple Visual Domain | — | 0
Few-shot Object Grounding and Mapping for Natural Language Robot Instruction Following | Code | 1
RMM: A Recursive Mental Model for Dialogue Navigation | Code | 1
Modular Networks for Compositional Instruction Following | — | 0
Learning to Recombine and Resample Data for Compositional Generalization | Code | 0
Page 44 of 46

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | AutoIF (Llama3 70B) | Inst-level loose-accuracy | 90.4 | — | Unverified
2 | AutoIF (Qwen2 72B) | Inst-level loose-accuracy | 88 | — | Unverified
3 | GPT-4 | Inst-level loose-accuracy | 85.37 | — | Unverified
4 | PaLM 2 S | Inst-level loose-accuracy | 59.11 | — | Unverified
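The "Inst-level" accuracy reported above scores each verifiable instruction in a prompt individually, rather than requiring a whole response to pass. As a minimal sketch of that idea (the checker functions and data layout here are hypothetical illustrations, not the benchmark's actual API):

```python
# Instruction-level accuracy sketch: a response may carry several
# verifiable instructions; every instruction is scored pass/fail
# separately, and accuracy is the fraction of passing instructions.
# Both checkers below are hypothetical stand-ins for real verifiers.

def contains_keyword(response: str, keyword: str) -> bool:
    """Check that the response mentions a required keyword."""
    return keyword.lower() in response.lower()

def max_words(response: str, limit: int) -> bool:
    """Check that the response stays within a word budget."""
    return len(response.split()) <= limit

def inst_level_accuracy(examples) -> float:
    """examples: list of (response, [checker callables]) pairs."""
    results = [check(resp) for resp, checks in examples for check in checks]
    return sum(results) / len(results) if results else 0.0

examples = [
    # Two instructions: mention "paris" AND stay under 10 words.
    ("The answer is Paris.", [lambda r: contains_keyword(r, "paris"),
                              lambda r: max_words(r, 10)]),
    # One instruction: stay under 50 words (this response fails it).
    ("A very long rambling reply " * 20, [lambda r: max_words(r, 50)]),
]
print(f"{inst_level_accuracy(examples):.2f}")  # 2 of 3 instructions pass
```

The "loose" variant used in the table typically applies more permissive response normalization before running the checks; the scoring loop itself is unchanged.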