Instruction Following

Instruction following is the basic task of the model. This task is dedicated to evaluating the ability of the large model to follow human instructions. It is hoped that the model can generate controllable and safe answers.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1091–1100 of 1135 papers

Title	Date	Tasks	Status	Hype
PanGEA: The Panoramic Graph Environment Annotation Toolkit	Mar 23, 2021	Instruction Following	—Unverified	0
A modular vision language navigation and manipulation framework for long horizon compositional tasks in indoor environment	Jan 19, 2021	Instruction FollowingVision-Language Navigation	CodeCode Available	1
Are We There Yet? Learning to Localize in Embodied Instruction Following	Jan 9, 2021	Instruction Followingobject-detection	—Unverified	0
Factorizing Perception and Policy for Interactive Instruction Following	Dec 6, 2020	Instruction FollowingNavigate	CodeCode Available	1
Spatial Language Understanding for Object Search in Partially Observed City-scale Environments	Dec 4, 2020	Decision MakingInstruction Following	CodeCode Available	0
From “Before” to “After”: Generating Natural Language Instructions from Image Pairs in a Simple Visual Domain	Dec 1, 2020	Image CaptioningInstruction Following	—Unverified	0
Few-shot Object Grounding and Mapping for Natural Language Robot Instruction Following	Nov 14, 2020	continuous-controlContinuous Control	CodeCode Available	1
RMM: A Recursive Mental Model for Dialogue Navigation	Nov 1, 2020	Answer GenerationInstruction Following	CodeCode Available	1
Modular Networks for Compositional Instruction Following	Oct 24, 2020	Instruction Following	—Unverified	0
Learning to Recombine and Resample Data for Compositional Generalization	Oct 8, 2020	Data AugmentationInstruction Following	CodeCode Available	0

Show:10 25 50

← PrevPage 110 of 114Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	AutoIF (Llama3 70B)	Inst-level loose-accuracy	90.4	—	Unverified
2	AutoIF (Qwen2 72B)	Inst-level loose-accuracy	88	—	Unverified
3	GPT-4	Inst-level loose-accuracy	85.37	—	Unverified
4	PaLM 2 S	Inst-level loose-accuracy	59.11	—	Unverified