SOTAVerified

Instruction Following

Instruction following is a fundamental capability of large language models. This task evaluates how well a model follows human instructions, with the aim that its answers are controllable and safe.

Papers

Showing 931–940 of 1135 papers

| Title | Status | Hype |
| --- | --- | --- |
| Diverse and Fine-Grained Instruction-Following Ability Exploration with Synthetic Data | | 0 |
| Gaussian Scenes: Pose-Free Sparse-View Scene Reconstruction using Depth-Enhanced Diffusion Priors | | 0 |
| Gemma 3 Technical Report | | 0 |
| DistilQwen2.5: Industrial Practices of Training Distilled Open Lightweight Language Models | | 0 |
| Distilling Internet-Scale Vision-Language Models into Embodied Agents | | 0 |
| Generalization in Instruction Following Systems | | 0 |
| Can Query Expansion Improve Generalization of Strong Cross-Encoder Rankers? | | 0 |
| Generate Subgoal Images before Act: Unlocking the Chain-of-Thought Reasoning in Diffusion Model for Robot Manipulation with Multimodal Prompts | | 0 |
| Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles | | 0 |
| SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models | | 0 |

Benchmark Results

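| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | AutoIF (Llama3 70B) | Inst-level loose-accuracy | 90.4 | | Unverified |
| 2 | AutoIF (Qwen2 72B) | Inst-level loose-accuracy | 88 | | Unverified |
| 3 | GPT-4 | Inst-level loose-accuracy | 85.37 | | Unverified |
| 4 | PaLM 2 S | Inst-level loose-accuracy | 59.11 | | Unverified |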