SOTAVerified

Instruction Following

Instruction following is a fundamental capability of a large language model. This task evaluates how well a model follows human instructions, with the goal of producing controllable and safe answers.

Papers

Showing 661-670 of 1135 papers

| Title | Status | Hype |
|---|---|---|
| Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a ChatGPT and Bard Newspaper | | 0 |
| Multi-lingual Functional Evaluation for Large Language Models | | 0 |
| Multilingual Instruction Tuning With Just a Pinch of Multilinguality | | 0 |
| Multilingual Multimodal Software Developer for Code Generation | | 0 |
| Bridging Offline and Online Reinforcement Learning for LLMs | | 0 |
| Multi-Modal Instruction-Tuning Small-Scale Language-and-Vision Assistant for Semiconductor Electron Micrograph Analysis | | 0 |
| Transformer-based Causal Language Models Perform Clustering | | 0 |
| Multimodal Sequential Generative Models for Semi-Supervised Language Instruction Following | | 0 |
| Multimodal Situational Safety | | 0 |
| Multimodal Web Navigation with Instruction-Finetuned Foundation Models | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AutoIF (Llama3 70B) | Inst-level loose-accuracy | 90.4 | | Unverified |
| 2 | AutoIF (Qwen2 72B) | Inst-level loose-accuracy | 88 | | Unverified |
| 3 | GPT-4 | Inst-level loose-accuracy | 85.37 | | Unverified |
| 4 | PaLM 2 S | Inst-level loose-accuracy | 59.11 | | Unverified |