Evaluating Large Language Models at Evaluating Instruction Following (Oct 11, 2023). Tags: Instruction Following
INSTRUCTIR: A Benchmark for Instruction Following of Information Retrieval Models (Feb 22, 2024). Tags: Information Retrieval, Instruction Following
AGENTIF: Benchmarking Instruction Following of Large Language Models in Agentic Scenarios (May 22, 2025). Tags: Benchmarking, Instruction Following
Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering (Jul 31, 2023). Tags: Instruction Following, Question Answering
Instruction-Guided Visual Masking (May 30, 2024). Tags: Instruction Following, Visual Grounding
Instruction-Following Agents with Multimodal Transformer (Oct 24, 2022). Tags: Instruction Following, Visual Grounding
ChartInstruct: Instruction Tuning for Chart Comprehension and Reasoning (Mar 14, 2024). Tags: Chart Understanding, Instruction Following
DocLens: Multi-aspect Fine-grained Evaluation for Medical Text Generation (Nov 16, 2023). Tags: Decision Making, Instruction Following
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback (Apr 5, 2023). Tags: Instruction Following, Machine Translation
PathAsst: A Generative Foundation AI Assistant Towards Artificial General Intelligence of Pathology (May 24, 2023). Tags: Diagnostic, Instruction Following
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning (Sep 19, 2024). Tags: Form, Instruction Following
Follow My Instruction and Spill the Beans: Scalable Data Extraction from Retrieval-Augmented Generation Systems (Feb 27, 2024). Tags: Instruction Following, RAG
Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations (Oct 2, 2023). Tags: In-Context Learning, Instruction Following
Learning to Map Natural Language Instructions to Physical Quadcopter Control using Simulated Flight (Oct 21, 2019). Tags: Continuous Control
Instruct and Extract: Instruction Tuning for On-Demand Information Extraction (Oct 24, 2023). Tags: Instruction Following
InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4 (Aug 23, 2023). Tags: Instruction Following, Question Answering
Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping (Feb 16, 2025). Tags: Code Generation, Instruction Following
CB2: Collaborative Natural Language Interaction Research Platform (Mar 14, 2023). Tags: Instruction Following
Personalized Language Modeling from Personalized Human Feedback (Feb 6, 2024). Tags: Instruction Following, Language Modeling
Lexicon Learning for Few-Shot Neural Sequence Modeling (Jun 7, 2021). Tags: Instruction Following, Machine Translation
Pragmatic Instruction Following and Goal Assistance via Cooperative Language-Guided Inverse Planning (Feb 27, 2024). Tags: Bayesian Inference, Instruction Following
Engineering flexible machine learning systems by traversing functionally-invariant paths (Apr 30, 2022). Tags: Adversarial Robustness, Continual Learning
InfMLLM: A Unified Framework for Visual-Language Tasks (Nov 12, 2023). Tags: GPU, Image Captioning
OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue (Jun 21, 2023). Tags: Instruction Following, Language Modeling
LIONs: An Empirically Optimized Approach to Align Language Models (Jul 9, 2024). Tags: Instruction Following
Are Emergent Abilities in Large Language Models just In-Context Learning? (Sep 4, 2023). Tags: In-Context Learning, Instruction Following
Infer Human's Intentions Before Following Natural Language Instructions (Sep 26, 2024). Tags: Instruction Following
MergeBench: A Benchmark for Merging Domain-Specialized LLMs (May 16, 2025). Tags: Instruction Following
Inferring Rewards from Language in Context (Apr 5, 2022). Tags: Instruction Following, Reinforcement Learning (RL)
Opening up ChatGPT: Tracking openness, transparency, and accountability in instruction-tuned text generators (Jul 8, 2023). Tags: Fairness, Instruction Following
Improving Translation Faithfulness of Large Language Models via Augmenting Instructions (Aug 24, 2023). Tags: Instruction Following, Machine Translation
On the Multi-turn Instruction Following for Conversational Web Agents (Feb 23, 2024). Tags: Conversational Web Navigation, Instruction Following
A Recipe For Building a Compliant Real Estate Chatbot (Oct 7, 2024). Tags: Chatbot, Instruction Following
Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer (Nov 12, 2023). Tags: In-Context Learning, Instruction Following
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models (Jun 2, 2025). Tags: Instruction Following, Reinforcement Learning (RL)
Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following (Feb 28, 2023). Tags: Instruction Following, Zero-shot Generalization
Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text (Jun 8, 2025). Tags: Instruction Following
InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems (Jun 19, 2025). Tags: Benchmarking, Descriptive
Efficient Inference of Vision Instruction-Following Models with Elastic Cache (Jul 25, 2024). Tags: Instruction Following, Text Generation
On the Exploitability of Instruction Tuning (Jun 28, 2023). Tags: Data Poisoning, Instruction Following
Hybrid Alignment Training for Large Language Models (Jun 21, 2024). Tags: Instruction Following
IDA-Bench: Evaluating LLMs on Interactive Guided Data Analysis (May 23, 2025). Tags: Instruction Following
Can Language Models Follow Multiple Turns of Entangled Instructions? (Mar 17, 2025). Tags: Instruction Following, Memorization
LLM-CXR: Instruction-Finetuned LLM for CXR Image Understanding and Generation (May 19, 2023). Tags: Image Generation, Instruction Following
OmniGenBench: A Benchmark for Omnipotent Multimodal Generation across 50+ Tasks (May 24, 2025). Tags: Image Generation, Instruction Following
EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing (Jul 18, 2024). Tags: Instruction Following, Language Modeling
Generation-driven Contrastive Self-training for Zero-shot Text Classification with Instruction-following LLM (Apr 24, 2023). Tags: Instruction Following, Language Modelling
Aligning Instruction Tasks Unlocks Large Language Models as Zero-Shot Relation Extractors (May 18, 2023). Tags: Instruction Following, Question Answering
"No, to the Right" -- Online Language Corrections for Robotic Manipulation via Shared Autonomy (Jan 6, 2023). Tags: Instruction Following
NPHardEval4V: A Dynamic Reasoning Benchmark of Multimodal Large Language Models (Mar 4, 2024). Tags: Instruction Following