Instruction-Following Evaluation for Large Language Models Nov 14, 2023 Instruction Following
Code Code Available 5MART: Improving LLM Safety with Multi-round Automatic Red-Teaming Nov 13, 2023 Instruction Following Red Teaming
— Unverified 0Generalization Analogies: A Testbed for Generalizing AI Oversight to Hard-To-Measure Domains Nov 13, 2023 Instruction Following
Code Code Available 0To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning Nov 13, 2023 Instruction Following MM-Vet
Code Code Available 2WaterBench: Towards Holistic Evaluation of Watermarks for Large Language Models Nov 13, 2023 Benchmarking Instruction Following
Code Code Available 1InfMLLM: A Unified Framework for Visual-Language Tasks Nov 12, 2023 GPU Image Captioning
Code Code Available 1Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer Nov 12, 2023 In-Context Learning Instruction Following
Code Code Available 1DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training Nov 12, 2023 Instruction Following Position
Code Code Available 0u-LLaVA: Unifying Multi-Modal Tasks via Large Language Model Nov 9, 2023 Instruction Following Language Modeling
Code Code Available 1LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Nov 9, 2023 Instruction Following LLM real-life tasks
Code Code Available 2Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Nov 6, 2023 Decoder GSM8K
Code Code Available 2PhoGPT: Generative Pre-training for Vietnamese Nov 6, 2023 Instruction Following
Code Code Available 2ChEF: A Comprehensive Evaluation Framework for Standardized Assessment of Multimodal Large Language Models Nov 5, 2023 Hallucination In-Context Learning
— Unverified 0COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning Nov 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Active Reasoning in an Open-World Environment Nov 3, 2023 Instruction Following Minecraft
— Unverified 0FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models Nov 2, 2023 Descriptive Instruction Following
Code Code Available 1Tensor Trust: Interpretable Prompt Injection Attacks from an Online Game Nov 2, 2023 Instruction Following
— Unverified 0Instructive Decoding: Instruction-Tuned Large Language Models are Self-Refiner from Noisy Instructions Nov 1, 2023 Few-Shot NLI Instruction Following
Code Code Available 1FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models Oct 31, 2023 Instruction Following
Code Code Available 1Making Large Language Models Better Data Creators Oct 31, 2023 Instruction Following Prompt Engineering
Code Code Available 1Myriad: Large Multimodal Model by Applying Vision Experts for Industrial Anomaly Detection Oct 29, 2023 Anomaly Detection Image Captioning
Code Code Available 1Privately Aligning Language Models with Reinforcement Learning Oct 25, 2023 Instruction Following Privacy Preserving
— Unverified 0CycleAlign: Iterative Distillation from Black-box LLM to White-box Models for Better Human Alignment Oct 25, 2023 In-Context Learning Instruction Following
— Unverified 0Multilingual Coarse Political Stance Classification of Media. The Editorial Line of a ChatGPT and Bard Newspaper Oct 25, 2023 Articles Instruction Following
— Unverified 0Instruct and Extract: Instruction Tuning for On-Demand Information Extraction Oct 24, 2023 Instruction Following
Code Code Available 1Analyzing Multilingual Competency of LLMs in Multi-Turn Instruction Following: A Case Study of Arabic Oct 23, 2023 Benchmarking Instruction Following
— Unverified 0AlpaCare:Instruction-tuned Large Language Models for Medical Application Oct 23, 2023 Diversity Instruction Following
Code Code Available 1Monte Carlo Thought Search: Large Language Model Querying for Complex Scientific Reasoning in Catalyst Design Oct 22, 2023 Computational chemistry Instruction Following
Code Code Available 1BotChat: Evaluating LLMs' Capabilities of Having Multi-Turn Dialogues Oct 20, 2023 Instruction Following
Code Code Available 1Democratizing Reasoning Ability: Tailored Learning from Large Language Model Oct 20, 2023 Instruction Following Language Modeling
Code Code Available 1An Emulator for Fine-Tuning Large Language Models using Small Language Models Oct 19, 2023 Instruction Following
Code Code Available 1LACMA: Language-Aligning Contrastive Learning with Meta-Actions for Embodied Instruction Following Oct 18, 2023 Contrastive Learning Instruction Following
Code Code Available 0LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation Oct 18, 2023 Caption Generation Instruction Following
— Unverified 0Quantifying Self-diagnostic Atomic Knowledge in Chinese Medical Foundation Model: A Computational Analysis Oct 18, 2023 Diagnostic Instruction Following
Code Code Available 0VeRA: Vector-based Random Matrix Adaptation Oct 17, 2023 image-classification Image Classification
— Unverified 0Gaining Wisdom from Setbacks: Aligning Large Language Models via Mistake Analysis Oct 16, 2023 Instruction Following
— Unverified 0Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning Oct 14, 2023 In-Context Learning Instruction Following
— Unverified 0Towards Better Evaluation of Instruction-Following: A Case-Study in Summarization Oct 12, 2023 Instruction Following
— Unverified 0GROOT: Learning to Follow Instructions by Watching Gameplay Videos Oct 12, 2023 Decoder Instruction Following
— Unverified 0Parrot: Enhancing Multi-Turn Instruction Following for Large Language Models Oct 11, 2023 Attribute Instruction Following
— Unverified 0Evaluating Large Language Models at Evaluating Instruction Following Oct 11, 2023 Instruction Following
Code Code Available 1From Supervised to Generative: A Novel Paradigm for Tabular Deep Learning with Large Language Models Oct 11, 2023 In-Context Learning Instruction Following
Code Code Available 0LLark: A Multimodal Instruction-Following Language Model for Music Oct 11, 2023 Instruction Following Language Modeling
Code Code Available 2TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models Oct 10, 2023 Code Generation Continual Learning
Code Code Available 1Understanding the Effects of RLHF on LLM Generalisation and Diversity Oct 10, 2023 Diversity Instruction Following
Code Code Available 1How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition Oct 9, 2023 Code Generation Instruction Following
Code Code Available 3Chat Vector: A Simple Approach to Equip LLMs with Instruction Following and Model Alignment in New Languages Oct 7, 2023 Instruction Following
Code Code Available 1SteP: Stacked LLM Policies for Web Actions Oct 5, 2023 Instruction Following
— Unverified 0Benchmarking and Improving Generator-Validator Consistency of Language Models Oct 3, 2023 Benchmarking Instruction Following
— Unverified 0Use Your INSTINCT: INSTruction optimization for LLMs usIng Neural bandits Coupled with Transformers Oct 2, 2023 Bayesian Optimization Instruction Following
Code Code Available 1