AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models Jun 24, 2024 Benchmarking Data Augmentation
Code Code Available 1Evaluation of Instruction-Following Ability for Large Language Models on Story-Ending Generation Jun 24, 2024 Instruction Following Machine Reading Comprehension
— Unverified 0Lottery Ticket Adaptation: Mitigating Destructive Interference in LLMs Jun 24, 2024 Instruction Following Math
Code Code Available 1AudioBench: A Universal Benchmark for Audio Large Language Models Jun 23, 2024 Audio Scene Understanding Instruction Following
Code Code Available 3RuleR: Improving LLM Controllability by Rule-based Data Recycling Jun 22, 2024 Data Augmentation Instruction Following
Code Code Available 1Teach Better or Show Smarter? On Instructions and Exemplars in Automatic Prompt Optimization Jun 22, 2024 Instruction Following Prompt Engineering
— Unverified 0DEM: Distribution Edited Model for Training with Mixed Data Distributions Jun 21, 2024 Diversity Instruction Following
— Unverified 0Hybrid Alignment Training for Large Language Models Jun 21, 2024 Instruction Following
Code Code Available 1AdaGrad under Anisotropic Smoothness Jun 21, 2024 Instruction Following
— Unverified 0VLM Agents Generate Their Own Memories: Distilling Experience into Embodied Programs of Thought Jun 20, 2024 Action Anticipation Continual Learning
— Unverified 0LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors Jun 20, 2024 16k Instruction Following
Code Code Available 1IWISDM: Assessing instruction following in multimodal models at scale Jun 20, 2024 Decision Making Instruction Following
Code Code Available 0Finding Blind Spots in Evaluator LLMs with Interpretable Checklists Jun 19, 2024 Instruction Following Text Generation
Code Code Available 1Biomedical Visual Instruction Tuning with Clinician Preference Alignment Jun 19, 2024 Instruction Following Visual Question Answering (VQA)
Code Code Available 0Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Jun 19, 2024 Instruction Following
Code Code Available 3The Comparative Trap: Pairwise Comparisons Amplifies Biased Preferences of LLM Evaluators Jun 18, 2024 Instruction Following Text Generation
— Unverified 0Unveiling the Flaws: Exploring Imperfections in Synthetic Data and Mitigation Strategies for Large Language Models Jun 18, 2024 Instruction Following
— Unverified 0ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools Jun 18, 2024 All GSM8K
Code Code Available 14Opt-Out: Investigating Entity-Level Unlearning for Large Language Models via Optimal Transport Jun 18, 2024 Instruction Following Machine Unlearning
Code Code Available 0RS-GPT4V: A Unified Multimodal Instruction-Following Dataset for Remote Sensing Image Understanding Jun 18, 2024 Attribute Instruction Following
Code Code Available 1Refine Large Language Model Fine-tuning via Instruction Vector Jun 18, 2024 Instruction Following Language Modeling
— Unverified 0ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates Jun 17, 2024 Instruction Following Safety Alignment
Code Code Available 1Grade Score: Quantifying LLM Performance in Option Selection Jun 17, 2024 Decision Making Fairness
Code Code Available 0Generative Visual Instruction Tuning Jun 17, 2024 Image Generation Image-text matching
Code Code Available 0Enhancing and Assessing Instruction-Following with Fine-Grained Instruction Variants Jun 17, 2024 Data Augmentation Diversity
— Unverified 0How Far Can In-Context Alignment Go? Exploring the State of In-Context Alignment Jun 17, 2024 In-Context Learning Instruction Following
— Unverified 0WPO: Enhancing RLHF with Weighted Preference Optimization Jun 17, 2024 Instruction Following
Code Code Available 1GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities Jun 17, 2024 Audio Question Answering Instruction Following
Code Code Available 2Refusal in Language Models Is Mediated by a Single Direction Jun 17, 2024 Instruction Following
Code Code Available 3Embodied Instruction Following in Unknown Environments Jun 17, 2024 Instruction Following Task Planning
— Unverified 0Reminding Multimodal Large Language Models of Object-aware Knowledge with Retrieved Tags Jun 16, 2024 Image to text Instruction Following
— Unverified 0DiscreteSLU: A Large Language Model with Self-Supervised Discrete Speech Units for Spoken Language Understanding Jun 13, 2024 Instruction Following Language Modeling
— Unverified 0Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback Jun 13, 2024 Instruction Following Math
Code Code Available 7MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning Jun 13, 2024 Instruction Following Math
Code Code Available 3Comparison Visual Instruction Tuning Jun 13, 2024 Instruction Following Novelty Detection
— Unverified 0Mimicking User Data: On Mitigating Fine-Tuning Risks in Closed Large Language Models Jun 12, 2024 Instruction Following Safety Alignment
— Unverified 0TasTe: Teaching Large Language Models to Translate through Self-Reflection Jun 12, 2024 Instruction Following Machine Translation
Code Code Available 1OPTune: Efficient Online Preference Tuning Jun 11, 2024 Instruction Following
— Unverified 0CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation Jun 11, 2024 Instruction Following
Code Code Available 03D-Properties: Identifying Challenges in DPO and Charting a Path Forward Jun 11, 2024 Instruction Following Mathematical Problem-Solving
— Unverified 0FaceGPT: Self-supervised Learning to Chat about 3D Human Faces Jun 11, 2024 3D Face Reconstruction Face Model
— Unverified 0RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent Jun 11, 2024 AI Agent Descriptive
Code Code Available 2SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature Jun 10, 2024 Claim Verification Instruction Following
Code Code Available 1The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models Jun 9, 2024 Instruction Following
Code Code Available 5F-LMM: Grounding Frozen Large Multimodal Models Jun 9, 2024 General Knowledge Instruction Following
Code Code Available 2CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for Task-Aware Parameter-Efficient Fine-tuning Jun 7, 2024 Instruction Following Math
Code Code Available 2GenAI Arena: An Open Evaluation Platform for Generative Models Jun 6, 2024 Image Generation Instruction Following
Code Code Available 2BLSP-Emo: Towards Empathetic Large Speech-Language Models Jun 6, 2024 Emotion Recognition Instruction Following
Code Code Available 2Synthetic Programming Elicitation for Text-to-Code in Very Low-Resource Programming and Formal Languages Jun 5, 2024 Instruction Following Retrieval
Code Code Available 0Large Language Models as Evaluators for Recommendation Explanations Jun 5, 2024 Common Sense Reasoning Instruction Following
Code Code Available 1