Language Imbalance Driven Rewarding for Multilingual Self-improving Oct 11, 2024 Arithmetic Reasoning Instruction Following
Code Code Available 1CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection Oct 10, 2024 Instruction Following
Code Code Available 1Reward-Augmented Data Enhances Direct Preference Alignment of LLMs Oct 10, 2024 Instruction Following
Code Code Available 1Evolutionary Contrastive Distillation for Language Model Alignment Oct 10, 2024 Contrastive Learning Instruction Following
— Unverified 0Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy Oct 9, 2024 Instruction Following
— Unverified 0HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding Oct 9, 2024 Benchmarking Instruction Following
— Unverified 0Large Language Model Compression with Neural Architecture Search Oct 9, 2024 Instruction Following Language Modeling
— Unverified 0LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints Oct 9, 2024 Instruction Following
— Unverified 0ReIFE: Re-evaluating Instruction-Following Evaluation Oct 9, 2024 Instruction Following
Code Code Available 0Self-Boosting Large Language Models with Synthetic Preference Data Oct 9, 2024 Instruction Following
— Unverified 0Direct Preference Optimization for LLM-Enhanced Recommendation Systems Oct 8, 2024 In-Context Learning Instruction Following
— Unverified 0TOWER: Tree Organized Weighting for Evaluating Complex Instructions Oct 8, 2024 Chatbot Instruction Following
— Unverified 0Aria: An Open Multimodal Native Mixture-of-Experts Model Oct 8, 2024 Instruction Following Mixture-of-Experts
Code Code Available 5Multimodal Situational Safety Oct 8, 2024 Instruction Following
— Unverified 0TEOChat: A Large Vision-Language Assistant for Temporal Earth Observation Data Oct 8, 2024 Change Detection Earth Observation
Code Code Available 2Superficial Safety Alignment Hypothesis Oct 7, 2024 Attribute Binary Classification
— Unverified 0A Recipe For Building a Compliant Real Estate Chatbot Oct 7, 2024 Chatbot Instruction Following
Code Code Available 1On Instruction-Finetuning Neural Machine Translation Models Oct 7, 2024 Domain Adaptation Instruction Following
— Unverified 0RevisEval: Improving LLM-as-a-Judge via Response-Adapted References Oct 7, 2024 Instruction Following Text Generation
— Unverified 0SFTMix: Elevating Language Model Instruction Tuning with Mixup Recipe Oct 7, 2024 Instruction Following Language Modeling
— Unverified 0Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization Oct 7, 2024 Diversity Instruction Following
— Unverified 0CS4: Measuring the Creativity of Large Language Models Automatically by Controlling the Number of Story-Writing Constraints Oct 5, 2024 Instruction Following Specificity
Code Code Available 0Self-Powered LLM Modality Expansion for Large Speech-Text Models Oct 4, 2024 Automatic Speech Recognition Instruction Following
Code Code Available 0SAG: Style-Aligned Article Generation via Model Collaboration Oct 4, 2024 Hallucination Instruction Following
— Unverified 0CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions Oct 4, 2024 Instruction Following MMLU
Code Code Available 0TICKing All the Boxes: Generated Checklists Improve LLM Evaluation and Generation Oct 4, 2024 All Instruction Following
— Unverified 0Better Instruction-Following Through Minimum Bayes Risk Oct 3, 2024 Instruction Following
— Unverified 0LoGra-Med: Long Context Multi-Graph Alignment for Medical Vision-Language Model Oct 3, 2024 image-classification Image Classification
— Unverified 0LLaVA-Critic: Learning to Evaluate Multimodal Models Oct 3, 2024 Instruction Following
— Unverified 0Video Instruction Tuning With Synthetic Data Oct 3, 2024 3D Question Answering (3D-QA)
— Unverified 0MedQA-CS: Benchmarking Large Language Models Clinical Skills Using an AI-SCE Framework Oct 2, 2024 Benchmarking Instruction Following
Code Code Available 1LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits Oct 2, 2024 Instruction Following Math
Code Code Available 1Robin3D: Improving 3D Large Language Model via Robust Instruction Tuning Sep 30, 2024 Instruction Following Language Modeling
Code Code Available 2DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data Sep 30, 2024 Instruction Following Language Modeling
Code Code Available 2The Perfect Blend: Redefining RLHF with Mixture of Judges Sep 30, 2024 Instruction Following Math
— Unverified 0Revisiting the Superficial Alignment Hypothesis Sep 27, 2024 Instruction Following Math
— Unverified 0Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models Sep 27, 2024 Instruction Following
Code Code Available 1Align^2LLaVA: Cascaded Human and Large Language Model Preference Alignment for Multi-modal Instruction Curation Sep 27, 2024 Instruction Following Language Modeling
Code Code Available 0MMMT-IF: A Challenging Multimodal Multi-Turn Instruction Following Benchmark Sep 26, 2024 Instruction Following
— Unverified 0Inference-Time Language Model Alignment via Integrated Value Guidance Sep 26, 2024 Instruction Following Language Modeling
— Unverified 0Infer Human's Intentions Before Following Natural Language Instructions Sep 26, 2024 Instruction Following
Code Code Available 1Mitigating the Bias of Large Language Model Evaluation Sep 25, 2024 Instruction Following Language Model Evaluation
Code Code Available 0EventHallusion: Diagnosing Event Hallucinations in Video LLMs Sep 25, 2024 Hallucination Instruction Following
Code Code Available 1EAGLE: Towards Efficient Arbitrary Referring Visual Prompts Comprehension for Multimodal Large Language Models Sep 25, 2024 Instruction Following
— Unverified 0FMDLlama: Financial Misinformation Detection based on Large Language Models Sep 24, 2024 Explanation Generation Instruction Following
Code Code Available 0MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios Sep 24, 2024 Instruction Following
Code Code Available 1Style Outweighs Substance: Failure Modes of LLM Judges in Alignment Benchmarking Sep 23, 2024 Benchmarking Diversity
Code Code Available 0OmniBench: Towards The Future of Universal Omni-Language Models Sep 23, 2024 Instruction Following
Code Code Available 2Archon: An Architecture Search Framework for Inference-Time Techniques Sep 23, 2024 Hyperparameter Optimization Instruction Following
Code Code Available 2ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback Sep 23, 2024 Instruction Following
Code Code Available 1