On the Loss of Context-awareness in General Instruction Fine-tuning Nov 5, 2024 Benchmarking Instruction Following
Code Code Available 0Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors Nov 3, 2024 Instruction Following RAG
— Unverified 0Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models Nov 3, 2024 Hallucination Instruction Following
Code Code Available 0TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models Nov 2, 2024 Image Description Image Generation
— Unverified 0LLaMo: Large Language Model-based Molecular Graph Assistant Oct 31, 2024 Instruction Following IUPAC Name Prediction
Code Code Available 1Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models Oct 31, 2024 Instruction Following Reranking
Code Code Available 0Constraint Back-translation Improves Complex Instruction Following of Large Language Models Oct 31, 2024 Instruction Following Translation
Code Code Available 1MDCure: A Scalable Pipeline for Multi-Document Instruction-Following Oct 30, 2024 Articles Instruction Following
Code Code Available 0FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system Oct 28, 2024 Code Generation HumanEval
Code Code Available 0UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function Oct 28, 2024 Instruction Following Text Generation
— Unverified 0SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models Oct 25, 2024 Instruction Following Knowledge Distillation
— Unverified 0Open6DOR: Benchmarking Open-instruction 6-DoF Object Rearrangement and A VLM-based Approach Oct 24, 2024 Benchmarking Instruction Following
Code Code Available 2BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning Oct 24, 2024 Instruction Following Natural Language Understanding
— Unverified 0DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucinations Oct 24, 2024 Instruction Following Question Answering
Code Code Available 1Unbounded: A Generative Infinite Game of Character Life Simulation Oct 24, 2024 Instruction Following Language Modelling
— Unverified 0Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks Oct 23, 2024 Instruction Following Safety Alignment
— Unverified 0Cross-model Control: Improving Multiple Large Language Models in One-time Training Oct 23, 2024 Instruction Following Language Modeling
Code Code Available 1Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models Oct 23, 2024 Instruction Following Language Modelling
Code Code Available 2Cross-lingual Transfer of Reward Models in Multilingual Alignment Oct 23, 2024 Cross-Lingual Transfer Instruction Following
Code Code Available 0SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains Oct 23, 2024 Domain Adaptation Instruction Following
— Unverified 0ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning Oct 23, 2024 Image Captioning Instruction Following
Code Code Available 1Multi-IF: Benchmarking LLMs on Multi-Turn and Multilingual Instructions Following Oct 21, 2024 Benchmarking Instruction Following
Code Code Available 2GATEAU: Selecting Influential Samples for Long Context Alignment Oct 21, 2024 Instruction Following Long-Context Understanding
Code Code Available 1Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Oct 21, 2024 Instruction Following object-detection
Code Code Available 0Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges Oct 20, 2024 Autonomous Driving Decision Making
— Unverified 0LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound Oct 19, 2024 Instruction Following Knowledge Distillation
— Unverified 0LoGU: Long-form Generation with Uncertainty Expressions Oct 18, 2024 Form Instruction Following
Code Code Available 1Do LLMs "know" internally when they follow instructions? Oct 18, 2024 Instruction Following Prompt Engineering
Code Code Available 1Do LLMs estimate uncertainty well in instruction-following? Oct 18, 2024 Instruction Following
Code Code Available 0Boosting LLM Translation Skills without General Ability Loss via Rationale Distillation Oct 17, 2024 General Knowledge Instruction Following
— Unverified 0LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning Oct 17, 2024 image-classification Image Classification
Code Code Available 0POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization Oct 16, 2024 Instruction Following
Code Code Available 0Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception Oct 16, 2024 Binary Classification Chunking
Code Code Available 3Evaluating the Instruction-following Abilities of Language Models using Knowledge Tasks Oct 16, 2024 Instruction Following Multiple-choice
Code Code Available 0RuleRAG: Rule-guided retrieval-augmented generation with language models for question answering Oct 15, 2024 In-Context Learning Instruction Following
Code Code Available 1Improving Instruction-Following in Language Models through Activation Steering Oct 15, 2024 Instruction Following Text Generation
— Unverified 0Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling Oct 15, 2024 Instruction Following Knowledge Distillation
— Unverified 0SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding Oct 15, 2024 Instruction Following Visual Question Answering (VQA)
— Unverified 0Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs Oct 14, 2024 Instruction Following
— Unverified 0How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective Oct 14, 2024 Density Ratio Estimation GSM8K
Code Code Available 0ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization Oct 14, 2024 Explanation Generation Image Forgery Detection
— Unverified 0Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search Oct 14, 2024 Instruction Following
— Unverified 0DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model Oct 14, 2024 Diversity Instruction Following
— Unverified 0Thinking LLMs: General Instruction Following with Thought Generation Oct 14, 2024 General Knowledge Instruction Following
— Unverified 0Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles Oct 13, 2024 Autonomous Vehicles Code Generation
— Unverified 0Surgical-LLaVA: Toward Surgical Scenario Understanding via Large Language and Vision Models Oct 13, 2024 Instruction Following Question Answering
— Unverified 0Toward General Instruction-Following Alignment for Retrieval-Augmented Generation Oct 12, 2024 Instruction Following RAG
Code Code Available 2Are You Human? An Adversarial Benchmark to Expose LLMs Oct 12, 2024 Instruction Following
— Unverified 0SeRA: Self-Reviewing and Alignment of Large Language Models using Implicit Reward Margins Oct 12, 2024 Instruction Following
— Unverified 0Nudging: Inference-time Alignment of LLMs via Guided Decoding Oct 11, 2024 General Knowledge GSM8K
— Unverified 0