Fox-1 Technical Report Nov 8, 2024 2k 8k
— Unverified 0Bayesian Calibration of Win Rate Estimation with LLM Evaluators Nov 7, 2024 Bayesian Inference Instruction Following
Code Code Available 0Multi-Reward as Condition for Instruction-based Image Editing Nov 6, 2024 Descriptive Instruction Following
— Unverified 0On the Loss of Context-awareness in General Instruction Fine-tuning Nov 5, 2024 Benchmarking Instruction Following
Code Code Available 0Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models Nov 3, 2024 Hallucination Instruction Following
Code Code Available 0Data Extraction Attacks in Retrieval-Augmented Generation via Backdoors Nov 3, 2024 Instruction Following RAG
— Unverified 0TypeScore: A Text Fidelity Metric for Text-to-Image Generative Models Nov 2, 2024 Image Description Image Generation
— Unverified 0Beyond Content Relevance: Evaluating Instruction Following in Retrieval Models Oct 31, 2024 Instruction Following Reranking
Code Code Available 0MDCure: A Scalable Pipeline for Multi-Document Instruction-Following Oct 30, 2024 Articles Instruction Following
Code Code Available 0UFT: Unifying Fine-Tuning of SFT and RLHF/DPO/UNA through a Generalized Implicit Reward Function Oct 28, 2024 Instruction Following Text Generation
— Unverified 0FALCON: Feedback-driven Adaptive Long/short-term memory reinforced Coding Optimization system Oct 28, 2024 Code Generation HumanEval
Code Code Available 0SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models Oct 25, 2024 Instruction Following Knowledge Distillation
— Unverified 0BioMistral-NLU: Towards More Generalizable Medical Language Understanding through Instruction Tuning Oct 24, 2024 Instruction Following Natural Language Understanding
— Unverified 0Unbounded: A Generative Infinite Game of Character Life Simulation Oct 24, 2024 Instruction Following Language Modelling
— Unverified 0Cross-lingual Transfer of Reward Models in Multilingual Alignment Oct 23, 2024 Cross-Lingual Transfer Instruction Following
Code Code Available 0Towards Understanding the Fragility of Multilingual LLMs against Fine-Tuning Attacks Oct 23, 2024 Instruction Following Safety Alignment
— Unverified 0SimRAG: Self-Improving Retrieval-Augmented Generation for Adapting Large Language Models to Specialized Domains Oct 23, 2024 Domain Adaptation Instruction Following
— Unverified 0Griffon-G: Bridging Vision-Language and Vision-Centric Tasks via Large Multimodal Models Oct 21, 2024 Instruction Following object-detection
— Unverified 0Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges Oct 20, 2024 Autonomous Driving Decision Making
— Unverified 0LLaVA-Ultra: Large Chinese Language and Vision Assistant for Ultrasound Oct 19, 2024 Instruction Following Knowledge Distillation
— Unverified 0Do LLMs estimate uncertainty well in instruction-following? Oct 18, 2024 Instruction Following
Code Code Available 0Boosting LLM Translation Skills without General Ability Loss via Rationale Distillation Oct 17, 2024 General Knowledge Instruction Following
— Unverified 0LoLDU: Low-Rank Adaptation via Lower-Diag-Upper Decomposition for Parameter-Efficient Fine-Tuning Oct 17, 2024 image-classification Image Classification
Code Code Available 0Evaluating the Instruction-following Abilities of Language Models using Knowledge Tasks Oct 16, 2024 Instruction Following Multiple-choice
Code Code Available 0POROver: Improving Safety and Reducing Overrefusal in Large Language Models with Overgeneration and Preference Optimization Oct 16, 2024 Instruction Following
Code Code Available 0SlideChat: A Large Vision-Language Assistant for Whole-Slide Pathology Image Understanding Oct 15, 2024 Instruction Following Visual Question Answering (VQA)
— Unverified 0Improving Instruction-Following in Language Models through Activation Steering Oct 15, 2024 Instruction Following Text Generation
— Unverified 0Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling Oct 15, 2024 Instruction Following Knowledge Distillation
— Unverified 0ForgeryGPT: Multimodal Large Language Model For Explainable Image Forgery Detection and Localization Oct 14, 2024 Explanation Generation Image Forgery Detection
— Unverified 0DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model Oct 14, 2024 Diversity Instruction Following
— Unverified 0Optimizing Instruction Synthesis: Effective Exploration of Evolutionary Space with Tree Search Oct 14, 2024 Instruction Following
— Unverified 0Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs Oct 14, 2024 Instruction Following
— Unverified 0How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective Oct 14, 2024 Density Ratio Estimation GSM8K
Code Code Available 0Thinking LLMs: General Instruction Following with Thought Generation Oct 14, 2024 General Knowledge Instruction Following
— Unverified 0Conversational Code Generation: a Case Study of Designing a Dialogue System for Generating Driving Scenarios for Testing Autonomous Vehicles Oct 13, 2024 Autonomous Vehicles Code Generation
— Unverified 0Surgical-LLaVA: Toward Surgical Scenario Understanding via Large Language and Vision Models Oct 13, 2024 Instruction Following Question Answering
— Unverified 0Are You Human? An Adversarial Benchmark to Expose LLMs Oct 12, 2024 Instruction Following
— Unverified 0SeRA: Self-Reviewing and Alignment of Large Language Models using Implicit Reward Margins Oct 12, 2024 Instruction Following
— Unverified 0Nudging: Inference-time Alignment of LLMs via Guided Decoding Oct 11, 2024 General Knowledge GSM8K
— Unverified 0Evolutionary Contrastive Distillation for Language Model Alignment Oct 10, 2024 Contrastive Learning Instruction Following
— Unverified 0Instructional Segment Embedding: Improving LLM Safety with Instruction Hierarchy Oct 9, 2024 Instruction Following
— Unverified 0LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints Oct 9, 2024 Instruction Following
— Unverified 0Self-Boosting Large Language Models with Synthetic Preference Data Oct 9, 2024 Instruction Following
— Unverified 0HERM: Benchmarking and Enhancing Multimodal LLMs for Human-Centric Understanding Oct 9, 2024 Benchmarking Instruction Following
— Unverified 0Large Language Model Compression with Neural Architecture Search Oct 9, 2024 Instruction Following Language Modeling
— Unverified 0ReIFE: Re-evaluating Instruction-Following Evaluation Oct 9, 2024 Instruction Following
Code Code Available 0Direct Preference Optimization for LLM-Enhanced Recommendation Systems Oct 8, 2024 In-Context Learning Instruction Following
— Unverified 0Multimodal Situational Safety Oct 8, 2024 Instruction Following
— Unverified 0TOWER: Tree Organized Weighting for Evaluating Complex Instructions Oct 8, 2024 Chatbot Instruction Following
— Unverified 0Only-IF:Revealing the Decisive Effect of Instruction Diversity on Generalization Oct 7, 2024 Diversity Instruction Following
— Unverified 0