Finding Fantastic Experts in MoEs: A Unified Study for Expert Dropping Strategies and Observations Apr 8, 2025 Instruction Following Mixture-of-Experts
— Unverified 0Separator Injection Attack: Uncovering Dialogue Biases in Large Language Models Caused by Role Separators Apr 8, 2025 Instruction Following
— Unverified 0From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Apr 8, 2025 In-Context Learning Instruction Following
— Unverified 0The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context Apr 3, 2025 Instruction Following
— Unverified 0Effectively Controlling Reasoning Models through Thinking Intervention Mar 31, 2025 Instruction Following Safety Alignment
— Unverified 0Pay More Attention to the Robustness of Prompt for Instruction Data Mining Mar 31, 2025 Instruction Following
— Unverified 0Learning to Instruct for Visual Instruction Tuning Mar 28, 2025 Hallucination Instruction Following
— Unverified 0Gemma 3 Technical Report Mar 25, 2025 Instruction Following Math
— Unverified 0OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence Mar 20, 2025 Instruction Following Natural Language Understanding
— Unverified 0Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings Mar 19, 2025 Instruction Following Large Language Model
Code Code Available 0ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs Mar 17, 2025 Instruction Following
— Unverified 0ICCO: Learning an Instruction-conditioned Coordinator for Language-guided Task-aligned Multi-robot Control Mar 15, 2025 Instruction Following Multi-agent Reinforcement Learning
— Unverified 0ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning Mar 14, 2025 Code Generation Decoder
Code Code Available 0D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning Mar 14, 2025 Diversity Instruction Following
— Unverified 0Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models Mar 13, 2025 Instruction Following
— Unverified 0Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding Mar 12, 2025 Instruction Following Video Understanding
— Unverified 0Got Compute, but No Data: Lessons From Post-training a Finnish LLM Mar 12, 2025 Instruction Following
— Unverified 0Open-World Skill Discovery from Unsegmented Demonstrations Mar 11, 2025 Boundary Detection Event Segmentation
— Unverified 0DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering Mar 11, 2025 Form Instruction Following
— Unverified 0Robust Multi-Objective Controlled Decoding of Large Language Models Mar 11, 2025 Instruction Following
Code Code Available 0XIFBench: Evaluating Large Language Models on Multilingual Instruction Following Mar 10, 2025 Instruction Following Specificity
— Unverified 0Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting Mar 9, 2025 Instruction Following Large Language Model
— Unverified 0WildIFEval: Instruction Following in the Wild Mar 9, 2025 Instruction Following
Code Code Available 0S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information Mar 7, 2025 Instruction Following
— Unverified 0IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval Mar 6, 2025 Information Retrieval Instruction Following
— Unverified 0Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment Mar 6, 2025 Instruction Following Transfer Learning
Code Code Available 0Unified Mind Model: Reimagining Autonomous Agents in the LLM Era Mar 5, 2025 Instruction Following
— Unverified 0CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation Mar 5, 2025 Code Generation Instruction Following
— Unverified 0Robust Learning of Diverse Code Edits Mar 5, 2025 Code Generation Instruction Following
— Unverified 0LEWIS (LayEr WIse Sparsity) -- A Training Free Guided Model Merging Approach Mar 5, 2025 Instruction Following Math
— Unverified 0InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training Mar 4, 2025 Instruction Following text-to-speech
— Unverified 0Iterative Value Function Optimization for Guided Decoding Mar 4, 2025 Decision Making Instruction Following
— Unverified 0In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models Mar 3, 2025 In-Context Learning Instruction Following
— Unverified 0Re-Imagining Multimodal Instruction Tuning: A Representation View Mar 2, 2025 Instruction Following MME
Code Code Available 0Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective Feb 28, 2025 Instruction Following
— Unverified 0Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge Feb 27, 2025 GSM8K HumanEval
— Unverified 0DataMan: Data Manager for Pre-training Large Language Models Feb 26, 2025 In-Context Learning Instruction Following
— Unverified 0Stay Focused: Problem Drift in Multi-Agent Debate Feb 26, 2025 Instruction Following
Code Code Available 0Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models Feb 26, 2025 Instruction Following Vision-Language-Action
— Unverified 0Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments Feb 26, 2025 Instruction Following Vision and Language Navigation
— Unverified 0URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models Feb 25, 2025 Instruction Following
— Unverified 0TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning Feb 25, 2025 Instruction Following Language Modeling
Code Code Available 0UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings Feb 24, 2025 Diversity Instruction Following
— Unverified 0Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing Feb 24, 2025 Instruction Following Model Selection
Code Code Available 0ATEB: Evaluating and Improving Advanced NLP Tasks for Text Embedding Models Feb 24, 2025 Information Retrieval Instruction Following
— Unverified 0Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following Feb 24, 2025 Instruction Following Position
Code Code Available 0NatSGLD: A Dataset with Speech, Gesture, Logic, and Demonstration for Robot Learning in Natural Human-Robot Interaction Feb 23, 2025 Instruction Following
Code Code Available 0Sequence-level Large Language Model Training with Contrastive Preference Optimization Feb 23, 2025 Instruction Following Language Modeling
— Unverified 0SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents Feb 21, 2025 Instruction Following
Code Code Available 0OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment Feb 19, 2025 Hallucination Instruction Following
— Unverified 0