Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments Feb 26, 2025 Instruction Following, Vision and Language Navigation
— Unverified 0 TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning Feb 25, 2025 Instruction Following, Language Modeling
Code Available 0 Rank1: Test-Time Compute for Reranking in Information Retrieval Feb 25, 2025 Information Retrieval, Instruction Following
Code Available 2 URO-Bench: A Comprehensive Benchmark for End-to-End Spoken Dialogue Models Feb 25, 2025 Instruction Following
— Unverified 0 Order Matters: Investigate the Position Bias in Multi-constraint Instruction Following Feb 24, 2025 Instruction Following, Position
Code Available 0 ATEB: Evaluating and Improving Advanced NLP Tasks for Text Embedding Models Feb 24, 2025 Information Retrieval, Instruction Following
— Unverified 0 UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings Feb 24, 2025 Diversity, Instruction Following
— Unverified 0 Capability Instruction Tuning: A New Paradigm for Dynamic LLM Routing Feb 24, 2025 Instruction Following, Model Selection
Code Available 0 Sequence-level Large Language Model Training with Contrastive Preference Optimization Feb 23, 2025 Instruction Following, Language Modeling
— Unverified 0 NatSGLD: A Dataset with Speech, Gesture, Logic, and Demonstration for Robot Learning in Natural Human-Robot Interaction Feb 23, 2025 Instruction Following
Code Available 0 SOTOPIA-Ω: Dynamic Strategy Injection Learning and Social Instruction Following Evaluation for Social Agents Feb 21, 2025 Instruction Following
Code Available 0 StructFlowBench: A Structured Flow Benchmark for Multi-turn Instruction Following Feb 20, 2025 Instruction Following
Code Available 1 OpenSearch-SQL: Enhancing Text-to-SQL with Dynamic Few-shot and Consistency Alignment Feb 19, 2025 Hallucination, Instruction Following
— Unverified 0 Investigating Non-Transitivity in LLM-as-a-Judge Feb 19, 2025 Chatbot, Computational Efficiency
— Unverified 0 Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh Feb 19, 2025 Instruction Following, Multiple-choice
— Unverified 0 TESS 2: A Large-Scale Generalist Diffusion Language Model Feb 19, 2025 Instruction Following, Language Modeling
Code Available 2 MMTEB: Massive Multilingual Text Embedding Benchmark Feb 19, 2025 Instruction Following, Retrieval
Code Available 0 TALKPLAY: Multimodal Music Recommendation with Large Language Models Feb 19, 2025 Conversational Recommendation, Instruction Following
— Unverified 0 Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models Feb 18, 2025 Data Augmentation, GSM8K
— Unverified 0 Do we Really Need Visual Instructions? Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models Feb 17, 2025 Instruction Following, visual instruction following
— Unverified 0 RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Feb 17, 2025 Instruction Following, Machine Reading Comprehension
Code Available 0 SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models Feb 17, 2025 Instruction Following
— Unverified 0 MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training Feb 17, 2025 Instruction Following
Code Available 0 Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding Feb 17, 2025 Instruction Following, Language Modeling
— Unverified 0 Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction Feb 17, 2025 Instruction Following, Voice Cloning
Code Available 7 Cuckoo: An IE Free Rider Hatched by Massive Nutrition in LLM's Nest Feb 16, 2025 Instruction Following, Nutrition
Code Available 0 CORDIAL: Can Multimodal Large Language Models Effectively Understand Coherence Relationships? Feb 16, 2025 Instruction Following
Code Available 0 Enhancing Cross-Tokenizer Knowledge Distillation with Contextual Dynamical Mapping Feb 16, 2025 Code Generation, Instruction Following
Code Available 1 Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction Feb 16, 2025 Instruction Following
Code Available 0 Large Language Diffusion Models Feb 14, 2025 In-Context Learning, Instruction Following
Code Available 7 E2LVLM: Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection Feb 12, 2025 Instruction Following, Language Modeling
— Unverified 0 IHEval: Evaluating Language Models on Following the Instruction Hierarchy Feb 12, 2025 Instruction Following
Code Available 1 BenchMAX: A Comprehensive Multilingual Evaluation Suite for Large Language Models Feb 11, 2025 Code Generation, Instruction Following
Code Available 1 Who Taught You That? Tracing Teachers in Model Distillation Feb 10, 2025 Instruction Following, POS
— Unverified 0 Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following Feb 8, 2025 Instruction Following
— Unverified 0 Hypencoder: Hypernetworks for Information Retrieval Feb 7, 2025 Information Retrieval, Instruction Following
— Unverified 0 M-IFEval: Multilingual Instruction-Following Evaluation Feb 7, 2025 Instruction Following
Code Available 1 Verifiable Format Control for Large Language Model Generations Feb 6, 2025 Benchmarking, Instruction Following
— Unverified 0 UltraIF: Advancing Instruction Following from the Wild Feb 6, 2025 Instruction Following
Code Available 1 LLMs can be easily Confused by Instructional Distractions Feb 5, 2025 Bias Detection, Code Generation
— Unverified 0 Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons Feb 5, 2025 Instruction Following, Knowledge Distillation
— Unverified 0 SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Feb 4, 2025 Instruction Following, Language Modeling
— Unverified 0 Shuttle Between the Instructions and the Parameters of Large Language Models Feb 4, 2025 Dimensionality Reduction, Instruction Following
— Unverified 0 CoDe: Blockwise Control for Denoising Diffusion Models Feb 3, 2025 Denoising, Instruction Following
Code Available 0 BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation Feb 3, 2025 Diversity, GSM8K
— Unverified 0 Learning Human Perception Dynamics for Informative Robot Communication Feb 3, 2025 Data Augmentation, Instruction Following
— Unverified 0 Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling Feb 2, 2025 Instruction Following
— Unverified 0 ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration Feb 2, 2025 Instruction Following, Natural Language Queries
— Unverified 0 mFollowIR: a Multilingual Benchmark for Instruction Following in Retrieval Jan 31, 2025 Instruction Following, Retrieval
Code Available 2 Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Jan 30, 2025 Instruction Following, Visual Reasoning
— Unverified 0