Investigating Non-Transitivity in LLM-as-a-Judge Feb 19, 2025 Chatbot Computational Efficiency
— Unverified 0Instruction Tuning on Public Government and Cultural Data for Low-Resource Language: a Case Study in Kazakh Feb 19, 2025 Instruction Following Multiple-choice
— Unverified 0TALKPLAY: Multimodal Music Recommendation with Large Language Models Feb 19, 2025 Conversational Recommendation Instruction Following
— Unverified 0MMTEB: Massive Multilingual Text Embedding Benchmark Feb 19, 2025 Instruction Following Retrieval
— Unverified 0Integrating Arithmetic Learning Improves Mathematical Reasoning in Smaller Models Feb 18, 2025 Data Augmentation GSM8K
— Unverified 0RoleMRC: A Fine-Grained Composite Benchmark for Role-Playing and Instruction-Following Feb 17, 2025 Instruction Following Machine Reading Comprehension
Code Code Available 0Learning to Keep a Promise: Scaling Language Model Decoding Parallelism with Learned Asynchronous Decoding Feb 17, 2025 Instruction Following Language Modeling
— Unverified 0MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training Feb 17, 2025 Instruction Following
Code Code Available 0SAIF: A Sparse Autoencoder Framework for Interpreting and Steering Instruction Following of Language Models Feb 17, 2025 Instruction Following
— Unverified 0Do we Really Need Visual Instructions? Towards Visual Instruction-Free Fine-tuning for Large Vision-Language Models Feb 17, 2025 Instruction Following visual instruction following
— Unverified 0CORDIAL: Can Multimodal Large Language Models Effectively Understand Coherence Relationships? Feb 16, 2025 Instruction Following
Code Code Available 0Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction Feb 16, 2025 Instruction Following
Code Code Available 0Cuckoo: An IE Free Rider Hatched by Massive Nutrition in LLM's Nest Feb 16, 2025 Instruction Following Nutrition
Code Code Available 0E2LVLM:Evidence-Enhanced Large Vision-Language Model for Multimodal Out-of-Context Misinformation Detection Feb 12, 2025 Instruction Following Language Modeling
— Unverified 0Who Taught You That? Tracing Teachers in Model Distillation Feb 10, 2025 Instruction Following POS
— Unverified 0Temporal Representation Alignment: Successor Features Enable Emergent Compositionality in Robot Instruction Following Feb 8, 2025 Instruction Following
— Unverified 0Hypencoder: Hypernetworks for Information Retrieval Feb 7, 2025 Information Retrieval Instruction Following
— Unverified 0Verifiable Format Control for Large Language Model Generations Feb 6, 2025 Benchmarking Instruction Following
— Unverified 0LLMs can be easily Confused by Instructional Distractions Feb 5, 2025 Bias Detection Code Generation
— Unverified 0Training an LLM-as-a-Judge Model: Pipeline, Insights, and Practical Lessons Feb 5, 2025 Instruction Following Knowledge Distillation
— Unverified 0SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Feb 4, 2025 Instruction Following Language Modeling
— Unverified 0Shuttle Between the Instructions and the Parameters of Large Language Models Feb 4, 2025 Dimensionality Reduction Instruction Following
— Unverified 0CoDe: Blockwise Control for Denoising Diffusion Models Feb 3, 2025 Denoising Instruction Following
Code Code Available 0Learning Human Perception Dynamics for Informative Robot Communication Feb 3, 2025 Data Augmentation Instruction Following
— Unverified 0BARE: Leveraging Base Language Models for Few-Shot Synthetic Data Generation Feb 3, 2025 Diversity GSM8K
— Unverified 0Disentangling Length Bias In Preference Learning Via Response-Conditioned Modeling Feb 2, 2025 Instruction Following
— Unverified 0ReFoRCE: A Text-to-SQL Agent with Self-Refinement, Format Restriction, and Column Exploration Feb 2, 2025 Instruction Following Natural Language Queries
— Unverified 0Self-supervised Quantized Representation for Seamlessly Integrating Knowledge Graphs with Large Language Models Jan 30, 2025 Instruction Following Knowledge Graphs
— Unverified 0Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Jan 30, 2025 Instruction Following Visual Reasoning
— Unverified 03D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Jan 28, 2025 Instruction Following Mixture-of-Experts
— Unverified 0How well can LLMs Grade Essays in Arabic? Jan 27, 2025 Automated Essay Scoring In-Context Learning
— Unverified 0Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages Jan 23, 2025 Instruction Following Math
— Unverified 0Compositional Instruction Following with Language Models and Reinforcement Learning Jan 21, 2025 In-Context Learning Instruction Following
— Unverified 0InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Jan 21, 2025 Instruction Following Mathematical Reasoning
— Unverified 0Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking Jan 18, 2025 Binary Classification Fact Checking
— Unverified 0BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues Jan 18, 2025 Instruction Following Minecraft
— Unverified 0DNA 1.0 Technical Report Jan 18, 2025 Belebele GSM8K
— Unverified 0Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision Jan 14, 2025 Instruction Following Math
Code Code Available 0Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model Jan 13, 2025 Audio captioning Instruction Following
— Unverified 0A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context Jan 12, 2025 Binary Classification Diagnostic
— Unverified 0MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Jan 10, 2025 Instruction Following Language Modeling
— Unverified 0Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models Jan 10, 2025 Form Image Comprehension
— Unverified 0Scalable Vision Language Model Training via High Quality Data Curation Jan 10, 2025 Instruction Following Language Modeling
— Unverified 0LongViTU: Instruction Tuning for Long-Form Video Understanding Jan 9, 2025 EgoSchema Form
— Unverified 0Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models Jan 7, 2025 Instruction Following Vision and Language Navigation
— Unverified 0DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization Jan 5, 2025 Instruction Following
— Unverified 0Instruction-Following Pruning for Large Language Models Jan 3, 2025 Instruction Following Math
— Unverified 0Towards Interactive Deepfake Analysis Jan 2, 2025 DeepFake Detection Face Swapping
Code Code Available 0ProgCo: Program Helps Self-Correction of Large Language Models Jan 2, 2025 Instruction Following
Code Code Available 0MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output Jan 1, 2025 Instruction Following Language Modeling
Code Code Available 0