Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Mar 31, 2025 General Reinforcement Learning Instruction Following
Code Code Available 2Effectively Controlling Reasoning Models through Thinking Intervention Mar 31, 2025 Instruction Following Safety Alignment
— Unverified 0Pay More Attention to the Robustness of Prompt for Instruction Data Mining Mar 31, 2025 Instruction Following
— Unverified 0Learning to Instruct for Visual Instruction Tuning Mar 28, 2025 Hallucination Instruction Following
— Unverified 0InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction Mar 26, 2025 Instruction Following Video Editing
Code Code Available 1Qwen2.5-Omni Technical Report Mar 26, 2025 Automatic Speech Recognition (ASR) GSM8K
Code Code Available 7Gemma 3 Technical Report Mar 25, 2025 Instruction Following Math
— Unverified 0SimpleRL-Zoo: Investigating and Taming Zero Reinforcement Learning for Open Base Models in the Wild Mar 24, 2025 Instruction Following Math
Code Code Available 7OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence Mar 20, 2025 Instruction Following Natural Language Understanding
— Unverified 0LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Mar 19, 2025 Instruction Following Multimodal Reasoning
Code Code Available 2Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings Mar 19, 2025 Instruction Following Large Language Model
Code Code Available 0ThinkPatterns-21k: A Systematic Study on the Impact of Thinking Patterns in LLMs Mar 17, 2025 Instruction Following
— Unverified 0Can Language Models Follow Multiple Turns of Entangled Instructions? Mar 17, 2025 Instruction Following Memorization
Code Code Available 1ICCO: Learning an Instruction-conditioned Coordinator for Language-guided Task-aligned Multi-robot Control Mar 15, 2025 Instruction Following Multi-agent Reinforcement Learning
— Unverified 0D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning Mar 14, 2025 Diversity Instruction Following
— Unverified 0ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning Mar 14, 2025 Code Generation Decoder
Code Code Available 0Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models Mar 13, 2025 Instruction Following
— Unverified 0Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding Mar 12, 2025 Instruction Following Video Understanding
— Unverified 0Got Compute, but No Data: Lessons From Post-training a Finnish LLM Mar 12, 2025 Instruction Following
— Unverified 0DAFE: LLM-Based Evaluation Through Dynamic Arbitration for Free-Form Question-Answering Mar 11, 2025 Form Instruction Following
— Unverified 0Open-World Skill Discovery from Unsegmented Demonstrations Mar 11, 2025 Boundary Detection Event Segmentation
— Unverified 0Robust Multi-Objective Controlled Decoding of Large Language Models Mar 11, 2025 Instruction Following
Code Code Available 0Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Mar 10, 2025 Image Description Image Generation
Code Code Available 2XIFBench: Evaluating Large Language Models on Multilingual Instruction Following Mar 10, 2025 Instruction Following Specificity
— Unverified 0DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs Mar 10, 2025 Code Generation Instruction Following
Code Code Available 2REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding Mar 10, 2025 Instruction Following Keypoint Detection
Code Code Available 1Dr Genre: Reinforcement Learning from Decoupled LLM Feedback for Generic Text Rewriting Mar 9, 2025 Instruction Following Large Language Model
— Unverified 0WildIFEval: Instruction Following in the Wild Mar 9, 2025 Instruction Following
Code Code Available 0RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs Mar 8, 2025 Instruction Following Mathematical Reasoning
Code Code Available 2S2S-Arena, Evaluating Speech2Speech Protocols on Instruction Following with Paralinguistic Information Mar 7, 2025 Instruction Following
— Unverified 0Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment Mar 6, 2025 Instruction Following Transfer Learning
Code Code Available 0FuseChat-3.0: Preference Optimization Meets Heterogeneous Model Fusion Mar 6, 2025 General Knowledge Instruction Following
Code Code Available 1IFIR: A Comprehensive Benchmark for Evaluating Instruction-Following in Expert-Domain Information Retrieval Mar 6, 2025 Information Retrieval Instruction Following
— Unverified 0CodeIF-Bench: Evaluating Instruction-Following Capabilities of Large Language Models in Interactive Code Generation Mar 5, 2025 Code Generation Instruction Following
— Unverified 0LEWIS (LayEr WIse Sparsity) -- A Training Free Guided Model Merging Approach Mar 5, 2025 Instruction Following Math
— Unverified 0Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models Mar 5, 2025 Hallucination Instruction Following
Code Code Available 11Unified Mind Model: Reimagining Autonomous Agents in the LLM Era Mar 5, 2025 Instruction Following
— Unverified 0Robust Learning of Diverse Code Edits Mar 5, 2025 Code Generation Instruction Following
— Unverified 0Iterative Value Function Optimization for Guided Decoding Mar 4, 2025 Decision Making Instruction Following
— Unverified 0InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training Mar 4, 2025 Instruction Following text-to-speech
— Unverified 0CrowdSelect: Synthetic Instruction Data Selection with Multi-LLM Wisdom Mar 3, 2025 Instruction Following
Code Code Available 1In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models Mar 3, 2025 In-Context Learning Instruction Following
— Unverified 0Re-Imagining Multimodal Instruction Tuning: A Representation View Mar 2, 2025 Instruction Following MME
Code Code Available 0Triple Phase Transitions: Understanding the Learning Dynamics of Large Language Models from a Neuroscience Perspective Feb 28, 2025 Instruction Following
— Unverified 0Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge Feb 27, 2025 GSM8K HumanEval
— Unverified 0DataMan: Data Manager for Pre-training Large Language Models Feb 26, 2025 In-Context Learning Instruction Following
— Unverified 0Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Feb 26, 2025 Instruction Following
Code Code Available 2Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models Feb 26, 2025 Instruction Following Vision-Language-Action
— Unverified 0Ground-level Viewpoint Vision-and-Language Navigation in Continuous Environments Feb 26, 2025 Instruction Following Vision and Language Navigation
— Unverified 0CodeIF: Benchmarking the Instruction-Following Capabilities of Large Language Models for Code Generation Feb 26, 2025 Benchmarking Code Generation
Code Code Available 1