AnyCap Project: A Unified Framework, Dataset, and Benchmark for Controllable Omni-modal Captioning Jul 17, 2025 Instruction Following
— Unverified 0How Many Instructions Can LLMs Follow at Once? Jul 15, 2025 Instruction Following
— Unverified 0DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Jul 15, 2025 Benchmarking Instruction Following
Code Code Available 2Multilingual Multimodal Software Developer for Code Generation Jul 11, 2025 Code Generation Instruction Following
— Unverified 0TuneShield: Mitigating Toxicity in Conversational AI while Fine-tuning on Untrusted Data Jul 8, 2025 Chatbot Instruction Following
— Unverified 0Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks Jul 3, 2025 Instruction Following
Code Code Available 2DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Jul 3, 2025 cross-modal alignment Instruction Following
Code Code Available 2Kwai Keye-VL Technical Report Jul 2, 2025 Instruction Following Reinforcement Learning (RL)
Code Code Available 4Bridging Offline and Online Reinforcement Learning for LLMs Jun 26, 2025 Instruction Following Math
— Unverified 0LLaVA-Pose: Enhancing Human Pose and Action Understanding via Keypoint-Integrated Instruction Tuning Jun 26, 2025 Action Understanding Instruction Following
Code Code Available 0Multi-lingual Functional Evaluation for Large Language Models Jun 25, 2025 Belebele Instruction Following
— Unverified 0Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models Jun 24, 2025 Instruction Following reinforcement-learning
— Unverified 0JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent Jun 21, 2025 Instruction Following Large Language Model
— Unverified 0InstructTTSEval: Benchmarking Complex Natural-Language Instruction Following in Text-to-Speech Systems Jun 19, 2025 Benchmarking Descriptive
Code Code Available 1Treasure Hunt: Real-time Targeting of the Long Tail using Training-Time Markers Jun 17, 2025 Instruction Following Prompt Engineering
— Unverified 0Instruction Following by Boosting Attention of Large Language Models Jun 16, 2025 Instruction Following Prompt Engineering
— Unverified 0Mixture of Weight-shared Heterogeneous Group Attention Experts for Dynamic Token-wise KV Optimization Jun 16, 2025 Causal Language Modeling Instruction Following
— Unverified 0LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction Jun 16, 2025 Instruction Following Vision-Language-Action
— Unverified 0MM-R5: MultiModal Reasoning-Enhanced ReRanker via Reinforcement Learning for Document Retrieval Jun 14, 2025 Instruction Following Multimodal Reasoning
Code Code Available 0CMI-Bench: A Comprehensive Benchmark for Evaluating Music Instruction Following Jun 14, 2025 Beat Tracking Genre classification
— Unverified 0HalLoc: Token-level Localization of Hallucinations for Vision Language Models Jun 12, 2025 Hallucination Image Captioning
Code Code Available 0AC/DC: LLM-based Audio Comprehension via Dialogue Continuation Jun 12, 2025 AudioCaps Audio captioning
— Unverified 0Conversational Search: From Fundamentals to Frontiers in the LLM Era Jun 12, 2025 Conversational Search Instruction Following
— Unverified 0Magistral Jun 12, 2025 Instruction Following Reinforcement Learning (RL)
— Unverified 0Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning Jun 12, 2025 Instruction Following Mathematical Reasoning
Code Code Available 0Alzheimer's Dementia Detection Using Perplexity from Paired Large Language Models Jun 11, 2025 Data Augmentation Decision Making
— Unverified 0VerIF: Verification Engineering for Reinforcement Learning in Instruction Following Jun 11, 2025 Instruction Following reinforcement-learning
Code Code Available 2LLaVA-c: Continual Improved Visual Instruction Tuning Jun 10, 2025 Continual Learning Continual Pretraining
— Unverified 0RHealthTwin: Towards Responsible and Multimodal Digital Twins for Personalized Well-being Jun 10, 2025 Hallucination Instruction Following
— Unverified 0EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models Jun 10, 2025 Instruction Following Navigate
Code Code Available 0LeVo: High-Quality Song Generation with Multi-Preference Alignment Jun 9, 2025 Instruction Following Music Generation
Code Code Available 5Video Unlearning via Low-Rank Refusal Vector Jun 9, 2025 Instruction Following
— Unverified 0Aligning Text, Images, and 3D Structure Token-by-Token Jun 9, 2025 3D Object Recognition Instruction Following
— Unverified 0Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text Jun 8, 2025 Instruction Following
Code Code Available 1Audio-Aware Large Language Models as Judges for Speaking Styles Jun 6, 2025 Instruction Following Pitch control
— Unverified 0Being Strong Progressively! Enhancing Knowledge Distillation of Large Language Models through a Curriculum Learning Framework Jun 6, 2025 Instruction Following Knowledge Distillation
Code Code Available 0RELIC: Evaluating Compositional Instruction Following via Language Recognition Jun 5, 2025 Instruction Following
— Unverified 0Unleashing Hour-Scale Video Training for Long Video-Language Understanding Jun 5, 2025 Instruction Following Language Modeling
— Unverified 0SeedEdit 3.0: Fast and High-Quality Generative Image Editing Jun 5, 2025 Instruction Following
— Unverified 0Identifying Reliable Evaluation Metrics for Scientific Text Revision Jun 5, 2025 Instruction Following
Code Code Available 0On the Mechanism of Reasoning Pattern Selection in Reinforcement Learning for Language Models Jun 5, 2025 Instruction Following Reinforcement Learning (RL)
— Unverified 0Robust Anti-Backdoor Instruction Tuning in LVLMs Jun 4, 2025 backdoor defense Instruction Following
— Unverified 0RewardAnything: Generalizable Principle-Following Reward Models Jun 4, 2025 Instruction Following Large Language Model
Code Code Available 1MASTER: Enhancing Large Language Model via Multi-Agent Simulated Teaching Jun 3, 2025 Data Augmentation Instruction Following
— Unverified 0TIIF-Bench: How Does Your T2I Model Follow Your Instructions? Jun 2, 2025 Benchmarking Instruction Following
— Unverified 0Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models Jun 2, 2025 Instruction Following Reinforcement Learning (RL)
Code Code Available 1RewardBench 2: Advancing Reward Model Evaluation Jun 2, 2025 Instruction Following model
Code Code Available 4MoDA: Modulation Adapter for Fine-Grained Visual Grounding in Instructional MLLMs Jun 2, 2025 Instruction Following Text Generation
— Unverified 0FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion Jun 1, 2025 Audio captioning Caption Generation
Code Code Available 2PersianMedQA: Language-Centric Evaluation of LLMs in the Persian Medical Domain May 30, 2025 Instruction Following Multiple-choice
— Unverified 0