Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Jan 30, 2025 Instruction Following Visual Reasoning
— Unverified 0MultiChallenge: A Realistic Multi-Turn Conversation Evaluation Benchmark Challenging to Frontier LLMs Jan 29, 2025 All Instruction Following
Code Code Available 2Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Jan 29, 2025 Instruction Following Math
Code Code Available 2Janus-Pro: Unified Multimodal Understanding and Generation with Data and Model Scaling Jan 29, 2025 Image Generation
Code Code Available 113D-MoE: A Mixture-of-Experts Multi-modal LLM for 3D Vision and Pose Diffusion via Rectified Flow Jan 28, 2025 Instruction Following Mixture-of-Experts
— Unverified 0How well can LLMs Grade Essays in Arabic? Jan 27, 2025 Automated Essay Scoring In-Context Learning
— Unverified 0Advancing Mathematical Reasoning in Language Models: The Impact of Problem-Solving Data, Data Synthesis Methods, and Training Stages Jan 23, 2025 Instruction Following Math
— Unverified 0The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Jan 23, 2025 General Knowledge Instruction Following
Code Code Available 3Online Preference Alignment for Language Models via Count-based Exploration Jan 22, 2025 Instruction Following
Code Code Available 1Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Jan 22, 2025 Instruction Following
Code Code Available 2Compositional Instruction Following with Language Models and Reinforcement Learning Jan 21, 2025 In-Context Learning Instruction Following
— Unverified 0VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Jan 21, 2025 Image Generation Instruction Following
Code Code Available 3InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Jan 21, 2025 Instruction Following Mathematical Reasoning
Code Code Available 0Curiosity-Driven Reinforcement Learning from Human Feedback Jan 20, 2025 Diversity Instruction Following
Code Code Available 1BAP v2: An Enhanced Task Framework for Instruction Following in Minecraft Dialogues Jan 18, 2025 Instruction Following Minecraft
— Unverified 0Zero-shot and Few-shot Learning with Instruction-following LLMs for Claim Matching in Automated Fact-checking Jan 18, 2025 Binary Classification Fact Checking
— Unverified 0DNA 1.0 Technical Report Jan 18, 2025 Belebele GSM8K
— Unverified 0A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following Jan 14, 2025 Instruction Following
Code Code Available 1Iterative Label Refinement Matters More than Preference Optimization under Weak Supervision Jan 14, 2025 Instruction Following Math
Code Code Available 0Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness Jan 14, 2025 Event Extraction Instruction Following
Code Code Available 1Audio-CoT: Exploring Chain-of-Thought Reasoning in Large Audio Language Model Jan 13, 2025 Audio captioning Instruction Following
— Unverified 0A Comprehensive Evaluation of Large Language Models on Mental Illnesses in Arabic Context Jan 12, 2025 Binary Classification Diagnostic
— Unverified 0MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Jan 10, 2025 Instruction Following Language Modeling
— Unverified 0Scalable Vision Language Model Training via High Quality Data Curation Jan 10, 2025 Instruction Following Language Modeling
— Unverified 0Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models Jan 10, 2025 Form Image Comprehension
— Unverified 0Demystifying Domain-adaptive Post-training for Financial LLMs Jan 9, 2025 Continual Pretraining Domain Adaptation
Code Code Available 1LongViTU: Instruction Tuning for Long-Form Video Understanding Jan 9, 2025 EgoSchema Form
— Unverified 0Language and Planning in Robotic Navigation: A Multilingual Evaluation of State-of-the-Art Models Jan 7, 2025 Instruction Following Vision and Language Navigation
— Unverified 0DPO Kernels: A Semantically-Aware, Kernel-Enhanced, and Divergence-Rich Paradigm for Direct Preference Optimization Jan 5, 2025 Instruction Following
— Unverified 0Instruction-Following Pruning for Large Language Models Jan 3, 2025 Instruction Following Math
— Unverified 0ProgCo: Program Helps Self-Correction of Large Language Models Jan 2, 2025 Instruction Following
Code Code Available 0Towards Interactive Deepfake Analysis Jan 2, 2025 DeepFake Detection Face Swapping
Code Code Available 0SLADE: Shielding against Dual Exploits in Large Vision-Language Models Jan 1, 2025 Contrastive Learning Instruction Following
— Unverified 0HSI-GPT: A General-Purpose Large Scene-Motion-Language Model for Human Scene Interaction Jan 1, 2025 Descriptive Instruction Following
— Unverified 0MIMO: A Medical Vision Language Model with Visual Referring Multimodal Input and Pixel Grounding Multimodal Output Jan 1, 2025 Instruction Following Language Modeling
Code Code Available 0TinyHelen's First Curriculum: Training and Evaluating Tiny Language Models in a Simpler Language Environment Dec 31, 2024 Instruction Following Language Modeling
Code Code Available 1Hindsight Planner: A Closed-Loop Few-Shot Planner for Embodied Instruction Following Dec 27, 2024 Instruction Following
— Unverified 0Find the Intention of Instruction: Comprehensive Evaluation of Instruction Understanding for Large Language Models Dec 27, 2024 Instruction Following
Code Code Available 0Internalized Self-Correction for Large Language Models Dec 21, 2024 Instruction Following
— Unverified 0LearnLM: Improving Gemini for Learning Dec 21, 2024 Instruction Following
— Unverified 0Align Anything: Training All-Modality Models to Follow Instructions with Language Feedback Dec 20, 2024 All Instruction Following
Code Code Available 7HREF: Human Response-Guided Evaluation of Instruction Following in Language Models Dec 20, 2024 Instruction Following
Code Code Available 0Length Controlled Generation for Black-box LLMs Dec 19, 2024 Abstractive Text Summarization Instruction Following
— Unverified 0Qwen2.5 Technical Report Dec 19, 2024 Common Sense Reasoning
Code Code Available 13Systematic Evaluation of Long-Context LLMs on Financial Concepts Dec 19, 2024 Instruction Following
— Unverified 0A Systematic Examination of Preference Learning through the Lens of Instruction-Following Dec 18, 2024 Instruction Following Synthetic Data Generation
— Unverified 0MetaMorph: Multimodal Understanding and Generation via Instruction Tuning Dec 18, 2024 Instruction Following MORPH
— Unverified 0Pipeline Analysis for Developing Instruct LLMs in Low-Resource Languages: A Case Study on Basque Dec 18, 2024 Instruction Following Natural Language Understanding
— Unverified 0Question: How do Large Language Models perform on the Question Answering tasks? Answer: Dec 17, 2024 Articles Instruction Following
— Unverified 0LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering Dec 16, 2024 In-Context Learning Instruction Following
Code Code Available 0