GSCo: Towards Generalizable AI in Medicine via Generalist-Specialist Collaboration Apr 23, 2024 Collaborative Inference In-Context Learning
Code Code Available 25 Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions Aug 8, 2023 Caption Generation Image Captioning
Code Code Available 25 MentaLLaMA: Interpretable Mental Health Analysis on Social Media with Large Language Models Sep 24, 2023 Instruction Following
Code Code Available 25 Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks Jul 3, 2025 Instruction Following
Code Code Available 25 PhoGPT: Generative Pre-training for Vietnamese Nov 6, 2023 Instruction Following
Code Code Available 25 DeSTA2: Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data Sep 30, 2024 Instruction Following Language Modeling
Code Code Available 25 From Quantity to Quality: Boosting LLM Performance with Self-Guided Data Selection for Instruction Tuning Aug 23, 2023 Instruction Following
Code Code Available 25 LM-Nav: Robotic Navigation with Large Pre-Trained Models of Language, Vision, and Action Jul 10, 2022 Instruction Following Language Modeling
Code Code Available 25 Long-Context Language Modeling with Parallel Context Encoding Feb 26, 2024 In-Context Learning Instruction Following
Code Code Available 25 DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Jul 3, 2025 cross-modal alignment Instruction Following
Code Code Available 25 GraphWiz: An Instruction-Following Language Model for Graph Problems Feb 25, 2024 Instruction Following Language Modeling
Code Code Available 25 ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning Jan 4, 2024 Data Visualization Decision Making
Code Code Available 25 Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models Mar 19, 2024 Instruction Following visual instruction following
Code Code Available 25 DistiLLM-2: A Contrastive Approach Boosts the Distillation of LLMs Mar 10, 2025 Code Generation Instruction Following
Code Code Available 25 Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Feb 26, 2025 Instruction Following
Code Code Available 25 Diffusion-ES: Gradient-free Planning with Diffusion for Autonomous Driving and Zero-Shot Instruction Following Feb 9, 2024 Autonomous Driving Denoising
Code Code Available 25 GenAI Arena: An Open Evaluation Platform for Generative Models Jun 6, 2024 Image Generation Instruction Following
Code Code Available 25 GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks Feb 11, 2024 Graph Question Answering Instruction Following
Code Code Available 25 LMDrive: Closed-Loop End-to-End Driving with Large Language Models Dec 12, 2023 Autonomous Driving Instruction Following
Code Code Available 25 LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Mar 19, 2025 Instruction Following Multimodal Reasoning
Code Code Available 25 GeoChat: Grounded Large Vision-Language Model for Remote Sensing Nov 24, 2023 Instruction Following Language Modeling
Code Code Available 25 GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction May 30, 2023 Image Generation Instruction Following
Code Code Available 25 LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Nov 9, 2023 Instruction Following LLM real-life tasks
Code Code Available 25 GeoGround: A Unified Large Vision-Language Model for Remote Sensing Visual Grounding Nov 16, 2024 Instruction Following Language Modeling
Code Code Available 25 From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large Language Models Apr 24, 2024 Instruction Following
Code Code Available 25 LLaSM: Large Language and Speech Model Aug 30, 2023 Instruction Following Language Modeling
Code Code Available 25 LLaVAR: Enhanced Visual Instruction Tuning for Text-Rich Image Understanding Jun 29, 2023 16k Image Captioning
Code Code Available 25 Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward Apr 1, 2024 Instruction Following Language Modeling
Code Code Available 25 Archon: An Architecture Search Framework for Inference-Time Techniques Sep 23, 2024 Hyperparameter Optimization Instruction Following
Code Code Available 25 Dual-Space Knowledge Distillation for Large Language Models Jun 25, 2024 Instruction Following Knowledge Distillation
Code Code Available 25 Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want Mar 29, 2024 Instruction Following Language Modelling
Code Code Available 25 CoIN: A Benchmark of Continual Instruction tuNing for Multimodel Large Language Model Mar 13, 2024 General Knowledge Instruction Following
Code Code Available 25 LLark: A Multimodal Instruction-Following Language Model for Music Oct 11, 2023 Instruction Following Language Modeling
Code Code Available 25 LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts Dec 16, 2024 General Knowledge Instruction Following
Code Code Available 25 LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models Jun 15, 2023 Hallucination Image Captioning
Code Code Available 25 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts May 9, 2024 Image Captioning Instruction Following
Code Code Available 25 LHRS-Bot-Nova: Improved Multimodal Large Language Model for Remote Sensing Vision-Language Interpretation Nov 14, 2024 Earth Observation Instruction Following
Code Code Available 25 EditWorld: Simulating World Dynamics for Instruction-Following Image Editing May 23, 2024 Instruction Following
Code Code Available 25 Lion: Adversarial Distillation of Proprietary Large Language Models May 22, 2023 Instruction Following Knowledge Distillation
Code Code Available 25 F-LMM: Grounding Frozen Large Multimodal Models Jun 9, 2024 General Knowledge Instruction Following
Code Code Available 25 LITA: Language Instructed Temporal-Localization Assistant Mar 27, 2024 Instruction Following Temporal Localization
Code Code Available 25 Learning to Decode Collaboratively with Multiple Language Models Mar 6, 2024 Instruction Following
Code Code Available 25 CrystalFormer-RL: Reinforcement Fine-Tuning for Materials Design Apr 3, 2025 Band Gap Dielectric Constant
Code Code Available 25 BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs Jul 17, 2023 Instruction Following Sentence
Code Code Available 25 Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic Feb 19, 2024 Instruction Following Math
Code Code Available 25 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Jan 29, 2025 Instruction Following Math
Code Code Available 25 MiniLLM: Knowledge Distillation of Large Language Models Jun 14, 2023 Instruction Following Knowledge Distillation
Code Code Available 25 Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Nov 6, 2023 Decoder GSM8K
Code Code Available 25 BLSP-Emo: Towards Empathetic Large Speech-Language Models Jun 6, 2024 Emotion Recognition Instruction Following
Code Code Available 25 Large Language Model Instruction Following: A Survey of Progresses and Challenges Mar 18, 2023 Instruction Following Language Modeling
Code Code Available 25