LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day Jun 1, 2023 Image Classification Instruction Following
Code Code Available 4Otter: A Multi-Modal Model with In-Context Instruction Tuning May 5, 2023 GPU In-Context Learning
Code Code Available 4Instruction Tuning with GPT-4 Apr 6, 2023 Instruction Following
Code Code Available 4Not what you've signed up for: Compromising Real-World LLM-Integrated Applications with Indirect Prompt Injection Feb 23, 2023 Code Completion Computer Security
Code Code Available 4IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models May 22, 2025 Benchmarking Instruction Following
Code Code Available 3LaViDa: A Large Diffusion Language Model for Multimodal Understanding May 22, 2025 Instruction Following Language Modeling
Code Code Available 3LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis May 5, 2025 Chatbot Decoder
Code Code Available 3VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning Apr 3, 2025 Image Generation Instruction Following
Code Code Available 3The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Jan 23, 2025 General Knowledge Instruction Following
Code Code Available 3VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Jan 21, 2025 Image Generation Instruction Following
Code Code Available 3Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception Oct 16, 2024 Binary Classification Chunking
Code Code Available 3ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood Sep 14, 2024 Instruction Following Text Generation
Code Code Available 3ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems Sep 2, 2024 Benchmarking Instruction Following
Code Code Available 31.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data Aug 7, 2024 16k 2k
Code Code Available 3NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models Jul 17, 2024 Instruction Following Vision and Language Navigation
Code Code Available 3AudioBench: A Universal Benchmark for Audio Large Language Models Jun 23, 2024 Audio Scene Understanding Instruction Following
Code Code Available 3Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Jun 19, 2024 Instruction Following
Code Code Available 3Refusal in Language Models Is Mediated by a Single Direction Jun 17, 2024 Instruction Following
Code Code Available 3MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning Jun 13, 2024 Instruction Following Math
Code Code Available 3FlashFace: Human Image Personalization with High-fidelity Identity Preservation Mar 25, 2024 Face Swapping Image Generation
Code Code Available 3ShapeLLM: Universal 3D Object Understanding for Embodied Interaction Feb 27, 2024 3D geometry 3D Object Captioning
Code Code Available 3SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition Feb 27, 2024 Instruction Following Language Modeling
Code Code Available 3EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models Feb 18, 2024 Event Extraction Hallucination
Code Code Available 3Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning Feb 15, 2024 Data Augmentation Instruction Following
Code Code Available 3OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning Feb 10, 2024 Federated Learning Instruction Following
Code Code Available 3DistiLLM: Towards Streamlined Distillation for Large Language Models Feb 6, 2024 Instruction Following Knowledge Distillation
Code Code Available 3LongAlign: A Recipe for Long Context Alignment of Large Language Models Jan 31, 2024 Diversity Instruction Following
Code Code Available 3Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Jan 23, 2024 All Instruction Following
Code Code Available 3SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Dec 23, 2023 Instruction Following Language Modeling
Code Code Available 3Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models Nov 14, 2023 Acoustic Scene Classification Audio captioning
Code Code Available 3How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition Oct 9, 2023 Code Generation Instruction Following
Code Code Available 3How Can Recommender Systems Benefit from Large Language Models: A Survey Jun 9, 2023 Ethics Feature Engineering
Code Code Available 3AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback May 22, 2023 Instruction Following
Code Code Available 3MultiModal-GPT: A Vision and Language Model for Dialogue with Humans May 8, 2023 Instruction Following Language Modeling
Code Code Available 3X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages May 7, 2023 Attribute Instruction Following
Code Code Available 3Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models May 4, 2023 Instruction Following
Code Code Available 3Caption Anything: Interactive Image Description with Diverse Multimodal Controls May 4, 2023 controllable image captioning Image Captioning
Code Code Available 3Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Apr 16, 2022 Benchmarking Instruction Following
Code Code Available 3DrafterBench: Benchmarking Large Language Models for Tasks Automation in Civil Engineering Jul 15, 2025 Benchmarking Instruction Following
Code Code Available 2DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment Jul 3, 2025 cross-modal alignment Instruction Following
Code Code Available 2Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks Jul 3, 2025 Instruction Following
Code Code Available 2VerIF: Verification Engineering for Reinforcement Learning in Instruction Following Jun 11, 2025 Instruction Following reinforcement-learning
Code Code Available 2FusionAudio-1.2M: Towards Fine-grained Audio Captioning with Multimodal Contextual Fusion Jun 1, 2025 Audio captioning Caption Generation
Code Code Available 2When Large Multimodal Models Confront Evolving Knowledge:Challenges and Pathways May 30, 2025 Continual Learning Image Augmentation
Code Code Available 2How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients Apr 14, 2025 Instruction Following
Code Code Available 2MM-IFEngine: Towards Multimodal Instruction Following Apr 10, 2025 Instruction Following
Code Code Available 2Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language Models Apr 7, 2025 Dialogue Evaluation Fairness
Code Code Available 2CrystalFormer-RL: Reinforcement Fine-Tuning for Materials Design Apr 3, 2025 Band Gap Dielectric Constant
Code Code Available 2Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Mar 31, 2025 General Reinforcement Learning Instruction Following
Code Code Available 2LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning Mar 19, 2025 Instruction Following Multimodal Reasoning
Code Code Available 2