PromptFix: You Prompt and We Fix the Photo May 27, 2024 Denoising Image Generation
Code Available 45 LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day Jun 1, 2023 Image Classification Instruction Following
Code Available 45 LLaMA Pro: Progressive LLaMA with Block Expansion Jan 4, 2024 Instruction Following Math
Code Available 45 FuseChat: Knowledge Fusion of Chat Models Aug 15, 2024 Instruction Following
Code Available 45 Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Jan 23, 2024 All Instruction Following
Code Available 35 LaViDa: A Large Diffusion Language Model for Multimodal Understanding May 22, 2025 Instruction Following Language Modeling
Code Available 35 AudioBench: A Universal Benchmark for Audio Large Language Models Jun 23, 2024 Audio Scene Understanding Instruction Following
Code Available 35 AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback May 22, 2023 Instruction Following
Code Available 35 Caption Anything: Interactive Image Description with Diverse Multimodal Controls May 4, 2023 controllable image captioning Image Captioning
Code Available 35 VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Jan 21, 2025 Image Generation Instruction Following
Code Available 35 VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning Apr 3, 2025 Image Generation Instruction Following
Code Available 35 IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models May 22, 2025 Benchmarking Instruction Following
Code Available 35 X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages May 7, 2023 Attribute Instruction Following
Code Available 35 SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Dec 23, 2023 Instruction Following Language Modeling
Code Available 35 SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition Feb 27, 2024 Instruction Following Language Modeling
Code Available 35 Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Jun 19, 2024 Instruction Following
Code Available 35 Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning Feb 15, 2024 Data Augmentation Instruction Following
Code Available 35 ShapeLLM: Universal 3D Object Understanding for Embodied Interaction Feb 27, 2024 3D geometry 3D Object Captioning
Code Available 35 Refusal in Language Models Is Mediated by a Single Direction Jun 17, 2024 Instruction Following
Code Available 35 Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models Nov 14, 2023 Acoustic Scene Classification Audio captioning
Code Available 35 ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood Sep 14, 2024 Instruction Following Text Generation
Code Available 35 How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition Oct 9, 2023 Code Generation Instruction Following
Code Available 35 Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Apr 16, 2022 Benchmarking Instruction Following
Code Available 35 Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models May 4, 2023 Instruction Following
Code Available 35 OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning Feb 10, 2024 Federated Learning Instruction Following
Code Available 35 MultiModal-GPT: A Vision and Language Model for Dialogue with Humans May 8, 2023 Instruction Following Language Modeling
Code Available 35 MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning Jun 13, 2024 Instruction Following Math
Code Available 35 FlashFace: Human Image Personalization with High-fidelity Identity Preservation Mar 25, 2024 Face Swapping Image Generation
Code Available 35 EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models Feb 18, 2024 Event Extraction Hallucination
Code Available 35 NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models Jul 17, 2024 Instruction Following Vision and Language Navigation
Code Available 35 LongAlign: A Recipe for Long Context Alignment of Large Language Models Jan 31, 2024 Diversity Instruction Following
Code Available 35 DistiLLM: Towards Streamlined Distillation for Large Language Models Feb 6, 2024 Instruction Following Knowledge Distillation
Code Available 35 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data Aug 7, 2024 16k 2k
Code Available 35 How Can Recommender Systems Benefit from Large Language Models: A Survey Jun 9, 2023 Ethics Feature Engineering
Code Available 35 ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems Sep 2, 2024 Benchmarking Instruction Following
Code Available 35 LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis May 5, 2025 Chatbot Decoder
Code Available 35 Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception Oct 16, 2024 Binary Classification Chunking
Code Available 35 The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Jan 23, 2025 General Knowledge Instruction Following
Code Available 35 Learning to Decode Collaboratively with Multiple Language Models Mar 6, 2024 Instruction Following
Code Available 25 ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning Jan 4, 2024 Data Visualization Decision Making
Code Available 25 Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models Mar 19, 2024 Instruction Following visual instruction following
Code Available 25 Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic Feb 19, 2024 Instruction Following Math
Code Available 25 MiniLLM: Knowledge Distillation of Large Language Models Jun 14, 2023 Instruction Following Knowledge Distillation
Code Available 25 Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Nov 6, 2023 Decoder GSM8K
Code Available 25 CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts May 9, 2024 Image Captioning Instruction Following
Code Available 25 Large Language Model Instruction Following: A Survey of Progresses and Challenges Mar 18, 2023 Instruction Following Language Modeling
Code Available 25 CrystalFormer-RL: Reinforcement Fine-Tuning for Materials Design Apr 3, 2025 Band Gap Dielectric Constant
Code Available 25 Aligning Modalities in Vision Large Language Models via Preference Fine-tuning Feb 18, 2024 Hallucination Instruction Following
Code Available 25 A Critical Evaluation of AI Feedback for Aligning Large Language Models Feb 19, 2024 Instruction Following reinforcement-learning
Code Available 25 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Jan 29, 2025 Instruction Following Math
Code Available 25