RewardBench: Evaluating Reward Models for Language Modeling Mar 20, 2024 Instruction Following Language Modeling
Code Code Available 4LLaVA-Med: Training a Large Language-and-Vision Assistant for Biomedicine in One Day Jun 1, 2023 Image Classification Instruction Following
Code Code Available 4RewardBench 2: Advancing Reward Model Evaluation Jun 2, 2025 Instruction Following model
Code Code Available 4LLaMA Pro: Progressive LLaMA with Block Expansion Jan 4, 2024 Instruction Following Math
Code Code Available 4LaViDa: A Large Diffusion Language Model for Multimodal Understanding May 22, 2025 Instruction Following Language Modeling
Code Code Available 3Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Jan 23, 2024 All Instruction Following
Code Code Available 3AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback May 22, 2023 Instruction Following
Code Code Available 3X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages May 7, 2023 Attribute Instruction Following
Code Code Available 3VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning Apr 3, 2025 Image Generation Instruction Following
Code Code Available 3VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Jan 21, 2025 Image Generation Instruction Following
Code Code Available 3The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Jan 23, 2025 General Knowledge Instruction Following
Code Code Available 3IFEval-Audio: Benchmarking Instruction-Following Capability in Audio-based Large Language Models May 22, 2025 Benchmarking Instruction Following
Code Code Available 3Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks Apr 16, 2022 Benchmarking Instruction Following
Code Code Available 3AudioBench: A Universal Benchmark for Audio Large Language Models Jun 23, 2024 Audio Scene Understanding Instruction Following
Code Code Available 3SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling Dec 23, 2023 Instruction Following Language Modeling
Code Code Available 3Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Jun 19, 2024 Instruction Following
Code Code Available 3ShapeLLM: Universal 3D Object Understanding for Embodied Interaction Feb 27, 2024 3D geometry 3D Object Captioning
Code Code Available 3SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition Feb 27, 2024 Instruction Following Language Modeling
Code Code Available 3Refusal in Language Models Is Mediated by a Single Direction Jun 17, 2024 Instruction Following
Code Code Available 3ComfyBench: Benchmarking LLM-based Agents in ComfyUI for Autonomously Designing Collaborative AI Systems Sep 2, 2024 Benchmarking Instruction Following
Code Code Available 3ASFT: Aligned Supervised Fine-Tuning through Absolute Likelihood Sep 14, 2024 Instruction Following Text Generation
Code Code Available 3How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition Oct 9, 2023 Code Generation Instruction Following
Code Code Available 3Qwen-Audio: Advancing Universal Audio Understanding via Unified Large-Scale Audio-Language Models Nov 14, 2023 Acoustic Scene Classification Audio captioning
Code Code Available 3Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning Feb 15, 2024 Data Augmentation Instruction Following
Code Code Available 3EventRL: Enhancing Event Extraction with Outcome Supervision for Large Language Models Feb 18, 2024 Event Extraction Hallucination
Code Code Available 3Caption Anything: Interactive Image Description with Diverse Multimodal Controls May 4, 2023 controllable image captioning Image Captioning
Code Code Available 3Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models May 4, 2023 Instruction Following
Code Code Available 3MultiModal-GPT: A Vision and Language Model for Dialogue with Humans May 8, 2023 Instruction Following Language Modeling
Code Code Available 3NavGPT-2: Unleashing Navigational Reasoning Capability for Large Vision-Language Models Jul 17, 2024 Instruction Following Vision and Language Navigation
Code Code Available 3FlashFace: Human Image Personalization with High-fidelity Identity Preservation Mar 25, 2024 Face Swapping Image Generation
Code Code Available 3OpenFedLLM: Training Large Language Models on Decentralized Private Data via Federated Learning Feb 10, 2024 Federated Learning Instruction Following
Code Code Available 3DistiLLM: Towards Streamlined Distillation for Large Language Models Feb 6, 2024 Instruction Following Knowledge Distillation
Code Code Available 3Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical Perception Oct 16, 2024 Binary Classification Chunking
Code Code Available 3LongAlign: A Recipe for Long Context Alignment of Large Language Models Jan 31, 2024 Diversity Instruction Following
Code Code Available 3MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning Jun 13, 2024 Instruction Following Math
Code Code Available 3LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis May 5, 2025 Chatbot Decoder
Code Code Available 3How Can Recommender Systems Benefit from Large Language Models: A Survey Jun 9, 2023 Ethics Feature Engineering
Code Code Available 31.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data Aug 7, 2024 16k 2k
Code Code Available 3Learning to Decode Collaboratively with Multiple Language Models Mar 6, 2024 Instruction Following
Code Code Available 2Language Models are Super Mario: Absorbing Abilities from Homologous Models as a Free Lunch Nov 6, 2023 Decoder GSM8K
Code Code Available 2CrystalFormer-RL: Reinforcement Fine-Tuning for Materials Design Apr 3, 2025 Band Gap Dielectric Constant
Code Code Available 2Benchmarking Complex Instruction-Following with Multiple Constraints Composition Jul 4, 2024 Benchmarking Instruction Following
Code Code Available 2MiniLLM: Knowledge Distillation of Large Language Models Jun 14, 2023 Instruction Following Knowledge Distillation
Code Code Available 2CuMo: Scaling Multimodal LLM with Co-Upcycled Mixture-of-Experts May 9, 2024 Image Captioning Instruction Following
Code Code Available 2Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic Feb 19, 2024 Instruction Following Math
Code Code Available 2Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Jan 29, 2025 Instruction Following Math
Code Code Available 2Large Language Model Instruction Following: A Survey of Progresses and Challenges Mar 18, 2023 Instruction Following Language Modeling
Code Code Available 2BayLing: Bridging Cross-lingual Alignment and Instruction Following through Interactive Translation for Large Language Models Jun 19, 2023 Instruction Following Text Generation
Code Code Available 2Aligning Modalities in Vision Large Language Models via Preference Fine-tuning Feb 18, 2024 Hallucination Instruction Following
Code Code Available 2A Critical Evaluation of AI Feedback for Aligning Large Language Models Feb 19, 2024 Instruction Following reinforcement-learning
Code Code Available 2