| MING-MOE: Enhancing Medical Multi-Task Learning in Large Language Models with Sparse Mixture of Low-Rank Adapter Experts | Apr 13, 2024 | DiversityLanguage Modeling | CodeCode Available | 5 |
| WorkArena: How Capable Are Web Agents at Solving Common Knowledge Work Tasks? | Mar 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment | Mar 8, 2024 | DenoisingImage Generation | CodeCode Available | 5 |
| LAB: Large-Scale Alignment for ChatBots | Mar 2, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 5 |
| Retrieval-Augmented Generation for AI-Generated Content: A Survey | Feb 29, 2024 | Information RetrievalLarge Language Model | CodeCode Available | 5 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPULanguage Modeling | CodeCode Available | 5 |
| Datasets for Large Language Models: A Comprehensive Survey | Feb 28, 2024 | Language ModellingLarge Language Model | CodeCode Available | 5 |
| MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments | Feb 1, 2024 | Embodied Question AnsweringLanguage Modeling | CodeCode Available | 5 |
| Executable Code Actions Elicit Better LLM Agents | Feb 1, 2024 | Language ModellingLarge Language Model | CodeCode Available | 5 |