| Kosmos-2: Grounding Multimodal Large Language Models to the World | Jun 26, 2023 | Image CaptioningIn-Context Learning | CodeCode Available | 1 |
| Are Sparse Autoencoders Useful? A Case Study in Sparse Probing | Feb 23, 2025 | Inductive BiasLarge Language Model | CodeCode Available | 1 |
| AceGPT, Localizing Large Language Models in Arabic | Sep 21, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| L2MAC: Large Language Model Automatic Computer for Extensive Code Generation | Oct 2, 2023 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Jakiro: Boosting Speculative Decoding with Decoupled Multi-Head via MoE | Feb 10, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| JamendoMaxCaps: A Large Scale Music-caption Dataset with Imputed Metadata | Feb 11, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Refer-and-Ground Multimodal Large Language Model for Biomedicine | Jun 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Realistic Threat Model for Large Language Model Jailbreaks | Oct 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| End-to-End Beam Retrieval for Multi-Hop Question Answering | Aug 17, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |