| GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities | Jun 17, 2024 | Audio Question Answering, Instruction Following | Code Available | 2 |
| GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Nov 24, 2023 | Instruction Following, Language Modeling | Code Available | 2 |
| GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest | Jul 7, 2023 | Attribute, Common Sense Reasoning | Code Available | 2 |
| Implicit Neural Representation for Cooperative Low-light Image Enhancement | Mar 21, 2023 | Image Enhancement, Language Modeling | Code Available | 2 |
| Linear Transformers with Learnable Kernel Functions are Better In-Context Models | Feb 16, 2024 | In-Context Learning, Language Modeling | Code Available | 2 |
| PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| PAINT: Paying Attention to INformed Tokens to Mitigate Hallucination in Large Vision-Language Model | Jan 21, 2025 | Hallucination, Image Captioning | Code Available | 1 |
| A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability | Mar 12, 2023 | Code Generation, Language Modeling | Code Available | 1 |
| FIRE: Fact-checking with Iterative Retrieval and Verification | Oct 17, 2024 | Claim Verification, Fact Checking | Code Available | 1 |
| Masked Structural Growth for 2x Faster Language Model Pre-training | May 4, 2023 | Language Modeling, Language Modelling | Code Available | 1 |