| Enhancing Perception of Key Changes in Remote Sensing Image Change Captioning | Sep 19, 2024 | Change DetectionDecoder | CodeCode Available | 1 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Large Language Models are Learnable Planners for Long-Term Recommendation | Feb 29, 2024 | Decision MakingLanguage Modelling | CodeCode Available | 1 |
| Enhancing Reasoning to Adapt Large Language Models for Domain-Specific Applications | Feb 5, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval | Oct 4, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| Content-Based Collaborative Generation for Recommender Systems | Mar 27, 2024 | Collaborative FilteringLanguage Modelling | CodeCode Available | 1 |
| EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery | Jan 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Automatic Evaluation of Attribution by Large Language Models | May 10, 2023 | Fact CheckingLanguage Modeling | CodeCode Available | 1 |
| Enhancing Conversational Search: Large Language Model-Aided Informative Query Rewriting | Oct 15, 2023 | Conversational SearchLanguage Modeling | CodeCode Available | 1 |
| Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions | Aug 5, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |