| M^2Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation | Nov 29, 2023 | Image GenerationLanguage Modelling | CodeCode Available | 1 |
| ModaVerse: Efficiently Transforming Modalities with LLMs | Jan 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Nov 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Emergent Analogical Reasoning in Large Language Models | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Enhancing the Geometric Problem-Solving Ability of Multimodal LLMs via Symbolic-Neural Integration | Apr 17, 2025 | Geometry Problem SolvingLarge Language Model | CodeCode Available | 1 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |
| Dynamic Updates for Language Adaptation in Visual-Language Tracking | Mar 9, 2025 | Large Language Model | CodeCode Available | 1 |
| DynaPipe: Optimizing Multi-task Training through Dynamic Pipelines | Nov 17, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 |
| ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training | Dec 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DStruct2Design: Data and Benchmarks for Data Structure Driven Generative Floor Plan Design | Jul 22, 2024 | Image GenerationLanguage Modelling | CodeCode Available | 1 |