| SCOPE: Sign Language Contextual Processing with Embedding from LLMs | Sep 2, 2024 | DiversityLanguage Modeling | CodeCode Available | 0 |
| SAM4MLLM: Enhance Multi-Modal Large Language Model for Referring Expression Segmentation | Sep 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Multimodal Multi-turn Conversation Stance Detection: A Challenge Dataset and Effective Model | Sep 1, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Comparing Discrete and Continuous Space LLMs for Speech Recognition | Sep 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| OrthoDoc: Multimodal Large Language Model for Assisting Diagnosis in Computed Tomography | Aug 30, 2024 | Computed Tomography (CT)Diagnostic | —Unverified | 0 |
| MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models | Aug 30, 2024 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Joint Estimation and Prediction of City-wide Delivery Demand: A Large Language Model Empowered Graph-based Learning Approach | Aug 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding | Aug 30, 2024 | Language ModellingLarge Language Model | CodeCode Available | 0 |
| Getting Inspiration for Feature Elicitation: App Store- vs. LLM-based Approach | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ChatSUMO: Large Language Model for Automating Traffic Scenario Generation in Simulation of Urban MObility | Aug 29, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |