| Hierarchical Integration Diffusion Model for Realistic Image Deblurring | May 22, 2023 | DeblurringImage Deblurring | CodeCode Available | 2 | 5 |
| BianCang: A Traditional Chinese Medicine Large Language Model | Nov 17, 2024 | DiagnosticLanguage Modeling | CodeCode Available | 2 | 5 |
| AIN: The Arabic INclusive Large Multimodal Model | Jan 31, 2025 | document understandingmodel | CodeCode Available | 2 | 5 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question AnsweringImage Captioning | CodeCode Available | 2 | 5 |
| Language Model Powered Digital Biology with BRAD | Sep 4, 2024 | ChatbotCode Generation | CodeCode Available | 2 | 5 |
| Graphic Design with Large Multimodal Model | Apr 22, 2024 | Layout Generationmodel | CodeCode Available | 2 | 5 |
| GestureDiffuCLIP: Gesture Diffusion Model with CLIP Latents | Mar 26, 2023 | Contrastive LearningGesture Generation | CodeCode Available | 2 | 5 |
| GhostFaceNets: Lightweight Face Recognition Model From Cheap Operations | Apr 10, 2023 | Face IdentificationFace Recognition | CodeCode Available | 2 | 5 |
| HiGPT: Heterogeneous Graph Language Model | Feb 25, 2024 | Graph LearningLanguage Modeling | CodeCode Available | 2 | 5 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |