| HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model | Mar 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BlueLM-2.5-3B Technical Report | Jul 8, 2025 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| Hear Me, See Me, Understand Me: Audio-Visual Autism Behavior Recognition | Mar 22, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| CoT-lized Diffusion: Let's Reinforce T2I Generation Step-by-step | Jul 6, 2025 | DenoisingLarge Language Model | —Unverified | 0 |
| A Large-scale Interpretable Multi-modality Benchmark for Facial Image Forgery Localization | Dec 27, 2024 | Face SwappingImage Segmentation | —Unverified | 0 |
| Comics for Everyone: Generating Accessible Text Descriptions for Comic Strips | Oct 1, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GUIDE: Graphical User Interface Data for Execution | Apr 9, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| Guardrails for avoiding harmful medical product recommendations and off-label promotion in generative AI models | Jun 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Text: Implementing Multimodal Large Language Model-Powered Multi-Agent Systems Using a No-Code Platform | Jan 1, 2025 | Code GenerationImage Generation | —Unverified | 0 |
| Make Imagination Clearer! Stable Diffusion-based Visual Imagination for Multimodal Machine Translation | Dec 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |