| MuLan: Multimodal-LLM Agent for Progressive and Interactive Multi-Object Diffusion | Feb 20, 2024 | AttributeLanguage Modeling | CodeCode Available | 1 | 5 |
| M^2Chat: Empowering VLM for Multimodal LLM Interleaved Text-Image Generation | Nov 29, 2023 | Image GenerationLanguage Modelling | CodeCode Available | 1 | 5 |
| MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning | Aug 21, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| In-context Autoencoder for Context Compression in a Large Language Model | Jul 13, 2023 | GPULanguage Modeling | CodeCode Available | 1 | 5 |
| Multi-Modal Classifiers for Open-Vocabulary Object Detection | Jun 8, 2023 | Language ModellingLarge Language Model | CodeCode Available | 1 | 5 |
| Dataset Distillation via Vision-Language Category Prototype | Jun 30, 2025 | Dataset DistillationDescriptive | CodeCode Available | 1 | 5 |
| Motif: Intrinsic Motivation from Artificial Intelligence Feedback | Sep 29, 2023 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| Inference with Reference: Lossless Acceleration of Large Language Models | Apr 10, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| DebUnc: Improving Large Language Model Agent Communication With Uncertainty Metrics | Jul 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |