| Depicting Beyond Scores: Advancing Image Quality Assessment through Multi-modal Language Models | Dec 14, 2023 | Descriptive, Image Quality Assessment | Code Available | 2 | 5 |
| AmadeusGPT: a natural language interface for interactive animal behavioral analysis | Jul 10, 2023 | Descriptive | Code Available | 2 | 5 |
| RS-Agent: Automating Remote Sensing Tasks through Intelligent Agent | Jun 11, 2024 | AI Agent, Descriptive | Code Available | 2 | 5 |
| Composed Image Retrieval for Remote Sensing | May 24, 2024 | Composed Image Retrieval (CoIR), Descriptive | Code Available | 2 | 5 |
| Scalable 3D Captioning with Pretrained Models | Jun 12, 2023 | Descriptive, Image Captioning | Code Available | 2 | 5 |
| SCAMPS: Synthetics for Camera Measurement of Physiological Signals | Jun 8, 2022 | Descriptive, Diversity | Code Available | 2 | 5 |
| SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning | Jun 18, 2025 | Caption Generation, Descriptive | Code Available | 2 | 5 |
| SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description | Aug 24, 2024 | Descriptive, Speech Synthesis | Code Available | 2 | 5 |
| Deep Graph Matching under Quadratic Constraint | Mar 11, 2021 | Descriptive, Graph Matching | Code Available | 1 | 5 |
| Deep Implicit Statistical Shape Models for 3D Medical Image Delineation | Apr 7, 2021 | Descriptive, Liver Segmentation | Code Available | 1 | 5 |