| A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Dec 21, 2023 | Common Sense ReasoningDescriptive | CodeCode Available | 1 |
| Ins-HOI: Instance Aware Human-Object Interactions Recovery | Dec 15, 2023 | DescriptiveDisentanglement | CodeCode Available | 1 |
| Unveiling Parts Beyond Objects:Towards Finer-Granularity Referring Expression Segmentation | Dec 13, 2023 | DescriptiveObject | CodeCode Available | 1 |
| NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations | Dec 11, 2023 | Autonomous DrivingDescriptive | CodeCode Available | 1 |
| JAMMIN-GPT: Text-based Improvisation using LLMs in Ableton Live | Dec 6, 2023 | Descriptive | CodeCode Available | 1 |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Nov 30, 2023 | DescriptiveLanguage Modelling | CodeCode Available | 1 |
| MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts | Nov 16, 2023 | Binary ClassificationDescriptive | CodeCode Available | 1 |
| Zero-shot audio captioning with audio-language model guidance and audio context keywords | Nov 14, 2023 | Audio captioningDescriptive | CodeCode Available | 1 |
| FaithScore: Fine-grained Evaluations of Hallucinations in Large Vision-Language Models | Nov 2, 2023 | DescriptiveInstruction Following | CodeCode Available | 1 |
| This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models | Oct 24, 2023 | DescriptiveNegation | CodeCode Available | 1 |