| CiteTracker: Correlating Image and Text for Visual Tracking | Aug 22, 2023 | AttributeDescriptive | CodeCode Available | 1 | 5 |
| Hallucination-Aware Multimodal Benchmark for Gastrointestinal Image Analysis with Large Vision-Language Models | May 11, 2025 | DescriptiveDiagnostic | CodeCode Available | 1 | 5 |
| A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) | Apr 2, 2024 | Descriptive | CodeCode Available | 1 | 5 |
| A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Dec 21, 2023 | Common Sense ReasoningDescriptive | CodeCode Available | 1 | 5 |
| Human-like Controllable Image Captioning with Verb-specific Semantic Roles | Mar 22, 2021 | Caption Generationcontrollable image captioning | CodeCode Available | 1 | 5 |
| Hybrid Symbolic-Numeric Library for Power System Modeling and Analysis | Feb 21, 2020 | Descriptive | CodeCode Available | 1 | 5 |
| A Sketch-Based Neural Model for Generating Commit Messages from Diffs | Apr 8, 2021 | Code GenerationDescriptive | CodeCode Available | 1 | 5 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text SummarizationDescriptive | CodeCode Available | 1 | 5 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | AttributeDescriptive | CodeCode Available | 1 | 5 |
| From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering | May 30, 2022 | counterfactualDescriptive | CodeCode Available | 1 | 5 |