| CiteTracker: Correlating Image and Text for Visual Tracking | Aug 22, 2023 | Attribute, Descriptive | Code Available | 1 | 5 |
| Comprehensive Information Integration Modeling Framework for Video Titling | Jun 24, 2020 | Descriptive, Video Captioning | Code Available | 1 | 5 |
| A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) | Apr 2, 2024 | Descriptive | Code Available | 1 | 5 |
| A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Dec 21, 2023 | Common Sense Reasoning, Descriptive | Code Available | 1 | 5 |
| A Recipe for Creating Multimodal Aligned Datasets for Sequential Tasks | May 19, 2020 | Descriptive | Code Available | 1 | 5 |
| IRB-NLP at SemEval-2022 Task 1: Exploring the Relationship Between Words and Their Semantic Representations | May 13, 2022 | Descriptive, Reverse Dictionary | Code Available | 1 | 5 |
| A Sketch-Based Neural Model for Generating Commit Messages from Diffs | Apr 8, 2021 | Code Generation, Descriptive | Code Available | 1 | 5 |
| GraphLIME: Local Interpretable Model Explanations for Graph Neural Networks | Jan 17, 2020 | Descriptive, feature selection | Code Available | 1 | 5 |
| GL-RG: Global-Local Representation Granularity for Video Captioning | May 22, 2022 | Caption Generation, Descriptive | Code Available | 1 | 5 |
| Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding | Jan 1, 2023 | Descriptive, Object | Code Available | 1 | 5 |
| ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Mar 2, 2023 | Descriptive, Image Captioning | Code Available | 1 | 5 |
| Contextual Non-Local Alignment over Full-Scale Representation for Text-Based Person Search | Jan 8, 2021 | Descriptive, Sentence | Code Available | 1 | 5 |
| Contrastive Learning of Medical Visual Representations from Paired Images and Text | Oct 2, 2020 | Contrastive Learning, Descriptive | Code Available | 1 | 5 |
| Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Jan 28, 2024 | Contrastive Learning, Descriptive | Code Available | 1 | 5 |
| Learning to Color from Language | Apr 17, 2018 | Colorization, Descriptive | Code Available | 1 | 5 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | Denoising, Descriptive | Code Available | 1 | 5 |
| Natural scene reconstruction from fMRI signals using generative latent diffusion | Mar 9, 2023 | Brain Computer Interface, Brain Decoding | Code Available | 1 | 5 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | Descriptive, Object | Code Available | 1 | 5 |
| GraphXAIN: Narratives to Explain Graph Neural Networks | Nov 4, 2024 | Descriptive, Feature Importance | Code Available | 1 | 5 |
| CTRLsum: Towards Generic Controllable Text Summarization | Dec 8, 2020 | Articles, Descriptive | Code Available | 1 | 5 |
| Dataset Distillation via Vision-Language Category Prototype | Jun 30, 2025 | Dataset Distillation, Descriptive | Code Available | 1 | 5 |
| Bias Loss for Mobile Neural Networks | Jul 23, 2021 | Descriptive, Diversity | Code Available | 1 | 5 |
| Datasheets Aren't Enough: DataRubrics for Automated Quality Metrics and Accountability | Jun 2, 2025 | Descriptive, Synthetic Data Generation | Code Available | 1 | 5 |
| Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training | Jan 4, 2024 | Descriptive, Image Captioning | Code Available | 1 | 5 |
| Beyond Co-occurrence: Multi-modal Session-based Recommendation | Sep 29, 2023 | Contrastive Learning, Descriptive | Code Available | 1 | 5 |