| Can Machines Learn Morality? The Delphi Experiment | Oct 14, 2021 | Descriptive, Ethics | Code Available | 1 |
| Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval | Oct 4, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) | Apr 2, 2024 | Descriptive | Code Available | 1 |
| A Semantic Space is Worth 256 Language Descriptions: Make Stronger Segmentation Models with Descriptive Properties | Dec 21, 2023 | Common Sense Reasoning, Descriptive | Code Available | 1 |
| CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions | Dec 8, 2020 | counterfactual, Descriptive | Code Available | 1 |
| Field Convolutions for Surface CNNs | Apr 8, 2021 | Descriptive | Code Available | 1 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | Denoising, Descriptive | Code Available | 1 |
| Finetune like you pretrain: Improved finetuning of zero-shot vision models | Dec 1, 2022 | Descriptive, Few-Shot Learning | Code Available | 1 |
| CTRLsum: Towards Generic Controllable Text Summarization | Dec 8, 2020 | Articles, Descriptive | Code Available | 1 |
| FontCLIP: A Semantic Typography Visual-Language Model for Multilingual Font Applications | Mar 11, 2024 | Attribute, Descriptive | Code Available | 1 |
| From Representation to Reasoning: Towards both Evidence and Commonsense Reasoning for Video Question-Answering | May 30, 2022 | counterfactual, Descriptive | Code Available | 1 |
| Contrastive Learning and Mixture of Experts Enables Precise Vector Embeddings | Jan 28, 2024 | Contrastive Learning, Descriptive | Code Available | 1 |
| Generating images from caption and vice versa via CLIP-Guided Generative Latent Space Search | Feb 2, 2021 | Descriptive, Image Generation | Code Available | 1 |
| Generating Parametric BRDFs from Natural Language Descriptions | Jun 19, 2023 | Descriptive | Code Available | 1 |
| GOAL: Global-local Object Alignment Learning | Mar 22, 2025 | Descriptive, Object | Code Available | 1 |
| Graph Backdoor | Jun 21, 2020 | Backdoor Attack, Descriptive | Code Available | 1 |
| Contrastive Audio-Language Learning for Music | Aug 25, 2022 | Audio to Text Retrieval, Descriptive | Code Available | 1 |
| Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation | Aug 24, 2023 | cross-modal alignment, Descriptive | Code Available | 1 |
| Contrastive Learning of Medical Visual Representations from Paired Images and Text | Oct 2, 2020 | Contrastive Learning, Descriptive | Code Available | 1 |
| HDCC: A Hyperdimensional Computing compiler for classification on embedded systems and high-performance computing | Apr 24, 2023 | C++ code, Descriptive | Code Available | 1 |
| Human-like Controllable Image Captioning with Verb-specific Semantic Roles | Mar 22, 2021 | Caption Generation, controllable image captioning | Code Available | 1 |
| Hybrid Symbolic-Numeric Library for Power System Modeling and Analysis | Feb 21, 2020 | Descriptive | Code Available | 1 |
| IDAS: Intent Discovery with Abstractive Summarization | May 31, 2023 | Abstractive Text Summarization, Descriptive | Code Available | 1 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| Conditional Generative Adversarial Nets | Nov 6, 2014 | Descriptive, Human action generation | Code Available | 1 |
| Comprehensive Information Integration Modeling Framework for Video Titling | Jun 24, 2020 | Descriptive, Video Captioning | Code Available | 1 |
| Confidence-aware Pseudo-label Learning for Weakly Supervised Visual Grounding | Jan 1, 2023 | Descriptive, Object | Code Available | 1 |
| LaMOT: Language-Guided Multi-Object Tracking | Jun 12, 2024 | Descriptive, Multi-Object Tracking | Code Available | 1 |
| CiteTracker: Correlating Image and Text for Visual Tracking | Aug 22, 2023 | Attribute, Descriptive | Code Available | 1 |
| Learning to Color from Language | Apr 17, 2018 | Colorization, Descriptive | Code Available | 1 |
| Leveraging Large Language Models for Enhancing the Understandability of Generated Unit Tests | Aug 21, 2024 | Bug fixing, Descriptive | Code Available | 1 |
| Logical Consistency and Greater Descriptive Power for Facial Hair Attribute Learning | Feb 22, 2023 | Attribute, Descriptive | Code Available | 1 |
| Mining Fine-Grained Image-Text Alignment for Zero-Shot Captioning via Text-Only Training | Jan 4, 2024 | Descriptive, Image Captioning | Code Available | 1 |
| Mixture of Low-rank Experts for Transferable AI-Generated Image Detection | Apr 7, 2024 | Descriptive, parameter-efficient fine-tuning | Code Available | 1 |
| MMPD: Multi-Domain Mobile Video Physiology Dataset | Feb 8, 2023 | Descriptive, Diversity | Code Available | 1 |
| Möbius Convolutions for Spherical CNNs | Jan 28, 2022 | Descriptive, Image Segmentation | Code Available | 1 |
| Beyond Co-occurrence: Multi-modal Session-based Recommendation | Sep 29, 2023 | Contrastive Learning, Descriptive | Code Available | 1 |
| Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models | May 5, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| MultiFace: A Generic Training Mechanism for Boosting Face Recognition Performance | Jan 25, 2021 | Clustering, Descriptive | Code Available | 1 |
| Multi-Grained Multimodal Interaction Network for Entity Linking | Jul 19, 2023 | Contrastive Learning, Descriptive | Code Available | 1 |
| CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving | Jul 26, 2022 | 3D Semantic Segmentation, Autonomous Driving | Code Available | 1 |
| ConTEXTual Net: A Multimodal Vision-Language Model for Segmentation of Pneumothorax | Mar 2, 2023 | Descriptive, Image Captioning | Code Available | 1 |
| NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation | Sep 14, 2023 | Descriptive, Recommendation Systems | Code Available | 1 |
| NLQuAD: A Non-Factoid Long Question Answering Data Set | Apr 1, 2021 | Descriptive, Position | Code Available | 1 |
| Can Knowledge Graphs Simplify Text? | Aug 14, 2023 | Descriptive, KG-to-Text Generation | Code Available | 1 |
| OST: Refining Text Knowledge with Optimal Spatio-Temporal Descriptor for General Video Recognition | Nov 30, 2023 | Descriptive, Language Modelling | Code Available | 1 |
| A Sparse and Locally Coherent Morphable Face Model for Dense Semantic Correspondence Across Heterogeneous 3D Faces | Jun 6, 2020 | Descriptive, Face Model | Code Available | 1 |
| Predicting emotion from music videos: exploring the relative contribution of visual and auditory information to affective responses | Feb 19, 2022 | Descriptive, Emotion Recognition | Code Available | 1 |
| Causal Modeling of Twitter Activity During COVID-19 | May 16, 2020 | Causal Inference, Descriptive | Code Available | 1 |
| A Variational Algorithm for Quantum Neural Networks | Jun 15, 2020 | Descriptive, General Classification | Code Available | 1 |