| Automated Content Scoring of Spoken Responses in an Assessment for Teachers of English | Jun 1, 2013 | Text Matching | —Unverified | 0 | 0 |
| Automatic Prompt Generation and Grounding Object Detection for Zero-Shot Image Anomaly Detection | Nov 28, 2024 | Anomaly DetectionImage-text matching | —Unverified | 0 | 0 |
| Best of Both Worlds: A Pliable and Generalizable Neuro-Symbolic Approach for Relation Classification | Mar 5, 2024 | Few-Shot Relation ClassificationRelation | —Unverified | 0 | 0 |
| Breaking Through the Noisy Correspondence: A Robust Model for Image-Text Matching | Apr 29, 2024 | Cross-modal retrieval with noisy correspondenceImage-text matching | —Unverified | 0 | 0 |
| Bridging the Modality Gap: Dimension Information Alignment and Sparse Spatial Constraint for Image-Text Matching | Oct 22, 2024 | Contrastive LearningImage-text matching | —Unverified | 0 | 0 |
| BURT: BERT-inspired Universal Representation from Learning Meaningful Segment | Dec 28, 2020 | Information RetrievalQuestion Answering | —Unverified | 0 | 0 |
| CGI: Identifying Conditional Generative Models with Example Images | Jan 23, 2025 | Text Matching | —Unverified | 0 | 0 |
| Characterizing drug mentions in COVID-19 Twitter Chatter | Jul 20, 2020 | Text Matching | —Unverified | 0 | 0 |
| CLIP4Caption: CLIP for Video Caption | Oct 13, 2021 | DecoderSentence | —Unverified | 0 | 0 |
| SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | Oct 20, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| CLIP-Powered TASS: Target-Aware Single-Stream Network for Audio-Visual Question Answering | May 13, 2024 | Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA) | —Unverified | 0 | 0 |
| SRTube: Video-Language Pre-Training with Action-Centric Video Tube Features and Semantic Role Labeling | Jan 1, 2024 | Semantic Role LabelingText Matching | —Unverified | 0 | 0 |
| ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer | May 1, 2022 | ClusteringMachine Translation | —Unverified | 0 | 0 |
| CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs | Jan 5, 2024 | Image ComprehensionImage to text | —Unverified | 0 | 0 |
| CodeReviewQA: The Code Review Comprehension Assessment for Large Language Models | Mar 20, 2025 | Code GenerationMultiple-choice | —Unverified | 0 | 0 |
| AdvExpander: Generating Natural Language Adversarial Examples by Expanding Text | Dec 18, 2020 | Text Matching | —Unverified | 0 | 0 |
| COM3D: Leveraging Cross-View Correspondence and Cross-Modal Mining for 3D Retrieval | May 7, 2024 | Cross-Modal RetrievalRetrieval | —Unverified | 0 | 0 |
| Step-Wise Hierarchical Alignment Network for Image-Text Matching | Jun 11, 2021 | Image-text matchingText Matching | —Unverified | 0 | 0 |
| Storyboard guided Alignment for Fine-grained Video Action Recognition | Oct 18, 2024 | Action RecognitionLanguage Modelling | —Unverified | 0 | 0 |
| You Only Submit One Image to Find the Most Suitable Generative Model | Dec 16, 2024 | Image GenerationText Matching | —Unverified | 0 | 0 |
| Constructing Multilingual Visual-Text Datasets Revealing Visual Multilingual Ability of Vision Language Models | Mar 29, 2024 | Image-text matchingObject Recognition | —Unverified | 0 | 0 |
| Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study | May 15, 2024 | Content-Based Image RetrievalImage Retrieval | —Unverified | 0 | 0 |
| Context-Aware Interaction Network for Question Matching | Apr 17, 2021 | SentenceText Matching | —Unverified | 0 | 0 |
| Context Enhanced Short Text Matching using Clickthrough Data | Mar 3, 2022 | Text Matching | —Unverified | 0 | 0 |
| Don't Stop Learning: Towards Continual Learning for the CLIP Model | Jul 19, 2022 | Continual LearningImage-text matching | —Unverified | 0 | 0 |
| Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech | Jan 12, 2024 | Contrastive LearningKeyword Spotting | —Unverified | 0 | 0 |
| CPSeg: Finer-grained Image Semantic Segmentation via Chain-of-Thought Language Prompting | Oct 24, 2023 | Image Segmentationobject-detection | —Unverified | 0 | 0 |
| Cross-lingual Short-text Matching with Deep Learning | Nov 13, 2018 | Deep LearningFeature Engineering | —Unverified | 0 | 0 |
| SummScore: A Comprehensive Evaluation Metric for Summary Quality Based on Cross-Encoder | Jul 11, 2022 | DiversityText Matching | —Unverified | 0 | 0 |
| Advanced Multimodal Deep Learning Architecture for Image-Text Matching | Jun 13, 2024 | Deep LearningImage-text matching | —Unverified | 0 | 0 |
| A New Fine-grained Alignment Method for Image-text Matching | Nov 3, 2023 | Image-text matchingImage-text Retrieval | —Unverified | 0 | 0 |
| Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval | May 28, 2017 | Cross-Modal RetrievalImage Retrieval | —Unverified | 0 | 0 |
| CroVeWA: Crosslingual Vector-Based Writing Assistance | Jun 1, 2015 | Text MatchingWord Embeddings | —Unverified | 0 | 0 |
| CILF-CIAE: CLIP-driven Image-Language Fusion for Correcting Inverse Age Estimation | Dec 4, 2023 | Age EstimationImage-text matching | —Unverified | 0 | 0 |
| DARE: Diverse Visual Question Answering with Robustness Evaluation | Sep 26, 2024 | image-classificationImage Classification | —Unverified | 0 | 0 |
| Deconvolutional Latent-Variable Model for Text Sequence Matching | Sep 21, 2017 | Decodermodel | —Unverified | 0 | 0 |
| Deep Bag-of-Words Model: An Efficient and Interpretable Relevance Architecture for Chinese E-Commerce | Jul 12, 2024 | Computational EfficiencyLanguage Modelling | —Unverified | 0 | 0 |
| Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval | May 16, 2021 | Graph GenerationImage Captioning | —Unverified | 0 | 0 |
| Deep Fusion LSTMs for Text Semantic Matching | Aug 1, 2016 | Machine TranslationQuestion Answering | —Unverified | 0 | 0 |
| Deep Learning Applied to Image and Text Matching | Sep 14, 2015 | Deep LearningImage Retrieval | —Unverified | 0 | 0 |
| Deep Learning for Biomedical Information Retrieval: Learning Textual Relevance from Click Logs | Aug 1, 2017 | Biomedical Information RetrievalInformation Retrieval | —Unverified | 0 | 0 |
| SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining | Apr 1, 2024 | Contrastive LearningImage-text matching | —Unverified | 0 | 0 |
| Deep Neural Review Text Interaction for Recommendation Systems | Mar 16, 2020 | Recommendation SystemsSmall Data Image Classification | —Unverified | 0 | 0 |
| DEMO: A Statistical Perspective for Efficient Image-Text Matching | May 19, 2024 | Image-text matchingModel Optimization | —Unverified | 0 | 0 |
| Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval | Jan 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Dependency Structure Augmented Contextual Scoping Framework for Multimodal Aspect-Based Sentiment Analysis | Apr 15, 2025 | Aspect-Based Sentiment AnalysisDependency Parsing | —Unverified | 0 | 0 |
| Descriptive Image-Text Matching with Graded Contextual Similarity | May 15, 2025 | DescriptiveImage-text matching | —Unverified | 0 | 0 |
| TAG: Toward Accurate Social Media Content Tagging with a Concept Graph | Oct 13, 2021 | Dependency ParsingGraph Matching | —Unverified | 0 | 0 |
| DiPair: Fast and Accurate Distillation for Trillion-Scale Text Matching and Pair Modeling | Oct 7, 2020 | Knowledge DistillationQuestion Answering | —Unverified | 0 | 0 |
| Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching | Apr 21, 2021 | Image-text matchingText Matching | —Unverified | 0 | 0 |