| Match-Prompt: Improving Multi-task Generalization Ability for Neural Text Matching via Prompt Learning | Apr 6, 2022 | Information RetrievalParaphrase Identification | CodeCode Available | 0 |
| DT2I: Dense Text-to-Image Generation from Region Descriptions | Apr 5, 2022 | Conditional Image GenerationImage Generation | —Unverified | 0 |
| Leveraging Search History for Improving Person-Job Fit | Mar 27, 2022 | Text Matching | —Unverified | 0 |
| Two-stream Hierarchical Similarity Reasoning for Image-text Matching | Mar 10, 2022 | Image-text matchingImage to text | —Unverified | 0 |
| Towards Building an Open-Domain Dialogue System Incorporated with Internet Memes | Mar 8, 2022 | Emotion ClassificationResponse Generation | —Unverified | 0 |
| Context Enhanced Short Text Matching using Clickthrough Data | Mar 3, 2022 | Text Matching | —Unverified | 0 |
| Dual Embodied-Symbolic Concept Representations for Deep Learning | Mar 1, 2022 | class-incremental learningClass Incremental Learning | —Unverified | 0 |
| Semantic Matching from Different Perspectives | Feb 14, 2022 | SentenceText Matching | CodeCode Available | 0 |
| MVPTR: Multi-Level Semantic Alignment for Vision-Language Pre-Training via Multi-Stage Learning | Jan 29, 2022 | Image-text matchingLanguage Modeling | CodeCode Available | 1 |
| Unpaired Referring Expression Grounding via Bidirectional Cross-Modal Matching | Jan 18, 2022 | Image-text matchingReferring Expression | —Unverified | 0 |
| Probing the Role of Positional Information in Vision-Language Models | Jan 16, 2022 | Contrastive LearningImage-text matching | —Unverified | 0 |
| Negative-Aware Attention Framework for Image-Text Matching | Jan 1, 2022 | Image-text matchingText Matching | CodeCode Available | 1 |
| Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation | Dec 10, 2021 | Image-text matchingImage-text Retrieval | —Unverified | 0 |
| VIRT: Improving Representation-based Models for Text Matching through Virtual Interaction | Dec 8, 2021 | Text Matching | —Unverified | 0 |
| Embedding Arithmetic of Multimodal Queries for Image Retrieval | Dec 6, 2021 | Image RetrievalImage-text matching | —Unverified | 0 |
| DenseCLIP: Language-Guided Dense Prediction with Context-Aware Prompting | Dec 2, 2021 | Image-text matchingInstance Segmentation | CodeCode Available | 1 |
| Object-aware Video-language Pre-training for Retrieval | Dec 1, 2021 | ObjectRetrieval | CodeCode Available | 1 |
| Learning with Noisy Correspondence for Cross-modal Matching | Dec 1, 2021 | Cross-Modal RetrievalCross-modal retrieval with noisy correspondence | CodeCode Available | 1 |
| UFO: A UniFied TransfOrmer for Vision-Language Representation Learning | Nov 19, 2021 | Image CaptioningImage-text matching | —Unverified | 0 |
| Semantic Matching from Different Perspectives | Nov 16, 2021 | SentenceText Matching | —Unverified | 0 |
| More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching | Nov 16, 2021 | Contrastive LearningImage-text matching | —Unverified | 0 |
| MURAL: Multimodal, Multitask Representations Across Languages | Nov 1, 2021 | Cross-Modal RetrievalImage-text matching | —Unverified | 0 |
| Video and Text Matching with Conditioned Embeddings | Oct 21, 2021 | Machine TranslationSentence | CodeCode Available | 1 |
| SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | Oct 20, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CLIP4Caption: CLIP for Video Caption | Oct 13, 2021 | DecoderSentence | —Unverified | 0 |
| TAG: Toward Accurate Social Media Content Tagging with a Concept Graph | Oct 13, 2021 | Dependency ParsingGraph Matching | —Unverified | 0 |
| Is An Image Worth Five Sentences? A New Look into Semantics for Image-Text Matching | Oct 6, 2021 | Image CaptioningImage-text matching | —Unverified | 0 |
| Protagonists' Tagger in Literary Domain -- New Datasets and a Method for Person Entity Linkage | Oct 4, 2021 | Entity Disambiguationnamed-entity-recognition | —Unverified | 0 |
| K-AID: Enhancing Pre-trained Language Models with Domain Knowledge for Question Answering | Sep 22, 2021 | CPUKnowledge Distillation | —Unverified | 0 |
| ActionCLIP: A New Paradigm for Video Action Recognition | Sep 17, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 |
| MURAL: Multimodal, Multitask Retrieval Across Languages | Sep 10, 2021 | Cross-Modal RetrievalImage-text matching | —Unverified | 0 |
| Supervised Contrastive Learning for Interpretable Long-Form Document Matching | Aug 20, 2021 | ArticlesContrastive Learning | CodeCode Available | 0 |
| Toward the Understanding of Deep Text Matching Models for Information Retrieval | Aug 16, 2021 | Information RetrievalRetrieval | —Unverified | 0 |
| Hashing based Efficient Inference for Image-Text Matching | Aug 1, 2021 | Image-text matchingText Matching | —Unverified | 0 |
| HANet: Hierarchical Alignment Networks for Video-Text Retrieval | Jul 26, 2021 | RetrievalText Matching | CodeCode Available | 1 |
| Parts2Words: Learning Joint Embedding of Point Clouds and Texts by Bidirectional Matching between Parts and Words | Jul 5, 2021 | RetrievalText Matching | CodeCode Available | 1 |
| A Modern Perspective on Query Likelihood with Deep Generative Retrieval Models | Jun 25, 2021 | Passage Re-RankingPassage Retrieval | CodeCode Available | 0 |
| A Self-Boosting Framework for Automated Radiographic Report Generation | Jun 19, 2021 | Image CaptioningImage-text matching | —Unverified | 0 |
| Step-Wise Hierarchical Alignment Network for Image-Text Matching | Jun 11, 2021 | Image-text matchingText Matching | —Unverified | 0 |
| A Deep Local and Global Scene-Graph Matching for Image-Text Retrieval | Jun 4, 2021 | Graph MatchingImage Retrieval | CodeCode Available | 1 |
| TITA: A Two-stage Interaction and Topic-Aware Text Matching Model | Jun 1, 2021 | Text MatchingVocal Bursts Valence Prediction | —Unverified | 0 |
| An Emotional Comfort Framework for Improving User Satisfaction in E-Commerce Customer Service Chatbots | Jun 1, 2021 | Answer SelectionEmotion Classification | —Unverified | 0 |
| Towards Efficient Cross-Modal Visual Textual Retrieval using Transformer-Encoder Deep Features | Jun 1, 2021 | Cross-Modal RetrievalImage Retrieval | —Unverified | 0 |
| More Than Just Attention: Improving Cross-Modal Attentions with Contrastive Constraints for Image-Text Matching | May 20, 2021 | Contrastive LearningCross-Modal Retrieval | —Unverified | 0 |
| Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval | May 16, 2021 | Graph GenerationImage Captioning | —Unverified | 0 |
| VL-NMS: Breaking Proposal Bottlenecks in Two-Stage Visual-Language Matching | May 12, 2021 | Image-text matchingReferring Expression | —Unverified | 0 |
| Learning Fine-grained Fact-Article Correspondence in Legal Cases | Apr 21, 2021 | ArticlesText Matching | CodeCode Available | 0 |
| Discrete-continuous Action Space Policy Gradient-based Attention for Image-Text Matching | Apr 21, 2021 | Image-text matchingText Matching | —Unverified | 0 |
| Context-Aware Interaction Network for Question Matching | Apr 17, 2021 | SentenceText Matching | —Unverified | 0 |
| UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training | Apr 1, 2021 | Image-text matchingImage-text Retrieval | —Unverified | 0 |