| Relation-aware Video Reading Comprehension for Temporal Language Grounding | Oct 12, 2021 | Reading ComprehensionRelation | CodeCode Available | 1 |
| Time Masking for Temporal Language Models | Oct 12, 2021 | Change DetectionLanguage Modeling | CodeCode Available | 1 |
| Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos | Oct 12, 2021 | Semantic correspondenceSemantic Similarity | CodeCode Available | 1 |
| CLIP4Caption ++: Multi-CLIP for Video Caption | Oct 11, 2021 | DecoderSentence | —Unverified | 0 |
| Document-Level Text Simplification: Dataset, Criteria and Baseline | Oct 11, 2021 | SentenceText Simplification | CodeCode Available | 1 |
| Semi-Autoregressive Image Captioning | Oct 11, 2021 | DecoderImage Captioning | CodeCode Available | 0 |
| Batch-Softmax Contrastive Loss for Pairwise Sentence Scoring Tasks | Oct 10, 2021 | Representation LearningSentence | —Unverified | 0 |
| Can Audio Captions Be Evaluated with Image Caption Metrics? | Oct 10, 2021 | AudioCapsAudio captioning | CodeCode Available | 1 |
| What Makes Sentences Semantically Related: A Textual Relatedness Dataset and Empirical Study | Oct 10, 2021 | Question AnsweringSemantic Similarity | CodeCode Available | 1 |
| PASTE: A Tagging-Free Decoding Framework Using Pointer Networks for Aspect Sentiment Triplet Extraction | Oct 10, 2021 | Aspect Sentiment Triplet ExtractionDecoder | CodeCode Available | 1 |