| Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding | Oct 8, 2020 | Intent DetectionSentence | CodeCode Available | 1 | 5 |
| iNLTK: Natural Language Toolkit for Indic Languages | Sep 26, 2020 | Data AugmentationParaphrase Generation | CodeCode Available | 1 | 5 |
| Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language | Mar 1, 2021 | SentenceWorld Knowledge | CodeCode Available | 1 | 5 |
| Instruction Position Matters in Sequence Generation with Large Language Models | Aug 23, 2023 | Instruction FollowingPosition | CodeCode Available | 1 | 5 |
| CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Dec 26, 2024 | Backdoor AttackSentence | CodeCode Available | 1 | 5 |
| Intent Classification and Slot Filling for Privacy Policies | Jan 1, 2021 | General Classificationintent-classification | CodeCode Available | 1 | 5 |
| Supplementary Features of BiLSTM for Enhanced Sequence Labeling | May 31, 2023 | Aspect-Based Sentiment AnalysisChinese Named Entity Recognition | CodeCode Available | 1 | 5 |
| CLIP-Fields: Weakly Supervised Semantic Fields for Robotic Memory | Oct 11, 2022 | SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Apr 20, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| The MSR-Video to Text Dataset with Clean Annotations | Feb 12, 2021 | SentenceVideo Captioning | CodeCode Available | 1 | 5 |
| A Personalized Conversational Benchmark: Towards Simulating Personalized Conversations | May 20, 2025 | SentenceSentence Classification | CodeCode Available | 1 | 5 |
| Clustering-Aware Negative Sampling for Unsupervised Sentence Representation | May 17, 2023 | ClusteringContrastive Learning | CodeCode Available | 1 | 5 |
| Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets | Oct 22, 2020 | ArticlesBenchmarking | CodeCode Available | 1 | 5 |
| C-STS: Conditional Semantic Textual Similarity | May 24, 2023 | Information RetrievalLanguage Model Evaluation | CodeCode Available | 1 | 5 |
| Abstractive Summarization Guided by Latent Hierarchical Document Structure | Nov 17, 2022 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 | 5 |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Nov 2, 2024 | Line DetectionSemantic Similarity | CodeCode Available | 1 | 5 |
| CNN+CNN: Convolutional Decoders for Image Captioning | May 23, 2018 | Image CaptioningSentence | CodeCode Available | 1 | 5 |
| An Open-Source Gloss-Based Baseline for Spoken to Signed Language Translation | May 28, 2023 | Machine TranslationSentence | CodeCode Available | 1 | 5 |
| CoCoA-MT: A Dataset and Benchmark for Contrastive Controlled MT with Application to Formality | May 9, 2022 | Machine TranslationSentence | CodeCode Available | 1 | 5 |
| Is Neural Topic Modelling Better than Clustering? An Empirical Study on Clustering with Contextual Embeddings for Topics | Apr 21, 2022 | ClusteringSentence | CodeCode Available | 1 | 5 |
| Iterative Refinement in the Continuous Space for Non-Autoregressive Neural Machine Translation | Sep 15, 2020 | de-enMachine Translation | CodeCode Available | 1 | 5 |
| CODE-ACCORD: A Corpus of Building Regulatory Data for Rule Generation towards Automatic Compliance Checking | Mar 4, 2024 | Relation ExtractionSentence | CodeCode Available | 1 | 5 |
| COINS: Dynamically Generating COntextualized Inference Rules for Narrative Story Completion | Jun 4, 2021 | SentenceStory Completion | CodeCode Available | 1 | 5 |
| Japanese SimCSE Technical Report | Oct 30, 2023 | SentenceSentence Embedding | CodeCode Available | 1 | 5 |
| A Plug-and-Play Method for Controlled Text Generation | Sep 20, 2021 | SentenceStory Generation | CodeCode Available | 1 | 5 |