| IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language Understanding | Sep 11, 2020 | BenchmarkingDiversity | CodeCode Available | 1 | 5 |
| CiteWorth: Cite-Worthiness Detection for Improved Scientific Document Understanding | May 23, 2021 | document understandingDomain Adaptation | CodeCode Available | 1 | 5 |
| InfoCSE: Information-aggregated Contrastive Learning of Sentence Embeddings | Oct 8, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval | Dec 17, 2024 | Contrastive LearningInformation Retrieval | CodeCode Available | 1 | 5 |
| Injecting Word Information with Multi-Level Word Adapter for Chinese Spoken Language Understanding | Oct 8, 2020 | Intent DetectionSentence | CodeCode Available | 1 | 5 |
| iNLTK: Natural Language Toolkit for Indic Languages | Sep 26, 2020 | Data AugmentationParaphrase Generation | CodeCode Available | 1 | 5 |
| Supplementary Features of BiLSTM for Enhanced Sequence Labeling | May 31, 2023 | Aspect-Based Sentiment AnalysisChinese Named Entity Recognition | CodeCode Available | 1 | 5 |
| CLGC: A Corpus for Chinese Literary Grace Evaluation | Jun 1, 2022 | Sentence | CodeCode Available | 1 | 5 |
| CL-Attack: Textual Backdoor Attacks via Cross-Lingual Triggers | Dec 26, 2024 | Backdoor AttackSentence | CodeCode Available | 1 | 5 |
| Cryptonite: A Cryptic Crossword Benchmark for Extreme Ambiguity in Language | Mar 1, 2021 | SentenceWorld Knowledge | CodeCode Available | 1 | 5 |