| Character-level White-Box Adversarial Attacks against Transformers via Attachable Subwords Substitution | Oct 31, 2022 | Adversarial Attack, Sentence | Code Available | 1 | 5 |
| IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine Translation | Nov 1, 2020 | Dynamic Time Warping, Machine Translation | Code Available | 1 | 5 |
| CheckEmbed: Effective Verification of LLM Solutions to Open-Ended Tasks | Jun 4, 2024 | Document Summarization, Sentence | Code Available | 1 | 5 |
| ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information | Jun 30, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation | Dec 26, 2019 | Benchmarking, Domain Adaptation | Code Available | 1 | 5 |
| An MRC Framework for Semantic Role Labeling | Sep 14, 2021 | Computational Efficiency, Machine Reading Comprehension | Code Available | 1 | 5 |
| Cost-Sensitive BERT for Generalisable Sentence Classification with Imbalanced Data | Mar 16, 2020 | Classification, Data Augmentation | Code Available | 1 | 5 |
| Improving Contrastive Learning of Sentence Embeddings with Case-Augmented Positives and Retrieved Negatives | Jun 6, 2022 | Attribute, Contrastive Learning | Code Available | 1 | 5 |
| AnnIE: An Annotation Platform for Constructing Complete Open Information Extraction Benchmark | Sep 15, 2021 | Open Information Extraction, Sentence | Code Available | 1 | 5 |
| CORWA: A Citation-Oriented Related Work Annotation Dataset | May 7, 2022 | Sentence | Code Available | 1 | 5 |