| AdvAug: Robust Adversarial Augmentation for Neural Machine Translation | Jun 21, 2020 | Data AugmentationMachine Translation | —Unverified | 0 |
| An Annotated Dataset of Discourse Modes in Hindi Stories | May 1, 2020 | DescriptiveSentence | —Unverified | 0 |
| Developing a Dataset of Overridden Information in Wikipedia | Jun 1, 2022 | Binary ClassificationSentence | —Unverified | 0 |
| Developing an Informal-Formal Persian Corpus | Aug 10, 2023 | Sentence | —Unverified | 0 |
| A Unified Sentence Space for Categorical Distributional-Compositional Semantics: Theory and Experiments | Dec 1, 2012 | Sentence | —Unverified | 0 |
| An Annotated Dataset and Automatic Approaches for Discourse Mode Identification in Low-resource Bengali Language | Jul 1, 2022 | DescriptiveLanguage Modeling | —Unverified | 0 |
| A Unified Neural Coherence Model | Sep 1, 2019 | Machine Translationmodel | —Unverified | 0 |
| An Annotated Corpus of Webtables for Information Extraction Tasks | Aug 18, 2020 | Question AnsweringRelation Extraction | —Unverified | 0 |
| A Comprehensive Comparative Study of Word and Sentence Similarity Measures | Feb 17, 2016 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Developing a Clinical Language Model for Swedish: Continued Pretraining of Generic BERT with In-Domain Data | Sep 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |