| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | Aug 1, 2021 | Decoder, Language Modeling | Code Available | 1 |
| Rakuten’s Participation in WAT 2021: Examining the Effectiveness of Pre-trained Models for Multilingual and Multimodal Machine Translation | Aug 1, 2021 | Denoising, Language Modeling | Unverified | 0 |
| Time-Efficient Code Completion Model for the R Programming Language | Aug 1, 2021 | Code Completion, Language Modeling | Code Available | 0 |
| Meta-Learning for Few-Shot Named Entity Recognition | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| MVP-BERT: Multi-Vocab Pre-training for Chinese BERT | Aug 1, 2021 | Chinese Word Segmentation, Language Modeling | Unverified | 0 |
| Stereotyping Norwegian Salmon: An Inventory of Pitfalls in Fairness Benchmark Datasets | Aug 1, 2021 | Coreference Resolution | Unverified | 0 |
| Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction | Aug 1, 2021 | Language Modeling | Code Available | 1 |
| ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse Paraphrasing | Aug 1, 2021 | Diversity, Intent Detection | Code Available | 1 |
| QASR: QCRI Aljazeera Speech Resource A Large Scale Annotated Arabic Speech Corpus | Aug 1, 2021 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| MulDA: A Multilingual Data Augmentation Framework for Low-Resource Cross-Lingual NER | Aug 1, 2021 | Cross-Lingual NER, Cross-Lingual Transfer | Unverified | 0 |
| PRAL: A Tailored Pre-Training Model for Task-Oriented Dialog Generation | Aug 1, 2021 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Unleash GPT-2 Power for Event Detection | Aug 1, 2021 | Event Detection, Language Modeling | Unverified | 0 |
| Selecting Informative Contexts Improves Language Model Fine-tuning | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| nmT5 - Is parallel data still relevant for pre-training massively multilingual language models? | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| PHMOSpell: Phonological and Morphological Knowledge Guided Chinese Spelling Check | Aug 1, 2021 | Chinese Spell Checking, Language Modeling | Unverified | 0 |
| Evaluating morphological typology in zero-shot cross-lingual transfer | Aug 1, 2021 | Cross-Lingual Transfer, Language Modeling | Unverified | 0 |
| Attending Self-Attention: A Case Study of Visually Grounded Supervision in Vision-and-Language Transformers | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| AND does not mean OR: Using Formal Languages to Study Language Models' Representations | Aug 1, 2021 | Form, Language Modeling | Unverified | 0 |
| DeepBlueAI at SemEval-2021 Task 7: Detecting and Rating Humor and Offense with Stacking Diverse Language Model-Based Methods | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| EnsLM: Ensemble Language Model for Data Diversity by Semantic Clustering | Aug 1, 2021 | Clustering, Diversity | Code Available | 0 |
| A Targeted Assessment of Incremental Processing in Neural Language Models and Humans | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| AStarTwice at SemEval-2021 Task 5: Toxic Span Detection Using RoBERTa-CRF, Domain Specific Pre-Training and Self-Training | Aug 1, 2021 | Language Modeling | Unverified | 0 |
| Entity at SemEval-2021 Task 5: Weakly Supervised Token Labelling for Toxic Spans Detection | Aug 1, 2021 | Classification, Language Modeling | Code Available | 0 |
| Cambridge at SemEval-2021 Task 2: Neural WiC-Model with Data Augmentation and Exploration of Representation | Aug 1, 2021 | Data Augmentation, Language Modeling | Unverified | 0 |