| Should we Stop Training More Monolingual Models, and Simply Use Machine Translation Instead? | Apr 21, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| On Sampling-Based Training Criteria for Neural Language Modeling | Apr 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Pre-training for Spoken Language Understanding with Joint Textual and Phonetic Representation Learning | Apr 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Improving Biomedical Pretrained Language Models with Knowledge | Apr 21, 2021 | Entity LinkingLanguage Modeling | CodeCode Available | 1 |
| Frustratingly Easy Edit-based Linguistic Steganography with a Masked Language Model | Apr 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| B-PROP: Bootstrapped Pre-training with Representative Words Prediction for Ad-hoc Retrieval | Apr 20, 2021 | Information RetrievalLanguage Modeling | CodeCode Available | 0 |
| Differentiable Model Compression via Pseudo Quantization Noise | Apr 20, 2021 | Audio Source Separationimage-classification | CodeCode Available | 1 |
| BERTić -- The Transformer Language Model for Bosnian, Croatian, Montenegrin and Serbian | Apr 19, 2021 | Commonsense Causal ReasoningLanguage Modeling | —Unverified | 0 |
| ELECTRAMed: a new pre-trained language representation model for biomedical NLP | Apr 19, 2021 | Drug–drug Interaction ExtractionLanguage Modeling | CodeCode Available | 1 |
| Operationalizing a National Digital Library: The Case for a Norwegian Transformer Model | Apr 19, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| When FastText Pays Attention: Efficient Estimation of Word Representations using Constrained Positional Weighting | Apr 19, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Understanding Chinese Video and Language via Contrastive Multimodal Pre-Training | Apr 19, 2021 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| Go Forth and Prosper: Language Modeling with Ancient Textual History | Apr 18, 2021 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations | Apr 18, 2021 | DiversityLanguage Modeling | CodeCode Available | 1 |
| On the Influence of Masking Policies in Intermediate Pre-training | Apr 18, 2021 | Abstractive Text SummarizationLanguage Modeling | —Unverified | 0 |
| Probing Across Time: What Does RoBERTa Know and When? | Apr 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Text2App: A Framework for Creating Android Apps from Text Descriptions | Apr 16, 2021 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| Back to Square One: Artifact Detection, Training and Commonsense Disentanglement in the Winograd Schema | Apr 16, 2021 | Artifact DetectionBias Detection | —Unverified | 0 |
| Enriching a Model's Notion of Belief using a Persistent Memory | Apr 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Masked Segmental Language Model for Unsupervised Natural Language Segmentation | Apr 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Effect of Visual Extensions on Natural Language Understanding in Vision-and-Language Models | Apr 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Detecting Polarized Topics Using Partisanship-aware Contextualized Topic Embeddings | Apr 15, 2021 | ArticlesLanguage Modeling | CodeCode Available | 0 |
| KnowPrompt: Knowledge-aware Prompt-tuning with Synergistic Optimization for Relation Extraction | Apr 15, 2021 | Dialog Relation ExtractionLanguage Modeling | CodeCode Available | 1 |
| How to Train BERT with an Academic Budget | Apr 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bilingual alignment transfers to multilingual alignment for unsupervised parallel text mining | Apr 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SINA-BERT: A pre-trained Language Model for Analysis of Medical Texts in Persian | Apr 15, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Quantifying Gender Bias Towards Politicians in Cross-Lingual Language Models | Apr 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Time-Stamped Language Model: Teaching Language Models to Understand the Flow of Events | Apr 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| UDALM: Unsupervised Domain Adaptation through Language Modeling | Apr 14, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| Mean-Squared Accuracy of Good-Turing Estimator | Apr 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little | Apr 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TSDAE: Using Transformer-based Sequential Denoising Auto-Encoder for Unsupervised Sentence Embedding Learning | Apr 14, 2021 | DenoisingDomain Adaptation | CodeCode Available | 1 |
| Event Detection as Question Answering with Entity Information | Apr 14, 2021 | Event DetectionLanguage Modeling | CodeCode Available | 0 |
| IGA : An Intent-Guided Authoring Assistant | Apr 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Large-Scale Self- and Semi-Supervised Learning for Speech Translation | Apr 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce | Apr 14, 2021 | DecoderKnowledge Base Completion | CodeCode Available | 1 |
| Learning How to Ask: Querying LMs with Mixtures of Soft Prompts | Apr 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EAT: Enhanced ASR-TTS for Self-supervised Speech Recognition | Apr 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What's in your Head? Emergent Behaviour in Multi-Task Transformer Models | Apr 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transformer-based Methods for Recognizing Ultra Fine-grained Entities (RUFES) | Apr 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Restoring and Mining the Records of the Joseon Dynasty via Neural Language Modeling and Machine Translation | Apr 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Paragraph-level Simplification of Medical Texts | Apr 12, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| On the Inductive Bias of Masked Language Modeling: From Statistical to Syntactic Dependencies | Apr 12, 2021 | Inductive BiasLanguage Modeling | CodeCode Available | 1 |
| Estimating Subjective Crowd-Evaluations as an Additional Objective to Improve Natural Language Generation | Apr 12, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Building a Swedish Open-Domain Conversational Language Model | Apr 12, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Investigating Methods to Improve Language Model Integration for Attention-based Encoder-Decoder ASR Models | Apr 12, 2021 | DecoderLanguage Modeling | —Unverified | 0 |
| Lookup-Table Recurrent Language Models for Long Tail Speech Recognition | Apr 9, 2021 | GPULanguage Modeling | —Unverified | 0 |
| Language model fusion for streaming end to end speech recognition | Apr 9, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Large-Scale Language Model Training on GPU Clusters Using Megatron-LM | Apr 9, 2021 | GPULanguage Modeling | CodeCode Available | 0 |
| Extended Parallel Corpus for Amharic-English Machine Translation | Apr 8, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |