| Exploring Unsupervised Pretraining Objectives for Machine Translation | Jun 10, 2021 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models | Jun 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MST: Masked Self-Supervised Transformer for Visual Representation | Jun 10, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Auto-tagging of Short Conversational Sentences using Natural Language Processing Methods | Jun 9, 2021 | ChatbotLanguage Modeling | CodeCode Available | 0 |
| DGA-Net Dynamic Gaussian Attention Network for Sentence Semantic Matching | Jun 9, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Hash Layers For Large Sparse Models | Jun 8, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model | Jun 8, 2021 | Entity TypingLanguage Modeling | CodeCode Available | 1 |
| Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks | Jun 8, 2021 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Staircase Attention for Recurrent Processing of Sequences | Jun 8, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Interpretable and Low-Resource Entity Matching via Decoupling Feature Learning from Decision Making | Jun 8, 2021 | AttributeDecision Making | CodeCode Available | 0 |
| Exploiting Language Relatedness for Low Web-Resource Language Model Adaptation: An Indic Languages Study | Jun 7, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 0 |
| Generating Hypothetical Events for Abductive Inference | Jun 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Top-KAST: Top-K Always Sparse Training | Jun 7, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Pre-trained Language Model for Web-scale Retrieval in Baidu Search | Jun 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning | Jun 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Video Imprint | Jun 7, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RoSearch: Search for Robust Student Architectures When Distilling Pre-trained Language Models | Jun 7, 2021 | Adversarial RobustnessKnowledge Distillation | —Unverified | 0 |
| Semantic-Enhanced Explainable Finetuning for Open-Domain Dialogues | Jun 6, 2021 | InformativenessLanguage Modeling | —Unverified | 0 |
| On the Effectiveness of Adapter-based Tuning for Pretrained Language Model Adaptation | Jun 6, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Targeted Assessment of Incremental Processing in Neural LanguageModels and Humans | Jun 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Let's be explicit about that: Distant supervision for implicit discourse relation classification via connective prediction | Jun 6, 2021 | ClassificationImplicit Discourse Relation Classification | —Unverified | 0 |
| Extracting Weighted Automata for Approximate Minimization in Language Modelling | Jun 5, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BERTnesia: Investigating the capture and forgetting of knowledge in BERT | Jun 5, 2021 | Knowledge Base CompletionLanguage Modeling | CodeCode Available | 0 |
| Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--Hastings | Jun 4, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Language Model Metrics and Procrustes Analysis for Improved Vector Transformation of NLP Embeddings | Jun 4, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enabling Lightweight Fine-tuning for Pre-trained Language Model Compression based on Matrix Product Operators | Jun 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Bi-Granularity Contrastive Learning for Post-Training in Few-Shot Scene | Jun 4, 2021 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition | Jun 4, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MPC-BERT: A Pre-Trained Language Model for Multi-Party Conversation Understanding | Jun 3, 2021 | Conversational Response SelectionLanguage Modeling | CodeCode Available | 1 |
| nmT5 -- Is parallel data still relevant for pre-training massively multilingual language models? | Jun 3, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Template-Based Named Entity Recognition Using BART | Jun 3, 2021 | Few-shot NERLanguage Modeling | CodeCode Available | 1 |
| Provably Secure Generative Linguistic Steganography | Jun 3, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution | Jun 3, 2021 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 |
| Luna: Linear Unified Nested Attention | Jun 3, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education | Jun 2, 2021 | Knowledge TracingLanguage Modeling | CodeCode Available | 1 |
| One Teacher is Enough? Pre-trained Language Model Distillation from Multiple Teachers | Jun 2, 2021 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection | Jun 2, 2021 | DecoderEmotion Recognition in Conversation | —Unverified | 0 |
| Lower Perplexity is Not Always Human-Like | Jun 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Differential Privacy for Text Analytics via Natural Text Sanitization | Jun 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Span Extraction Approach for Information Extraction on Visually-Rich Documents | Jun 2, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning to Select: A Fully Attentive Approach for Novel Object Captioning | Jun 2, 2021 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks | Jun 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| belabBERT: a Dutch RoBERTa-based language model applied to psychiatric classification | Jun 2, 2021 | Audio ClassificationClassification | —Unverified | 0 |
| A Generalizable Approach to Learning Optimizers | Jun 2, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Decision Transformer: Reinforcement Learning via Sequence Modeling | Jun 2, 2021 | Atari GamesD4RL | CodeCode Available | 1 |
| Transferring Representations of Logical Connectives | Jun 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Low-Resource Machine Translation Using Cross-Lingual Language Model Pretraining | Jun 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Predicting Numerals in Natural Language Text Using a Language Model Considering the Quantitative Aspects of Numerals | Jun 1, 2021 | Common Sense ReasoningLanguage Modeling | —Unverified | 0 |
| ERNIE-NLI: Analyzing the Impact of Domain-Specific External Knowledge on Enhanced Representations for NLI | Jun 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |