| DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment | Oct 19, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Automatic Learning of Subword Dependent Model Scales | Oct 18, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | Oct 18, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Training Deep Neural Networks with Adaptive Momentum Inspired by the Quadratic Optimization | Oct 18, 2021 | BIG-bench Machine Learningimage-classification | CodeCode Available | 1 |
| Reminding the Incremental Language Model via Data-Free Self-Distillation | Oct 17, 2021 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| GNN-LM: Language Modeling based on Global Contexts via GNN | Oct 17, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Novel Metric for Evaluating Semantics Preservation | Oct 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Echo-Attention: Attend Once and Get N Attentions for Free | Oct 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DEMix Layers: Disentangling Domains for Modular Language Modeling | Oct 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models | Oct 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens | Oct 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking | Oct 16, 2021 | Dialogue State TrackingLanguage Modeling | —Unverified | 0 |
| xGQA: Cross-Lingual Visual Question Answering | Oct 16, 2021 | Cross-Lingual TransferLanguage Modeling | —Unverified | 0 |
| On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation | Oct 16, 2021 | Domain AdaptationDomain Generalization | —Unverified | 0 |
| Prix-LM: Pretraining for Multilingual Knowledge Base Construction | Oct 16, 2021 | Bilingual Lexicon InductionCausal Language Modeling | CodeCode Available | 0 |
| Sharpness-Aware Minimization Improves Language Model Generalization | Oct 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages | Oct 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Improving Transformers with Probabilistic Attention Keys | Oct 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models | Oct 16, 2021 | counterfactualData Augmentation | CodeCode Available | 1 |
| HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression | Oct 16, 2021 | Few-Shot LearningKnowledge Distillation | CodeCode Available | 0 |
| ASR4REAL: An extended benchmark for speech models | Oct 16, 2021 | DiversityLanguage Modeling | —Unverified | 0 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Oct 16, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Invariant Language Modeling | Oct 16, 2021 | Domain GeneralizationLanguage Modeling | CodeCode Available | 1 |
| Hydra: A System for Large Multi-Model Deep Learning | Oct 16, 2021 | Deep LearningGPU | CodeCode Available | 1 |
| Leveraging Knowledge in Multilingual Commonsense Reasoning | Oct 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Multilingual Bag-of-Entities Model for Zero-Shot Cross-Lingual Text Classification | Oct 15, 2021 | ClassificationEntity Typing | —Unverified | 0 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state trackingLanguage Modeling | CodeCode Available | 0 |
| Generated Knowledge Prompting for Commonsense Reasoning | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Coherence boosting: When your pretrained language model is not paying enough attention | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Control Prefixes for Parameter-Efficient Text Generation | Oct 15, 2021 | Abstractive Text SummarizationAttribute | CodeCode Available | 1 |
| Kronecker Decomposition for GPT Compression | Oct 15, 2021 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| mLUKE: The Power of Entity Representations in Multilingual Pretrained Language Models | Oct 15, 2021 | Cross-Lingual Question AnsweringCross-Lingual Transfer | CodeCode Available | 1 |
| The World of an Octopus: How Reporting Bias Influences a Language Model's Perception of Color | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Meta-learning via Language Model In-context Tuning | Oct 15, 2021 | In-Context LearningInductive Bias | CodeCode Available | 1 |
| Tracing Origins: Coreference-aware Machine Reading Comprehension | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Sparks: Inspiration for Science Writing using Language Models | Oct 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MIMICause: Representation and automatic extraction of causal relation types from clinical notes | Oct 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Spoken ObjectNet: A Bias-Controlled Spoken Caption Dataset | Oct 14, 2021 | Image RetrievalLanguage Modeling | CodeCode Available | 0 |
| Symbolic Knowledge Distillation: from General Language Models to Commonsense Models | Oct 14, 2021 | Knowledge DistillationKnowledge Graphs | CodeCode Available | 1 |
| P-Tuning v2: Prompt Tuning Can Be Comparable to Fine-tuning Universally Across Scales and Tasks | Oct 14, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning | Oct 14, 2021 | DiversityLanguage Modeling | CodeCode Available | 1 |
| Composable Sparse Fine-Tuning for Cross-Lingual Transfer | Oct 14, 2021 | Cross-Lingual TransferLanguage Modeling | CodeCode Available | 1 |
| bert2BERT: Towards Reusable Pretrained Language Models | Oct 14, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dict-BERT: Enhancing Language Model Pre-training with Dictionary | Oct 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| On Language Model Integration for RNN Transducer based Speech Recognition | Oct 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Maximizing Efficiency of Language Model Pre-training for Learning Representation | Oct 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Deep Learning for Bias Detection: From Inception to Deployment | Oct 12, 2021 | Bias DetectionDeep Learning | —Unverified | 0 |
| Multi-Modal Pre-Training for Automated Speech Recognition | Oct 12, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Småprat: DialoGPT for Natural Language Generation of Swedish Dialogue by Transfer Learning | Oct 12, 2021 | ChatbotLanguage Modeling | —Unverified | 0 |
| Time Masking for Temporal Language Models | Oct 12, 2021 | Change DetectionLanguage Modeling | CodeCode Available | 1 |