| Transfer Learning with Shallow Decoders: BSC at WMT2021’s Multilingual Low-Resource Translation for Indo-European Languages Shared Task | Nov 1, 2021 | Articles, Decoder | Code Available | 0 |
| NICT Kyoto Submission for the WMT’21 Quality Estimation Task: Multimetric Multilingual Pretraining for Critical Error Detection | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| Unsupervised Multi-View Post-OCR Error Correction With Language Models | Nov 1, 2021 | Domain Adaptation, Language Modeling | Unverified | 0 |
| Unsupervised Adverbial Identification in Modern Chinese Literature | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| Scaffolded input promotes atomic organization in the recurrent neural network language model | Nov 1, 2021 | Language Modeling | Code Available | 0 |
| Who’s on First?: Probing the Learning and Representation Capabilities of Language Models on Deterministic Closed Domains | Nov 1, 2021 | Benchmarking, Language Modeling | Code Available | 0 |
| Unsupervised Discovery of Unaccusative and Unergative Verbs | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| On the Role of Corpus Ordering in Language Modeling | Nov 1, 2021 | Language Acquisition, Language Modeling | Unverified | 0 |
| ProSPer: Probing Human and Neural Network Language Model Understanding of Spatial Perspective | Nov 1, 2021 | Language Modeling | Code Available | 0 |
| What Can a Generative Language Model Answer About a Passage? | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| What BERT Based Language Model Learns in Spoken Transcripts: An Empirical Study | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| Stacked AMR Parsing with Silver Data | Nov 1, 2021 | Abstract Meaning Representation, AMR Parsing | Code Available | 0 |
| UnClE: Explicitly Leveraging Semantic Similarity to Reduce the Parameters of Word Embeddings | Nov 1, 2021 | Language Modeling | Unverified | 0 |
| R-BERT-CNN: Drug-target interactions extraction from biomedical literature | Oct 31, 2021 | Articles, Drug Discovery | Unverified | 0 |
| PnPOOD : Out-Of-Distribution Detection for Text Classification via Plug andPlay Data Augmentation | Oct 31, 2021 | Data Augmentation, Language Modeling | Unverified | 0 |
| Automatic Knowledge Augmentation for Generative Commonsense Reasoning | Oct 30, 2021 | Language Modeling | Unverified | 0 |
| EmpBot: A T5-based Empathetic Chatbot focusing on Sentiments | Oct 30, 2021 | Chatbot, Language Modeling | Unverified | 0 |
| Combining Unsupervised and Text Augmented Semi-Supervised Learning for Low Resourced Autoregressive Speech Recognition | Oct 29, 2021 | Domain Adaptation, Language Modeling | Unverified | 0 |
| Pre-training Co-evolutionary Protein Representation via A Pairwise Masked Language Model | Oct 29, 2021 | Language Modeling | Unverified | 0 |
| Semi-Siamese Bi-encoder Neural Ranking Model Using Lightweight Fine-Tuning | Oct 28, 2021 | Language Modeling | Code Available | 0 |
| No News is Good News: A Critique of the One Billion Word Benchmark | Oct 25, 2021 | Language Modeling | Unverified | 0 |
| Paradigm Shift in Language Modeling: Revisiting CNN for Modeling Sanskrit Originated Bengali and Hindi Language | Oct 25, 2021 | Language Modeling | Unverified | 0 |
| Distributionally Robust Recurrent Decoders with Random Network Distillation | Oct 25, 2021 | Language Modeling | Unverified | 0 |
| Sentence Punctuation for Collaborative Commentary Generation in Esports Live-Streaming | Oct 24, 2021 | Language Modeling | Unverified | 0 |
| Text Counterfactuals via Latent Optimization and Shapley-Guided Search | Oct 22, 2021 | counterfactual, Counterfactual Explanation | Code Available | 0 |
| SLAM: A Unified Encoder for Speech and Language Modeling via Speech-Text Joint Pre-Training | Oct 20, 2021 | Language Modeling | Unverified | 0 |
| Knowledge distillation from language model to acoustic model: a hierarchical multi-task learning approach | Oct 20, 2021 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| JavaBERT: Training a transformer-based model for the Java programming language | Oct 20, 2021 | Language Modeling | Code Available | 0 |
| Knowledge Graph informed Fake News Classification via Heterogeneous Representation Ensembles | Oct 20, 2021 | Classification, Fake News Detection | Code Available | 0 |
| Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair Prediction | Oct 20, 2021 | Benchmarking, Language Modeling | Code Available | 0 |
| DEEPAGÉ: Answering Questions in Portuguese about the Brazilian Environment | Oct 19, 2021 | Language Modeling | Code Available | 0 |
| Automatic Learning of Subword Dependent Model Scales | Oct 18, 2021 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | Oct 18, 2021 | Language Modeling | Unverified | 0 |
| Reminding the Incremental Language Model via Data-Free Self-Distillation | Oct 17, 2021 | Data Augmentation, Language Modeling | Unverified | 0 |
| On the Complementarity of Data Selection and Fine Tuning for Domain Adaptation | Oct 16, 2021 | Domain Adaptation, Domain Generalization | Unverified | 0 |
| Sharpness-Aware Minimization Improves Language Model Generalization | Oct 16, 2021 | Language Modeling | Unverified | 0 |
| N-Shot Learning for Augmenting Task-Oriented Dialogue State Tracking | Oct 16, 2021 | Dialogue State Tracking, Language Modeling | Unverified | 0 |
| Prix-LM: Pretraining for Multilingual Knowledge Base Construction | Oct 16, 2021 | Bilingual Lexicon Induction, Causal Language Modeling | Code Available | 0 |
| xGQA: Cross-Lingual Visual Question Answering | Oct 16, 2021 | Cross-Lingual Transfer, Language Modeling | Unverified | 0 |
| Multilingual unsupervised sequence segmentation transfers to extremely low-resource languages | Oct 16, 2021 | Language Modeling | Code Available | 0 |
| Models In a Spelling Bee: Language Models Implicitly Learn the Character Composition of Tokens | Oct 16, 2021 | Language Modeling | Unverified | 0 |
| ASR4REAL: An extended benchmark for speech models | Oct 16, 2021 | Diversity, Language Modeling | Unverified | 0 |
| Leveraging Knowledge in Multilingual Commonsense Reasoning | Oct 16, 2021 | Language Modeling | Unverified | 0 |
| DEMix Layers: Disentangling Domains for Modular Language Modeling | Oct 16, 2021 | Language Modeling | Unverified | 0 |
| A Novel Metric for Evaluating Semantics Preservation | Oct 16, 2021 | Language Modeling | Code Available | 0 |
| BitFit: Simple Parameter-efficient Fine-tuning for Transformer-based Masked Language-models | Oct 16, 2021 | Language Modeling | Unverified | 0 |
| Echo-Attention: Attend Once and Get N Attentions for Free | Oct 16, 2021 | Language Modeling | Unverified | 0 |
| HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model Compression | Oct 16, 2021 | Few-Shot Learning, Knowledge Distillation | Code Available | 0 |
| Kronecker Decomposition for GPT Compression | Oct 15, 2021 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state tracking, Language Modeling | Code Available | 0 |