| Phrase-aware Unsupervised Constituency Parsing | Nov 16, 2021 | Constituency ParsingLanguage Modeling | —Unverified | 0 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Nov 16, 2021 | Few-Shot Text ClassificationLanguage Modeling | —Unverified | 0 |
| Probing BERT’s priors with serial reproduction chains | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Composable Sparse Fine-Tuning for Cross-Lingual Transfer | Nov 16, 2021 | Cross-Lingual TransferLanguage Modeling | —Unverified | 0 |
| DAWSON: Data Augmentation using Weak Supervision On Natural Language | Nov 16, 2021 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Unsupervised Dependency Graph Network | Nov 16, 2021 | Dependency ParsingLanguage Modeling | —Unverified | 0 |
| Prompt-Learning for Fine-Grained Entity Typing | Nov 16, 2021 | Entity TypingKnowledge Probing | —Unverified | 0 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Contextual Representation Learning beyond Masked Language Modeling | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TACO: Pre-training of Deep Transformers with Attention Convolution using Disentangled Positional Representation | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog | Nov 16, 2021 | dialog state trackingLanguage Modeling | —Unverified | 0 |
| Joint Unsupervised and Supervised Training for Multilingual ASR | Nov 15, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Modeling Mathematical Notation Semantics in Academic Papers | Nov 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| NICT Kyoto Submission for the WMT’21 Quality Estimation Task: Multimetric Multilingual Pretraining for Critical Error Detection | Nov 1, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JavaBERT: Training a transformer-based model for the Java programming language | Oct 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | Oct 18, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state trackingLanguage Modeling | CodeCode Available | 0 |
| Dict-BERT: Enhancing Language Model Pre-training with Dictionary | Oct 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Maximizing Efficiency of Language Model Pre-training for Learning Representation | Oct 13, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multi-Modal Pre-Training for Automated Speech Recognition | Oct 12, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 |
| Image BERT Pre-training with Online Tokenizer | Sep 29, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Predicting Attention Sparsity in Transformers | Sep 24, 2021 | DecoderLanguage Modeling | —Unverified | 0 |
| MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling | Sep 24, 2021 | Image ReconstructionLanguage Modeling | —Unverified | 0 |