| Phrase-aware Unsupervised Constituency Parsing | Nov 16, 2021 | Constituency Parsing, Language Modeling | Unverified | 0 |
| Temporal Language Modeling for Short Text Document Classification with Transformers | Nov 16, 2021 | Classification, Document Classification | Unverified | 0 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Nov 16, 2021 | Language Modeling, Language Modelling | Unverified | 0 |
| DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog | Nov 16, 2021 | dialog state tracking, Language Modeling | Unverified | 0 |
| Generative Prompt Tuning for Relation Classification | Nov 16, 2021 | Classification, Language Modeling | Unverified | 0 |
| Towards Unified Prompt Tuning for Few-shot Learning | Nov 16, 2021 | Few-Shot Learning, Language Modeling | Unverified | 0 |
| Contextual Representation Learning beyond Masked Language Modeling | Nov 16, 2021 | Language Modeling, Language Modelling | Unverified | 0 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Nov 16, 2021 | Few-Shot Text Classification, Language Modeling | Unverified | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Nov 16, 2021 | Grammatical Error Correction, Language Modeling | Unverified | 0 |
| Predicting Attention Sparsity in Transformers | Nov 16, 2021 | Decoder, Language Modeling | Unverified | 0 |