| Efficient Nearest Neighbor Language Models | Sep 9, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| TruthfulQA: Measuring How Models Mimic Human Falsehoods | Sep 8, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PermuteFormer: Efficient Relative Position Encoding for Long Sequences | Sep 6, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Data Efficient Masked Language Modeling for Vision and Language | Sep 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Learning Hierarchical Structures with Differentiable Nondeterministic Stacks | Sep 5, 2021 | Inductive BiasLanguage Modeling | CodeCode Available | 1 |
| Frustratingly Simple Pretraining Alternatives to Masked Language Modeling | Sep 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Imposing Relation Structure in Language-Model Embeddings Using Contrastive Learning | Sep 2, 2021 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| -former: Infinite Memory Transformer | Sep 1, 2021 | Dialogue GenerationLanguage Modeling | CodeCode Available | 1 |
| CTAL: Pre-training Cross-modal Transformer for Audio-and-Language Representations | Sep 1, 2021 | Emotion ClassificationLanguage Modeling | CodeCode Available | 1 |
| Efficient conformer: Progressive downsampling and grouped attention for automatic speech recognition | Aug 31, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER | Aug 31, 2021 | Cross-Lingual NERData Augmentation | CodeCode Available | 1 |
| Sentence Bottleneck Autoencoders from Transformer Language Models | Aug 31, 2021 | DecoderDenoising | CodeCode Available | 1 |
| Effective Sequence-to-Sequence Dialogue State Tracking | Aug 31, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 |
| Differentiable Prompt Makes Pre-trained Language Models Better Few-shot Learners | Aug 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Selective Differential Privacy for Language Modeling | Aug 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Want To Reduce Labeling Cost? GPT-3 Can Help | Aug 30, 2021 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| Dealing with Typos for BERT-based Passage Retrieval and Ranking | Aug 27, 2021 | Information RetrievalLanguage Modeling | CodeCode Available | 1 |
| SimVLM: Simple Visual Language Model Pretraining with Weak Supervision | Aug 24, 2021 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| From Two to One: A New Scene Text Recognizer with Visual Language Modeling Network | Aug 22, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Pre-training for Ad-hoc Retrieval: Hyperlink is Also You Need | Aug 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions | Aug 20, 2021 | Code GenerationDiversity | CodeCode Available | 1 |
| SMedBERT: A Knowledge-Enhanced Pre-trained Language Model with Structured Semantics for Medical Text Mining | Aug 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Knowledge Perceived Multi-modal Pretraining in E-commerce | Aug 20, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Modeling Protein Using Large-scale Pretrain Language Model | Aug 17, 2021 | Drug DiscoveryLanguage Modeling | CodeCode Available | 1 |
| Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval | Aug 12, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| DEMix Layers: Disentangling Domains for Modular Language Modeling | Aug 11, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents | Aug 10, 2021 | Key Information ExtractionLanguage Modeling | CodeCode Available | 1 |
| Noisy Channel Language Model Prompting for Few-Shot Text Classification | Aug 9, 2021 | AttributeClassification | CodeCode Available | 1 |
| Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification | Aug 5, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Finetuning Pretrained Transformers into Variational Autoencoders | Aug 5, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Controlled Text Generation as Continuous Optimization with Multiple Constraints | Aug 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Aug 4, 2021 | ClassificationFew-Shot Text Classification | CodeCode Available | 1 |
| Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation | Aug 3, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| PLOME: Pre-training with Misspelled Knowledge for Chinese Spelling Correction | Aug 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ProtAugment: Intent Detection Meta-Learning through Unsupervised Diverse Paraphrasing | Aug 1, 2021 | DiversityIntent Detection | CodeCode Available | 1 |
| Controllable Sentence Simplification with a Unified Text-to-Text Transfer Transformer | Aug 1, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CommitBERT: Commit Message Generation Using Pre-Trained Programming Language Model | Aug 1, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Structural Guidance for Transformer Language Models | Jul 30, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving | Jul 28, 2021 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 1 |
| Pre-train, Prompt, and Predict: A Systematic Survey of Prompting Methods in Natural Language Processing | Jul 28, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| gaBERT -- an Irish Language Model | Jul 27, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Brazilian Portuguese Speech Recognition Using Wav2vec 2.0 | Jul 23, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Human-in-the-Loop for Data Collection: a Multi-Target Counter Narrative Dataset to Fight Online Hate Speech | Jul 19, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Comparison of Methods for OOV-word Recognition on a New Public Dataset | Jul 16, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Picard understanding Darmok: A Dataset and Model for Metaphor-Rich Translation in a Constructed Language | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| TAPEX: Table Pre-training via Learning a Neural SQL Executor | Jul 16, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning Skills | Jul 15, 2021 | DecoderLanguage Modeling | CodeCode Available | 1 |
| FLEX: Unifying Evaluation for Few-Shot NLP | Jul 15, 2021 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| Codified audio language modeling learns useful representations for music information retrieval | Jul 12, 2021 | Emotion RecognitionGenre classification | CodeCode Available | 1 |
| VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer | Jul 6, 2021 | Image RetrievalKnowledge Distillation | CodeCode Available | 1 |