| Developing Language Resources and NLP Tools for the North Korean Language | Jun 1, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Discovering Financial Hypernyms by Prompting Masked Language Models | Jun 1, 2022 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-training | Jun 1, 2022 | Contrastive LearningCross-Lingual Transfer | CodeCode Available | 1 |
| Training and Inference on Any-Order Autoregressive Models the Right Way | May 26, 2022 | Image InpaintingLanguage Modeling | CodeCode Available | 1 |
| Transcormer: Transformer for Sentence Scoring with Sliding Language Modeling | May 25, 2022 | Causal Language ModelingLanguage Modeling | CodeCode Available | 1 |
| MaskEval: Weighted MLM-Based Evaluation for Text Summarization and Simplification | May 24, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | DecoderInformation Retrieval | CodeCode Available | 2 |
| Enhancing Continual Learning with Global Prototypes: Counteracting Negative Representation Drift | May 24, 2022 | Continual LearningLanguage Modeling | —Unverified | 0 |
| Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models | May 22, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Multilingual Normalization of Temporal Expressions with Masked Language Models | May 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Foundation Posteriors for Approximate Probabilistic Inference | May 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Unified Prompt Tuning for Few-shot Text Classification | May 11, 2022 | ClassificationFew-Shot Learning | CodeCode Available | 0 |
| An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers | May 11, 2022 | image-classificationImage Classification | CodeCode Available | 0 |
| KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering | May 6, 2022 | Contrastive LearningExtractive Question-Answering | CodeCode Available | 0 |
| Declaration-based Prompt Tuning for Visual Question Answering | May 5, 2022 | Image-text matchingLanguage Modeling | CodeCode Available | 1 |
| Contrastive Learning for Prompt-Based Few-Shot Language Learners | May 3, 2022 | Contrastive LearningIn-Context Learning | CodeCode Available | 1 |
| Adversarial Soft Prompt Tuning for Cross-Domain Sentiment Analysis | May 1, 2022 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Unsupervised Dependency Graph Network | May 1, 2022 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| Phrase-aware Unsupervised Constituency Parsing | May 1, 2022 | Constituency ParsingLanguage Modeling | —Unverified | 0 |
| Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates | May 1, 2022 | Cross-Lingual Natural Language InferenceCross-Lingual Transfer | CodeCode Available | 0 |
| DS-TOD: Efficient Domain Specialization for Task-Oriented Dialog | May 1, 2022 | dialog state trackingLanguage Modeling | CodeCode Available | 0 |
| “Is Whole Word Masking Always Better for Chinese BERT?”: Probing on Chinese Grammatical Error Correction | May 1, 2022 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| Vision-Language Pre-Training for Boosting Scene Text Detectors | Apr 29, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 |
| A Comprehensive Understanding of Code-mixed Language Semantics using Hierarchical Transformer | Apr 27, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Pretraining Chinese BERT for Detecting Word Insertion and Deletion Errors | Apr 26, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unsupervised Representation Learning of Player Behavioral Data with Confidence Guided Masking | Apr 25, 2022 | Feature EngineeringLanguage Modeling | CodeCode Available | 0 |
| LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking | Apr 18, 2022 | cross-modal alignmentDocument AI | CodeCode Available | 0 |
| WordAlchemy: A transformer-based Reverse Dictionary | Apr 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SimpleBERT: A Pre-trained Model That Learns to Generate Simple Words | Apr 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Text Revision by On-the-Fly Representation Optimization | Apr 15, 2022 | AttributeLanguage Modeling | CodeCode Available | 0 |
| Generative power of a protein language model trained on multiple sequence alignments | Apr 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| What Language Model Architecture and Pretraining Objective Work Best for Zero-Shot Generalization? | Apr 12, 2022 | DecoderLanguage Modeling | CodeCode Available | 1 |
| Data Augmentation for Biomedical Factoid Question Answering | Apr 10, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 0 |
| Contextual Representation Learning beyond Masked Language Modeling | Apr 8, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SecureBERT: A Domain-Specific Language Model for Cybersecurity | Apr 6, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| POS-BERT: Point Cloud One-Stage BERT Pre-Training | Apr 3, 2022 | Contrastive LearningLanguage Modeling | CodeCode Available | 1 |
| Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data | Mar 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 |
| LinkBERT: Pretraining Language Models with Document Links | Mar 29, 2022 | Document ClassificationLanguage Modeling | CodeCode Available | 2 |
| Token Dropping for Efficient BERT Pretraining | Mar 24, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining? | Mar 24, 2022 | Argument MiningLanguage Modeling | CodeCode Available | 0 |
| What to Hide from Your Students: Attention-Guided Masked Image Modeling | Mar 23, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation | Mar 22, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Mar 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Geographic Adaptation of Pretrained Language Models | Mar 16, 2022 | Language IdentificationLanguage Modeling | CodeCode Available | 0 |
| SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding | Mar 7, 2022 | Language ModellingMasked Language Modeling | —Unverified | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Mar 1, 2022 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| Probing BERT's priors with serial reproduction chains | Feb 24, 2022 | Language ModellingMasked Language Modeling | CodeCode Available | 0 |
| VU-BERT: A Unified framework for Visual Dialog | Feb 22, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Transformer Quality in Linear Time | Feb 21, 2022 | 8kLanguage Modeling | CodeCode Available | 1 |
| Should You Mask 15% in Masked Language Modeling? | Feb 16, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |