| Investigating Masking-based Data Generation in Language Models | Jun 16, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image EnhancementLanguage Modeling | CodeCode Available | 0 |
| Recipes for Sequential Pre-training of Multilingual Encoder and Seq2Seq Models | Jun 14, 2023 | DecoderLanguage Modeling | —Unverified | 0 |
| Absformer: Transformer-based Model for Unsupervised Multi-Document Abstractive Summarization | Jun 7, 2023 | Abstractive Text SummarizationDecoder | —Unverified | 0 |
| Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems | Jun 7, 2023 | Conversational Response SelectionDecoder | CodeCode Available | 0 |
| Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction | Jun 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fair multilingual vandalism detection system for Wikipedia | Jun 2, 2023 | Feature EngineeringLanguage Modeling | CodeCode Available | 0 |
| Understanding Augmentation-based Self-Supervised Representation Learning via RKHS Approximation and Regression | Jun 1, 2023 | Contrastive LearningData Augmentation | —Unverified | 0 |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | May 30, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Self-Evolution Learning for Discriminative Language Model Pretraining | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Dynamic Masking Rate Schedules for MLM Pretraining | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 |
| AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Federated Learning of Medical Concepts Embedding using BEHRT | May 22, 2023 | Federated LearningLanguage Modeling | CodeCode Available | 0 |
| Extrapolating Multilingual Understanding Models as Multilingual Generators | May 22, 2023 | DenoisingLanguage Modeling | —Unverified | 0 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 0 |
| A Pilot Study on Dialogue-Level Dependency Parsing for Chinese | May 21, 2023 | Dependency ParsingLanguage Modeling | —Unverified | 0 |
| Patton: Language Model Pretraining on Text-Rich Networks | May 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How does the task complexity of masked pretraining objectives affect downstream performance? | May 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Pre-training Language Model as a Multi-perspective Course Learner | May 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mapping of attention mechanisms to a generalized Potts model | Apr 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unsupervised Improvement of Factual Knowledge in Language Models | Apr 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation | Mar 20, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| CCPL: Cross-modal Contrastive Protein Learning | Mar 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Do Transformers Parse while Predicting the Masked Word? | Mar 14, 2023 | Constituency ParsingLanguage Modeling | —Unverified | 0 |
| Generating multiple-choice questions for medical question answering with distractors and cue-masking | Mar 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Domain-adapted large language models for classifying nuclear medicine reports | Mar 1, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classificationimage-classification | CodeCode Available | 0 |
| Weighted Sampling for Masked Language Modeling | Feb 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficient Masked Autoencoders with Self-Consistency | Feb 28, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Symbolic Discovery of Optimization Algorithms | Feb 13, 2023 | Contrastive Learningimage-classification | CodeCode Available | 0 |
| Capturing Topic Framing via Masked Language Modeling | Feb 7, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval | Jan 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Cohesive Distillation Architecture for Neural Language Models | Jan 12, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Jan 1, 2023 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mu^2SLAM: Multitask, Multilingual Speech and Language Models | Dec 19, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning | Dec 19, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Uniform Masking Prevails in Vision-Language Pretraining | Dec 10, 2022 | Image-text matchingLanguage Modeling | —Unverified | 0 |
| Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | Dec 4, 2022 | Common Sense Reasoningcoreference-resolution | —Unverified | 0 |
| Global memory transformer for processing long documents | Dec 3, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Comparison Study Between Token Classification and Sequence Classification In Text Classification | Nov 25, 2022 | ClassificationLanguage Modeling | —Unverified | 0 |
| Enhancing Crisis-Related Tweet Classification with Entity-Masked Language Modeling and Multi-Task Learning | Nov 21, 2022 | Hierarchical Multi-label ClassificationLanguage Modeling | CodeCode Available | 0 |
| Embracing Ambiguity: Improving Similarity-oriented Tasks with Contextual Synonym Knowledge | Nov 20, 2022 | Entity LinkingLanguage Modeling | —Unverified | 0 |