| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | May 30, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Preserving Pre-trained Features Helps Calibrate Fine-tuned Language Models | May 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Rethinking Masked Language Modeling for Chinese Spelling Correction | May 28, 2023 | DiversityDomain Generalization | CodeCode Available | 1 |
| Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 |
| Dynamic Masking Rate Schedules for MLM Pretraining | May 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Self-Evolution Learning for Discriminative Language Model Pretraining | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 |
| AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 0 |
| Extrapolating Multilingual Understanding Models as Multilingual Generators | May 22, 2023 | DenoisingLanguage Modeling | —Unverified | 0 |
| Federated Learning of Medical Concepts Embedding using BEHRT | May 22, 2023 | Federated LearningLanguage Modeling | CodeCode Available | 0 |
| A Pilot Study on Dialogue-Level Dependency Parsing for Chinese | May 21, 2023 | Dependency ParsingLanguage Modeling | —Unverified | 0 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| Patton: Language Model Pretraining on Text-Rich Networks | May 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Syllable Discovery and Cross-Lingual Generalization in a Visually Grounded, Self-Supervised Speech Model | May 19, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| How does the task complexity of masked pretraining objectives affect downstream performance? | May 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DinoSR: Self-Distillation and Online Clustering for Self-supervised Speech Representation Learning | May 17, 2023 | ClusteringLanguage Modeling | CodeCode Available | 1 |
| Pre-training Language Model as a Multi-perspective Course Learner | May 6, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mapping of attention mechanisms to a generalized Potts model | Apr 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unsupervised Improvement of Factual Knowledge in Language Models | Apr 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | DenoisingLanguage Modeling | CodeCode Available | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Fine-grained Audible Video Description | Mar 27, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Accelerating Vision-Language Pretraining with Free Language Modeling | Mar 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Cross-Modal Implicit Relation Reasoning and Aligning for Text-to-Image Person Retrieval | Mar 22, 2023 | Image-text matchingLanguage Modeling | CodeCode Available | 2 |
| HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation | Mar 20, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| CCPL: Cross-modal Contrastive Protein Learning | Mar 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Do Transformers Parse while Predicting the Masked Word? | Mar 14, 2023 | Constituency ParsingLanguage Modeling | —Unverified | 0 |
| Generating multiple-choice questions for medical question answering with distractors and cue-masking | Mar 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Domain-adapted large language models for classifying nuclear medicine reports | Mar 1, 2023 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classificationimage-classification | CodeCode Available | 0 |
| Efficient Masked Autoencoders with Self-Consistency | Feb 28, 2023 | image-classificationImage Classification | —Unverified | 0 |
| Weighted Sampling for Masked Language Modeling | Feb 28, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Symbolic Discovery of Optimization Algorithms | Feb 13, 2023 | Contrastive Learningimage-classification | CodeCode Available | 0 |
| Capturing Topic Framing via Masked Language Modeling | Feb 7, 2023 | ArticlesLanguage Modeling | —Unverified | 0 |
| Representation Deficiency in Masked Language Modeling | Feb 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval | Jan 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Debiasing the Cloze Task in Sequential Recommendation with Bidirectional Transformers | Jan 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Cohesive Distillation Architecture for Neural Language Models | Jan 12, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Jan 1, 2023 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| Cramming: Training a Language Model on a Single GPU in One Day | Dec 28, 2022 | GPULanguage Modeling | CodeCode Available | 3 |
| MicroBERT: Effective Training of Low-resource Monolingual BERTs through Parameter Reduction and Multitask Learning | Dec 23, 2022 | Dependency ParsingLanguage Modeling | CodeCode Available | 1 |
| Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mu^2SLAM: Multitask, Multilingual Speech and Language Models | Dec 19, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| APOLLO: A Simple Approach for Adaptive Pretraining of Language Models for Logical Reasoning | Dec 19, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| Efficient Pre-training of Masked Language Model via Concept-based Curriculum Masking | Dec 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Uniform Masking Prevails in Vision-Language Pretraining | Dec 10, 2022 | Image-text matchingLanguage Modeling | —Unverified | 0 |