| How does the pre-training objective affect what large language models learn about linguistic properties? | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Jan 1, 2023 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data | Jan 22, 2020 | Image RetrievalImage-text matching | —Unverified | 0 |
| Image BERT Pre-training with Online Tokenizer | Sep 29, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Low-Resource Morphological Inflection via Self-Supervised Objectives | Jun 5, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| In-Context Learning can distort the relationship between sequence likelihoods and biological fitness | Apr 23, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Investigating Masking-based Data Generation in Language Models | Jun 16, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Nov 16, 2021 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Mar 1, 2022 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| “Is Whole Word Masking Always Better for Chinese BERT?”: Probing on Chinese Grammatical Error Correction | May 1, 2022 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Joint Unsupervised and Supervised Training for Multilingual ASR | Nov 15, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering | May 6, 2022 | Contrastive LearningExtractive Question-Answering | —Unverified | 0 |
| Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search | Dec 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Nov 16, 2021 | Few-Shot Text ClassificationLanguage Modeling | —Unverified | 0 |
| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| KUL@SMM4H’22: Template Augmented Adaptive Pre-training for Tweet Classification | Oct 1, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| LakotaBERT: A Transformer-based Model for Low Resource Lakota Language | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LAnoBERT: System Log Anomaly Detection based on BERT Masked Language Model | Nov 18, 2021 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Larger-Scale Transformers for Multilingual Masked Language Modeling | May 2, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | May 30, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Enhancing Continual Learning with Global Prototypes: Counteracting Negative Representation Drift | May 24, 2022 | Continual LearningLanguage Modeling | —Unverified | 0 |