| Embracing Ambiguity: Improving Similarity-oriented Tasks with Contextual Synonym Knowledge | Nov 20, 2022 | Entity LinkingLanguage Modeling | —Unverified | 0 |
| Emerging Cross-lingual Structure in Pretrained Language Models | Nov 4, 2019 | Cross-Lingual TransferLanguage Modeling | —Unverified | 0 |
| Emerging Property of Masked Token for Effective Pre-training | Apr 12, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| Enabling Autoregressive Models to Fill In Masked Tokens | Feb 9, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Enhancing BERT-Based Visual Question Answering through Keyword-Driven Sentence Selection | Oct 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enhancing Domain-Specific Encoder Models with LLM-Generated Data: How to Leverage Ontologies, and How to Do Without Them | Mar 27, 2025 | Continual PretrainingLanguage Modeling | —Unverified | 0 |
| ESALE: Enhancing Code-Summary Alignment Learning for Source Code Summarization | Jul 1, 2024 | Code SummarizationDecoder | —Unverified | 0 |
| Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection | Aug 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| Exposing the Implicit Energy Networks behind Masked Language Models via Metropolis--Hastings | Jun 4, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Extrapolating Multilingual Understanding Models as Multilingual Generators | May 22, 2023 | DenoisingLanguage Modeling | —Unverified | 0 |
| FARM: Functional Group-Aware Representations for Small Molecules | Oct 2, 2024 | Contrastive LearningDrug Discovery | —Unverified | 0 |
| How Useful is Continued Pre-Training for Generative Unsupervised Domain Adaptation? | Jan 31, 2024 | ClassificationDomain Adaptation | —Unverified | 0 |
| Foundation Posteriors for Approximate Probabilistic Inference | May 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gender-tuning: Empowering Fine-tuning for Debiasing Pre-trained Language Models | Jul 20, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| General Framework for Reversible Data Hiding in Texts Based on Masked Language Modeling | Jun 21, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generating multiple-choice questions for medical question answering with distractors and cue-masking | Mar 13, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Generative Prompt Tuning for Relation Classification | Nov 16, 2021 | ClassificationLanguage Modeling | —Unverified | 0 |
| GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining | Jun 16, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Global memory transformer for processing long documents | Dec 3, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models | Dec 20, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GPTs at Factify 2022: Prompt Aided Fact-Verification | Jun 29, 2022 | Fact VerificationLanguage Modeling | —Unverified | 0 |
| GraphCodeBERT: Pre-training Code Representations with Data Flow | Sep 17, 2020 | Clone DetectionCode Completion | —Unverified | 0 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments | Dec 20, 2023 | Hate Speech DetectionLanguage Modeling | —Unverified | 0 |
| HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation | Mar 20, 2023 | Decision MakingLanguage Modeling | —Unverified | 0 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Nov 16, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Jan 1, 2023 | Cross-Modal RetrievalImage Captioning | —Unverified | 0 |
| ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data | Jan 22, 2020 | Image RetrievalImage-text matching | —Unverified | 0 |
| Image BERT Pre-training with Online Tokenizer | Sep 29, 2021 | image-classificationImage Classification | —Unverified | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving Low-Resource Morphological Inflection via Self-Supervised Objectives | Jun 5, 2025 | DecoderLanguage Modeling | —Unverified | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| In-Context Learning can distort the relationship between sequence likelihoods and biological fitness | Apr 23, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Investigating Masking-based Data Generation in Language Models | Jun 16, 2023 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Nov 16, 2021 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Mar 1, 2022 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| “Is Whole Word Masking Always Better for Chinese BERT?”: Probing on Chinese Grammatical Error Correction | May 1, 2022 | Grammatical Error CorrectionLanguage Modeling | —Unverified | 0 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentationfill-mask | —Unverified | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Joint Unsupervised and Supervised Training for Multilingual ASR | Nov 15, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering | May 6, 2022 | Contrastive LearningExtractive Question-Answering | —Unverified | 0 |
| Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search | Dec 19, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Nov 16, 2021 | Few-Shot Text ClassificationLanguage Modeling | —Unverified | 0 |
| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| KUL@SMM4H’22: Template Augmented Adaptive Pre-training for Tweet Classification | Oct 1, 2022 | Data AugmentationLanguage Modeling | —Unverified | 0 |
| LakotaBERT: A Transformer-based Model for Low Resource Lakota Language | Mar 23, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LAnoBERT: System Log Anomaly Detection based on BERT Masked Language Model | Nov 18, 2021 | Anomaly DetectionLanguage Modeling | —Unverified | 0 |
| Larger-Scale Transformers for Multilingual Masked Language Modeling | May 2, 2021 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LayoutMask: Enhance Text-Layout Interaction in Multi-modal Pre-training for Document Understanding | May 30, 2023 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Enhancing Continual Learning with Global Prototypes: Counteracting Negative Representation Drift | May 24, 2022 | Continual LearningLanguage Modeling | —Unverified | 0 |