| SyncMask: Synchronized Attentional Masking for Fashion-centric Vision-Language Pretraining | Apr 1, 2024 | Contrastive Learning, Image-text matching | Unverified | 0 | 0 |
| TACO: Pre-training of Deep Transformers with Attention Convolution using Disentangled Positional Representation | Nov 16, 2021 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Tagging before Alignment: Integrating Multi-Modal Tags for Video-Text Retrieval | Jan 30, 2023 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Taking Actions Separately: A Bidirectionally-Adaptive Transfer Learning Method for Low-Resource Neural Machine Translation | Oct 1, 2022 | Generative Adversarial Network, Language Modeling | Unverified | 0 | 0 |
| BERTwich: Extending BERT's Capabilities to Model Dialectal and Noisy Text | Oct 31, 2023 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| BERT Masked Language Modeling for Co-reference Resolution | Aug 1, 2019 | General Classification, Language Modeling | Unverified | 0 | 0 |
| Target-Aware Data Augmentation for Stance Detection | Jun 1, 2021 | Data Augmentation, Language Modeling | Unverified | 0 | 0 |
| VU-BERT: A Unified framework for Visual Dialog | Feb 22, 2022 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Temporal Language Modeling for Short Text Document Classification with Transformers | Nov 16, 2021 | Classification, Document Classification | Unverified | 0 | 0 |
| TemPrompt: Multi-Task Prompt Learning for Temporal Relation Extraction in RAG-based Crowdsourcing Systems | Jun 21, 2024 | Contrastive Learning, Language Modeling | Unverified | 0 | 0 |
| TensorCoder: Dimension-Wise Attention via Tensor Representation for Natural Language Modeling | Jul 28, 2020 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| A Cohesive Distillation Architecture for Neural Language Models | Jan 12, 2023 | Knowledge Distillation, Language Modeling | Unverified | 0 | 0 |
| Text Style Transfer for Bias Mitigation using Masked Language Modeling | Jan 21, 2022 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| The Bottom-up Evolution of Representations in the Transformer: A Study with Machine Translation and Language Modeling Objectives | Sep 3, 2019 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Weighted Sampling for Masked Language Modeling | Feb 28, 2023 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| A Closer Look at Parameter Contributions When Training Neural Language and Translation Models | Oct 1, 2022 | Causal Language Modeling, Language Modeling | Unverified | 0 | 0 |
| Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling | Jan 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics | Jul 31, 2022 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Nov 16, 2021 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Token Dropping for Efficient BERT Pretraining | Mar 24, 2022 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE | Dec 4, 2022 | Common Sense Reasoning, coreference-resolution | Unverified | 0 | 0 |
| Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks | Jan 1, 2023 | Cross-Modal Retrieval, Image Captioning | Unverified | 0 | 0 |
| ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data | Jan 22, 2020 | Image Retrieval, Image-text matching | Unverified | 0 | 0 |
| Image BERT Pre-training with Online Tokenizer | Sep 29, 2021 | image-classification, Image Classification | Unverified | 0 | 0 |
| Improving BERT with Hybrid Pooling Network and Drop Mask | Jul 14, 2023 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Improving Low-Resource Morphological Inflection via Self-Supervised Objectives | Jun 5, 2025 | Decoder, Language Modeling | Unverified | 0 | 0 |
| HOP+: History-enhanced and Order-aware Pre-training for Vision-and-Language Navigation | Mar 20, 2023 | Decision Making, Language Modeling | Unverified | 0 | 0 |
| Improving the Reusability of Pre-trained Language Models in Real-world Applications | Jul 19, 2023 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| HCDIR: End-to-end Hate Context Detection, and Intensity Reduction model for online comments | Dec 20, 2023 | Hate Speech Detection, Language Modeling | Unverified | 0 | 0 |
| In-Context Learning can distort the relationship between sequence likelihoods and biological fitness | Apr 23, 2025 | In-Context Learning, Language Modeling | Unverified | 0 | 0 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | May 27, 2025 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| GraphCodeBERT: Pre-training Code Representations with Data Flow | Sep 17, 2020 | Clone Detection, Code Completion | Unverified | 0 | 0 |
| Towards Effective Time-Aware Language Representation: Exploring Enhanced Temporal Understanding in Language Models | Jun 4, 2024 | Document Dating, Language Modeling | Unverified | 0 | 0 |
| GPTs at Factify 2022: Prompt Aided Fact-Verification | Jun 29, 2022 | Fact Verification, Language Modeling | Unverified | 0 | 0 |
| Go-tuning: Improving Zero-shot Learning Abilities of Smaller Language Models | Dec 20, 2022 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Investigating Masking-based Data Generation in Language Models | Jun 16, 2023 | Data Augmentation, Language Modeling | Unverified | 0 | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Nov 16, 2021 | Grammatical Error Correction, Language Modeling | Unverified | 0 | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | Mar 1, 2022 | Grammatical Error Correction, Language Modeling | Unverified | 0 | 0 |
| "Is Whole Word Masking Always Better for Chinese BERT?": Probing on Chinese Grammatical Error Correction | May 1, 2022 | Grammatical Error Correction, Language Modeling | Unverified | 0 | 0 |
| Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Jan 3, 2024 | Data Augmentation, fill-mask | Unverified | 0 | 0 |
| Towards Making the Most of Pre-trained Translation Model for Quality Estimation | Oct 1, 2022 | Denoising, Language Modeling | Unverified | 0 | 0 |
| Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 | 0 |
| Joint Unsupervised and Supervised Training for Multilingual ASR | Nov 15, 2021 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| KECP: Knowledge Enhanced Contrastive Prompting for Few-shot Extractive Question Answering | May 6, 2022 | Contrastive Learning, Extractive Question-Answering | Unverified | 0 | 0 |
| Knowing Where to Focus: Attention-Guided Alignment for Text-based Person Search | Dec 19, 2024 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Global memory transformer for processing long documents | Dec 3, 2022 | Language Modeling, Language Modelling | Unverified | 0 | 0 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Nov 16, 2021 | Few-Shot Text Classification, Language Modeling | Unverified | 0 | 0 |
| Knowledge Distillation vs. Pretraining from Scratch under a Fixed (Computation) Budget | Apr 30, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 | 0 |
| A Transfer Learning Pipeline for Educational Resource Discovery with Application in Leading Paragraph Generation | Jan 7, 2022 | Information Retrieval, Language Modeling | Unverified | 0 | 0 |
| GeoRecon: Graph-Level Representation Learning for 3D Molecules via Reconstruction-Based Pretraining | Jun 16, 2025 | Denoising, Language Modeling | Unverified | 0 | 0 |