| Learning to Sample Replacements for ELECTRA Pre-Training | Jun 25, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Learning Visual Representations with Caption Annotations | Aug 4, 2020 | Image Captioning, Language Modeling | —Unverified | 0 |
| Leveraging Explicit Procedural Instructions for Data-Efficient Action Prediction | Jun 6, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| Leveraging per Image-Token Consistency for Vision-Language Pre-training | Nov 20, 2022 | Language Modeling, Language Modelling | —Unverified | 0 |
| Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection | Dec 9, 2024 | Alzheimer's Disease Detection, Automatic Speech Recognition | —Unverified | 0 |
| LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models | Dec 1, 2023 | image-classification, Image Classification | —Unverified | 0 |
| CCPL: Cross-modal Contrastive Protein Learning | Mar 19, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| LLMcap: Large Language Model for Unsupervised PCAP Failure Detection | Jul 3, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models | Mar 27, 2025 | Information Retrieval, Language Modeling | —Unverified | 0 |
| Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little | Apr 14, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Masked Language Modeling Becomes Conditional Density Estimation for Tabular Data Synthesis | May 31, 2024 | Density Estimation, Imputation | —Unverified | 0 |
| Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers | Jun 5, 2020 | Language Modeling, Language Modelling | —Unverified | 0 |
| Masked Vision and Language Modeling for Multi-modal Representation Learning | Aug 3, 2022 | cross-modal alignment, Language Modeling | —Unverified | 0 |
| MaskEval: Weighted MLM-Based Evaluation for Text Summarization and Simplification | May 24, 2022 | Language Modeling, Language Modelling | —Unverified | 0 |
| Maximizing Efficiency of Language Model Pre-training for Learning Representation | Oct 13, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Memorization Without Overfitting: Analyzing the Training Dynamics of Large Language Models | May 22, 2022 | Language Modeling, Language Modelling | —Unverified | 0 |
| MERMAID: Metaphor Generation with Symbolism and Discriminative Decoding | Jan 23, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| MG-BERT: Multi-Graph Augmented BERT for Masked Language Modeling | Jun 1, 2021 | Knowledge Graphs, Language Modeling | —Unverified | 0 |
| MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling | Aug 9, 2024 | Decoder, Language Modeling | —Unverified | 0 |
| Misinformation Detection in Social Media Video Posts | Feb 15, 2022 | Contrastive Learning, Language Modeling | —Unverified | 0 |
| Mitigating Gender Bias in Contextual Word Embeddings | Nov 18, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| MLIM: Vision-and-Language Model Pre-training with Masked Language and Image Modeling | Sep 24, 2021 | Image Reconstruction, Language Modeling | —Unverified | 0 |
| Modeling Mathematical Notation Semantics in Academic Papers | Nov 1, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| MSA Transformer | Feb 13, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| MST: Masked Self-Supervised Transformer for Visual Representation | Jun 10, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Mu^2SLAM: Multitask, Multilingual Speech and Language Models | Dec 19, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Multi-Modal Pre-Training for Automated Speech Recognition | Oct 12, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| N-gram Prediction and Word Difference Representations for Language Modeling | Sep 5, 2024 | Causal Language Modeling, Language Modeling | —Unverified | 0 |
| NICT Kyoto Submission for the WMT’21 Quality Estimation Task: Multimetric Multilingual Pretraining for Critical Error Detection | Nov 1, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| Noobs at Semeval-2021 Task 4: Masked Language Modeling for abstract answer prediction | Aug 1, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | Oct 18, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |
| SkillNet-NLU: A Sparsely Activated Model for General-Purpose Natural Language Understanding | Mar 7, 2022 | Language Modelling, Masked Language Modeling | —Unverified | 0 |
| On the Influence of Masking Policies in Intermediate Pre-training | Apr 18, 2021 | Abstractive Text Summarization, Language Modeling | —Unverified | 0 |
| OPSD: an Offensive Persian Social media Dataset and its baseline evaluations | Apr 8, 2024 | Language Modeling, Language Modelling | —Unverified | 0 |
| Mapping of attention mechanisms to a generalized Potts model | Apr 14, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| PASTA: Pretrained Action-State Transformer Agents | Jul 20, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| Patton: Language Model Pretraining on Text-Rich Networks | May 20, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| PerPLM: Personalized Fine-tuning of Pretrained Language Models via Writer-specific Intermediate Learning and Prompts | Sep 14, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| Phrase-aware Unsupervised Constituency Parsing | Nov 16, 2021 | Constituency Parsing, Language Modeling | —Unverified | 0 |
| Phrase-aware Unsupervised Constituency Parsing | May 1, 2022 | Constituency Parsing, Language Modeling | —Unverified | 0 |
| Position Masking for Language Models | Jun 2, 2020 | Language Modeling, Language Modelling | —Unverified | 0 |
| POSTECH-ETRI’s Submission to the WMT2020 APE Shared Task: Automatic Post-Editing with Cross-lingual Language Model | Nov 1, 2020 | Automatic Post-Editing, Language Modeling | —Unverified | 0 |
| Predicting Attention Sparsity in Transformers | Sep 24, 2021 | Decoder, Language Modeling | —Unverified | 0 |
| Predicting Attention Sparsity in Transformers | Nov 16, 2021 | Decoder, Language Modeling | —Unverified | 0 |
| Pre-Training and Prompting for Few-Shot Node Classification on Text-Attributed Graphs | Jul 22, 2024 | Few-Shot Learning, Graph Neural Network | —Unverified | 0 |
| Pretraining Chinese BERT for Detecting Word Insertion and Deletion Errors | Apr 26, 2022 | Language Modeling, Language Modelling | —Unverified | 0 |
| Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning | Apr 29, 2020 | All, HellaSwag | —Unverified | 0 |
| Pre-training Language Model as a Multi-perspective Course Learner | May 6, 2023 | Language Modeling, Language Modelling | —Unverified | 0 |
| Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data | Mar 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| Probing BERT’s priors with serial reproduction chains | Nov 16, 2021 | Language Modeling, Language Modelling | —Unverified | 0 |