| Less is More: Pre-Training Cross-Lingual Small-Scale Language Models with Cognitively-Plausible Curriculum Learning Strategies | Oct 30, 2024 | Language AcquisitionMasked Language Modeling | CodeCode Available | 0 | 5 |
| Leveraging Open Information Extraction for More Robust Domain Transfer of Event Trigger Detection | May 23, 2023 | Event DetectionLanguage Modeling | CodeCode Available | 0 | 5 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers | Jun 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 | 5 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection | Sep 16, 2021 | AttributeLanguage Modeling | CodeCode Available | 0 | 5 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| MSA Transformer | Feb 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Multilingual Normalization of Temporal Expressions with Masked Language Models | May 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages | Jul 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | Oct 18, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| On the Cross-lingual Transferability of Monolingual Representations | Oct 25, 2019 | Cross-Lingual Question AnsweringLanguage Modeling | CodeCode Available | 0 | 5 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | DenoisingLanguage Modeling | CodeCode Available | 0 | 5 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image EnhancementLanguage Modeling | CodeCode Available | 0 | 5 |
| Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training | Oct 14, 2022 | HallucinationImage Augmentation | CodeCode Available | 0 | 5 |
| Boosting Point-BERT by Multi-choice Tokens | Jul 27, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information | Nov 25, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data | Mar 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval | Aug 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Probing BERT's priors with serial reproduction chains | Feb 24, 2022 | Language ModellingMasked Language Modeling | CodeCode Available | 0 | 5 |
| Promises and Pitfalls of Generative Masked Language Modeling: Theoretical Framework and Practical Guidelines | Jul 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning | Apr 27, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Punctuation Restoration Improves Structure Understanding Without Supervision | Feb 13, 2024 | ChunkingLanguage Modeling | CodeCode Available | 0 | 5 |
| QueerBench: Quantifying Discrimination in Language Models Toward Queer Identities | Jun 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| RAVEN: In-Context Learning with Retrieval-Augmented Encoder-Decoder Language Models | Aug 15, 2023 | DecoderIn-Context Learning | CodeCode Available | 0 | 5 |
| ReCAM@IITK at SemEval-2021 Task 4: BERT and ALBERT based Ensemble for Abstract Word Prediction | Apr 4, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Revisiting and Advancing Chinese Natural Language Understanding with Accelerated Heterogeneous Knowledge Pre-training | Oct 11, 2022 | GPUKnowledge Graphs | CodeCode Available | 0 | 5 |
| S2SNet: A Pretrained Neural Network for Superconductivity Discovery | Jun 28, 2023 | Electrical EngineeringLanguage Modeling | CodeCode Available | 0 | 5 |
| SAS: Self-Augmentation Strategy for Language Model Pre-training | Jun 14, 2021 | Data AugmentationLanguage Modeling | CodeCode Available | 0 | 5 |
| SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics | Oct 2, 2024 | ClassificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Self-Distillation Improves DNA Sequence Inference | May 14, 2024 | Contrastive LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Self-Evolution Learning for Discriminative Language Model Pretraining | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Selfie: Self-supervised Pretraining for Image Embedding | Jun 7, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Seventeenth-Century Spanish American Notary Records for Fine-Tuning Spanish Large Language Models | Jun 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| SJ_AJ@DravidianLangTech-EACL2021: Task-Adaptive Pre-Training of Multilingual BERT models for Offensive Language Identification | Feb 1, 2021 | Language IdentificationLanguage Modeling | CodeCode Available | 0 | 5 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classificationimage-classification | CodeCode Available | 0 | 5 |
| Structural Self-Supervised Objectives for Transformers | Sep 15, 2023 | Fact VerificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning | Feb 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Symbolic Discovery of Optimization Algorithms | Feb 13, 2023 | Contrastive Learningimage-classification | CodeCode Available | 0 | 5 |
| Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text | Feb 18, 2025 | Authorship AttributionLanguage Modeling | CodeCode Available | 0 | 5 |
| Text Revision by On-the-Fly Representation Optimization | Apr 15, 2022 | AttributeLanguage Modeling | CodeCode Available | 0 | 5 |
| The Distributional Hypothesis Does Not Fully Explain the Benefits of Masked Language Model Pretraining | Oct 25, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| The Effectiveness of Masked Language Modeling and Adapters for Factual Knowledge Injection | Oct 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Towards Unified Prompt Tuning for Few-shot Text Classification | May 11, 2022 | ClassificationFew-Shot Learning | CodeCode Available | 0 | 5 |
| Towards Unifying Reference Expression Generation and Comprehension | Oct 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Transformer based neural networks for emotion recognition in conversations | May 18, 2024 | Causal Language ModelingEmotion Classification | CodeCode Available | 0 | 5 |