| Multilingual Normalization of Temporal Expressions with Masked Language Models | May 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state trackingLanguage Modeling | CodeCode Available | 0 | 5 |
| Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages | Jul 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining? | Mar 24, 2022 | Argument MiningLanguage Modeling | CodeCode Available | 0 | 5 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge GraphsLanguage Modeling | CodeCode Available | 0 | 5 |
| Distributionally robust self-supervised learning for tabular data | Oct 11, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Distilling Knowledge Learned in BERT for Text Generation | Nov 10, 2019 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification | Dec 8, 2023 | ClassificationFew-Shot Text Classification | CodeCode Available | 0 | 5 |
| MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection | Sep 16, 2021 | AttributeLanguage Modeling | CodeCode Available | 0 | 5 |
| A character-based steganography using masked language modeling | Jan 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity LinkingLanguage Modeling | CodeCode Available | 0 | 5 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Dict-BERT: Enhancing Language Model Pre-training with Dictionary | Oct 13, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers | Jun 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 | 5 |
| DIBERT: Dependency Injected Bidirectional Encoder Representations from Transformers | Dec 5, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Lil-Bevo: Explorations of Strategies for Training Language Models in More Humanlike Ways | Oct 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 | 5 |
| Deep Transformers with Latent Depth | Sep 28, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 0 | 5 |