| Multilingual Normalization of Temporal Expressions with Masked Language Models | May 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain AdaptationLanguage Modeling | CodeCode Available | 0 | 5 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |
| Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages | Jul 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Historical Ink: Semantic Shift Detection for 19th Century Spanish | Jul 8, 2024 | Masked Language ModelingSemantic Shift Detection | CodeCode Available | 0 | 5 |
| Deep Transformers with Latent Depth | Sep 28, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | DecoderGrammatical Error Correction | CodeCode Available | 0 | 5 |
| HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | DecoderInformation Retrieval | CodeCode Available | 0 | 5 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths | Jun 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| GMAT: Global Memory Augmentation for Transformers | Jun 5, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Data Augmentation for Biomedical Factoid Question Answering | Apr 10, 2022 | Data AugmentationInformation Retrieval | CodeCode Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph LearningLanguage Modeling | CodeCode Available | 0 | 5 |
| On the Cross-lingual Transferability of Monolingual Representations | Oct 25, 2019 | Cross-Lingual Question AnsweringLanguage Modeling | CodeCode Available | 0 | 5 |
| BERTnesia: Investigating the capture and forgetting of knowledge in BERT | Jun 5, 2021 | Knowledge Base CompletionLanguage Modeling | CodeCode Available | 0 | 5 |
| An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers | May 11, 2022 | image-classificationImage Classification | CodeCode Available | 0 | 5 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Geographic Adaptation of Pretrained Language Models | Mar 16, 2022 | Language IdentificationLanguage Modeling | CodeCode Available | 0 | 5 |
| Generating Synthetic Free-text Medical Records with Low Re-identification Risk using Masked Language Modeling | Sep 15, 2024 | Causal Language ModelingDe-identification | CodeCode Available | 0 | 5 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | DecoderLanguage Modeling | CodeCode Available | 0 | 5 |