| JavaBERT: Training a transformer-based model for the Java programming language | Oct 20, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| Dict-BERT: Enhancing Language Model Pre-training with Dictionary | Oct 13, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification | Dec 8, 2023 | Classification, Few-Shot Text Classification | Code Available | 0 |
| Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity | Sep 5, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity Linking, Language Modeling | Code Available | 0 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | Decoder, Grammatical Error Correction | Code Available | 0 |
| DIBERT: Dependency Injected Bidirectional Encoder Representations from Transformers | Dec 5, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| On the Cross-lingual Transferability of Monolingual Representations | Oct 25, 2019 | Cross-Lingual Question Answering, Language Modeling | Code Available | 0 |
| Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks | Aug 22, 2022 | All, Cross-Modal Retrieval | Code Available | 0 |
| IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach | Sep 8, 2022 | Event Causality Identification, Language Modeling | Code Available | 0 |
| Transformer based neural networks for emotion recognition in conversations | May 18, 2024 | Causal Language Modeling, Emotion Classification | Code Available | 0 |
| BERTnesia: Investigating the capture and forgetting of knowledge in BERT | Jun 5, 2021 | Knowledge Base Completion, Language Modeling | Code Available | 0 |
| StrucTexTv2: Masked Visual-Textual Prediction for Document Image Pre-training | Mar 1, 2023 | Document Image Classification, image-classification | Code Available | 0 |
| I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths | Jun 18, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| Deep Transformers with Latent Depth | Sep 28, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| Structural Self-Supervised Objectives for Transformers | Sep 15, 2023 | Fact Verification, Language Modeling | Code Available | 0 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | Denoising, Language Modeling | Code Available | 0 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| BERT-Defense: A Probabilistic Model Based on BERT to Combat Cognitively Inspired Orthographic Adversarial Attacks | Jun 2, 2021 | Language Modeling, Language Modelling | Code Available | 0 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image Enhancement, Language Modeling | Code Available | 0 |
| AxomiyaBERTa: A Phonologically-aware Transformer Model for Assamese | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Substrate Prediction for RiPP Biosynthetic Enzymes via Masked Language Modeling and Transfer Learning | Feb 23, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training | Oct 14, 2022 | Hallucination, Image Augmentation | Code Available | 0 |
| Boosting Point-BERT by Multi-choice Tokens | Jul 27, 2022 | Few-Shot Learning, Language Modeling | Code Available | 0 |
| How does the task complexity of masked pretraining objectives affect downstream performance? | May 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Data Augmentation for Biomedical Factoid Question Answering | Apr 10, 2022 | Data Augmentation, Information Retrieval | Code Available | 0 |
| Counterfactually Probing Language Identity in Multilingual Models | Oct 29, 2023 | counterfactual, Language Modeling | Code Available | 0 |
| Symbolic Discovery of Optimization Algorithms | Feb 13, 2023 | Contrastive Learning, image-classification | Code Available | 0 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain Adaptation, Language Modeling | Code Available | 0 |
| Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Historical Ink: Semantic Shift Detection for 19th Century Spanish | Jul 8, 2024 | Masked Language Modeling, Semantic Shift Detection | Code Available | 0 |
| Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling | Apr 4, 2019 | Domain Adaptation, Language Modeling | Code Available | 0 |
| Unsupervised Improvement of Factual Knowledge in Language Models | Apr 4, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Controlling the Imprint of Passivization and Negation in Contextualized Representations | Nov 1, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information | Nov 25, 2019 | Language Modeling, Language Modelling | Code Available | 0 |
| HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model | Nov 1, 2022 | Language Modeling, Language Modelling | Code Available | 0 |
| Dial-MAE: ConTextual Masked Auto-Encoder for Retrieval-based Dialogue Systems | Jun 7, 2023 | Conversational Response Selection, Decoder | Code Available | 0 |
| GMAT: Global Memory Augmentation for Transformers | Jun 5, 2020 | Language Modeling, Language Modelling | Code Available | 0 |
| Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval | Aug 22, 2023 | Language Modeling, Language Modelling | Code Available | 0 |
| Probing BERT's priors with serial reproduction chains | Feb 24, 2022 | Language Modelling, Masked Language Modeling | Code Available | 0 |
| Unsupervised Representation Learning of Player Behavioral Data with Confidence Guided Masking | Apr 25, 2022 | Feature Engineering, Language Modeling | Code Available | 0 |
| Task-Informed Anti-Curriculum by Masking Improves Downstream Performance on Text | Feb 18, 2025 | Authorship Attribution, Language Modeling | Code Available | 0 |
| MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training | Jul 28, 2024 | Contrastive Learning, Language Modeling | Code Available | 0 |
| PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning | Apr 27, 2024 | Contrastive Learning, Language Modeling | Code Available | 0 |
| A character-based steganography using masked language modeling | Jan 15, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning | Mar 14, 2025 | Code Generation, Decoder | Code Available | 0 |
| Text Revision by On-the-Fly Representation Optimization | Apr 15, 2022 | Attribute, Language Modeling | Code Available | 0 |
| An Empirical Study Of Self-supervised Learning Approaches For Object Detection With Transformers | May 11, 2022 | image-classification, Image Classification | Code Available | 0 |
| AllenNLP Interpret: A Framework for Explaining Predictions of NLP Models | Sep 19, 2019 | Language Modeling, Language Modelling | Code Available | 0 |