| PromptCL: Improving Event Representation via Prompt Template and Contrastive Learning | Apr 27, 2024 | Contrastive Learning, Language Modeling | Code Available | 0 | 5 |
| DS-TOD: Efficient Domain Specialization for Task Oriented Dialog | Oct 15, 2021 | dialog state tracking, Language Modeling | Code Available | 0 | 5 |
| Probing BERT's priors with serial reproduction chains | Feb 24, 2022 | Language Modelling, Masked Language Modeling | Code Available | 0 | 5 |
| Punctuation Restoration Improves Structure Understanding Without Supervision | Feb 13, 2024 | Chunking, Language Modeling | Code Available | 0 | 5 |
| SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics | Oct 2, 2024 | Classification, Language Modeling | Code Available | 0 | 5 |
| Pre-Training of Deep Bidirectional Protein Sequence Representations with Structural Information | Nov 25, 2019 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining? | Mar 24, 2022 | Argument Mining, Language Modeling | Code Available | 0 | 5 |
| Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data | Mar 31, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 | 5 |
| Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training | Oct 14, 2022 | Hallucination, Image Augmentation | Code Available | 0 | 5 |
| Adapting Multilingual LLMs to Low-Resource Languages with Knowledge Graphs via Adapters | Jul 1, 2024 | Knowledge Graphs, Language Modeling | Code Available | 0 | 5 |
| Boosting Point-BERT by Multi-choice Tokens | Jul 27, 2022 | Few-Shot Learning, Language Modeling | Code Available | 0 | 5 |
| Distributionally robust self-supervised learning for tabular data | Oct 11, 2024 | Decoder, Language Modeling | Code Available | 0 | 5 |
| Distilling Knowledge Learned in BERT for Text Generation | Nov 10, 2019 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Boosting Prompt-Based Self-Training With Mapping-Free Automatic Verbalizer for Multi-Class Classification | Dec 8, 2023 | Classification, Few-Shot Text Classification | Code Available | 0 | 5 |
| A character-based steganography using masked language modeling | Jan 15, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Biomedical Language Models are Robust to Sub-optimal Tokenization | Jun 30, 2023 | Entity Linking, Language Modeling | Code Available | 0 | 5 |
| PEACH: Pre-Training Sequence-to-Sequence Multilingual Models for Translation with Semi-Supervised Pseudo-Parallel Document Generation | Apr 3, 2023 | Denoising, Language Modeling | Code Available | 0 | 5 |
| DiFair: A Benchmark for Disentangled Assessment of Gender Knowledge and Bias | Oct 22, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| NormFormer: Improved Transformer Pretraining with Extra Normalization | Oct 18, 2021 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| On the Cross-lingual Transferability of Monolingual Representations | Oct 25, 2019 | Cross-Lingual Question Answering, Language Modeling | Code Available | 0 | 5 |
| Dict-BERT: Enhancing Language Model Pre-training with Dictionary | Oct 13, 2021 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks | Aug 22, 2022 | All, Cross-Modal Retrieval | Code Available | 0 | 5 |
| IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach | Sep 8, 2022 | Event Causality Identification, Language Modeling | Code Available | 0 | 5 |
| Adapting Learned Sparse Retrieval for Long Documents | May 29, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Personalized Image Enhancement Featuring Masked Style Modeling | Jun 15, 2023 | Image Enhancement, Language Modeling | Code Available | 0 | 5 |
| Pre-training with Aspect-Content Text Mutual Prediction for Multi-Aspect Dense Retrieval | Aug 22, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| DIBERT: Dependency Injected Bidirectional Encoder Representations from Transformers | Dec 5, 2021 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| How transformers learn structured data: insights from hierarchical filtering | Aug 27, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| How does the task complexity of masked pretraining objectives affect downstream performance? | May 18, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Contextualized Semantic Distance between Highly Overlapped Texts | Oct 4, 2021 | Domain Adaptation, Language Modeling | Code Available | 0 | 5 |
| Mistral-SPLADE: LLMs for better Learned Sparse Retrieval | Aug 20, 2024 | Decoder, Language Modeling | Code Available | 0 | 5 |
| Honey, I Shrunk the Language: Language Model Behavior at Reduced Scale | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Historical Ink: Semantic Shift Detection for 19th Century Spanish | Jul 8, 2024 | Masked Language Modeling, Semantic Shift Detection | Code Available | 0 | 5 |
| Deep Transformers with Latent Depth | Sep 28, 2020 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Bidirectional Transformer Reranker for Grammatical Error Correction | May 22, 2023 | Decoder, Grammatical Error Correction | Code Available | 0 | 5 |
| HanTrans: An Empirical Study on Cross-Era Transferability of Chinese Pre-trained Language Model | Nov 1, 2022 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality | Feb 21, 2024 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| MeLT: Message-Level Transformer with Masked Document Representations as Pre-Training for Stance Detection | Sep 16, 2021 | Attribute, Language Modeling | Code Available | 0 | 5 |
| I-BERT: Inductive Generalization of Transformer to Arbitrary Context Lengths | Jun 18, 2020 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| GraphCodeBERT: Pre-training Code Representations with Data Flow | Sep 17, 2020 | Clone Detection, Code Completion | Code Available | 0 | 5 |
| Masked Latent Semantic Modeling: an Efficient Pre-training Alternative to Masked Language Modeling | Jul 7, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers | Jun 5, 2020 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Masked Language Models are Good Heterogeneous Graph Generalizers | Jun 6, 2025 | Graph Learning, Language Modeling | Code Available | 0 | 5 |
| Multilinguals at SemEval-2022 Task 11: Complex NER in Semantically Ambiguous Settings for Low Resource Languages | Jul 14, 2022 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Mask-Enhanced Autoregressive Prediction: Pay Less Attention to Learn More | Feb 11, 2025 | Decoder, Information Retrieval | Code Available | 0 | 5 |
| GMAT: Global Memory Augmentation for Transformers | Jun 5, 2020 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| An Investigation of Noise in Morphological Inflection | May 26, 2023 | Language Modeling, Language Modelling | Code Available | 0 | 5 |
| Data Augmentation for Biomedical Factoid Question Answering | Apr 10, 2022 | Data Augmentation, Information Retrieval | Code Available | 0 | 5 |
| Masked and Permuted Implicit Context Learning for Scene Text Recognition | May 25, 2023 | Decoder, Language Modeling | Code Available | 0 | 5 |
| BERTnesia: Investigating the capture and forgetting of knowledge in BERT | Jun 5, 2021 | Knowledge Base Completion, Language Modeling | Code Available | 0 | 5 |