| Adaptive KalmanNet: Data-Driven Kalman Filter with Fast Adaptation | Sep 13, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Explaining Datasets in Words: Statistical Models with Natural Language Parameters | Sep 13, 2024 | Clustering, Language Modeling | Code Available | 1 |
| Byte Pair Encoding is Suboptimal for Language Model Pretraining | Apr 7, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| BERT got a Date: Introducing Transformers to Temporal Tagging | Sep 30, 2021 | Classification, Decoder | Code Available | 1 |
| Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification | Aug 4, 2021 | Classification, Few-Shot Text Classification | Code Available | 1 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative Filtering, Language Modeling | Code Available | 1 |
| AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models | Feb 9, 2021 | Diversity, End-To-End Dialogue Modelling | Code Available | 1 |
| ByGPT5: End-to-End Style-conditioned Poetry Generation with Token-free Language Models | Dec 20, 2022 | Decoder, Language Modeling | Code Available | 1 |
| Knowledge Distillation from BERT Transformer to Speech Transformer for Intent Classification | Aug 5, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering | Nov 7, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| ExaRanker: Explanation-Augmented Neural Ranker | Jan 25, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Knowledge graph enhanced retrieval-augmented generation for failure mode and effects analysis | Jun 26, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Knowledge Graphs and Pre-trained Language Models enhanced Representation Learning for Conversational Recommender Systems | Dec 18, 2023 | Knowledge Graphs, Language Modeling | Code Available | 1 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Adaptive Input Representations for Neural Language Modeling | Sep 28, 2018 | Language Modeling, Language Modelling | Code Available | 1 |
| BERTje: A Dutch BERT Model | Dec 19, 2019 | Language Modeling, Language Modelling | Code Available | 1 |
| BERT-kNN: Adding a kNN Search Component to Pretrained Language Models for Better QA | May 2, 2020 | Information Retrieval, Language Modeling | Code Available | 1 |
| BERT Loses Patience: Fast and Robust Inference with Early Exit | Jun 7, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Evolving Deep Neural Networks | Mar 1, 2017 | Deep Learning, Image Captioning | Code Available | 1 |
| BERT, mBERT, or BiBERT? A Study on Contextualized Embeddings for Neural Machine Translation | Sep 9, 2021 | de-en, Language Modeling | Code Available | 1 |
| KnowMAN: Weakly Supervised Multinomial Adversarial Networks | Sep 16, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Excuse me, sir? Your language model is leaking (information) | Jan 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Exploiting BERT For Multimodal Target Sentiment Classification Through Input Space Translation | Aug 3, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation | Oct 14, 2022 | Fairness, Language Modeling | Code Available | 1 |
| Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning | Apr 8, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| BERTweet: A pre-trained language model for English Tweets | May 20, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Event Causality Identification via Derivative Prompt Joint Learning | Oct 1, 2022 | Event Causality Identification, Language Modeling | Code Available | 1 |
| Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training | Sep 15, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| L2MAC: Large Language Model Automatic Computer for Extensive Code Generation | Oct 2, 2023 | Code Generation, Language Modeling | Code Available | 1 |
| L^2M: Mutual Information Scaling Law for Long-Context Language Modeling | Mar 6, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Labrador: Exploring the Limits of Masked Language Modeling for Laboratory Data | Dec 9, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| LabTOP: A Unified Model for Lab Test Outcome Prediction on Electronic Health Records | Feb 20, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| LAMBERT: Layout-Aware (Language) Modeling for information extraction | Feb 19, 2020 | Key Information Extraction, Language Modeling | Code Available | 1 |
| Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family | Mar 14, 2023 | Knowledge Base Question Answering, Language Modeling | Code Available | 1 |
| Evaluating Retrieval Quality in Retrieval-Augmented Generation | Apr 21, 2024 | GPU, Language Modeling | Code Available | 1 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media | Sep 7, 2023 | Classification, Language Modeling | Code Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models | Oct 16, 2021 | counterfactual, Data Augmentation | Code Available | 1 |
| Language Conditioned Traffic Generation | Jul 16, 2023 | Decoder, Language Modeling | Code Available | 1 |
| Language Generation with Strictly Proper Scoring Rules | May 29, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts | Oct 31, 2023 | Image Captioning, Language Modeling | Code Available | 1 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8K, Language Modeling | Code Available | 1 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | Hallucination, Language Modeling | Code Available | 1 |
| Evaluation Benchmarks for Spanish Sentence Representations | Apr 15, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Model Decoding as Likelihood-Utility Alignment | Oct 13, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| SecureBERT: A Domain-Specific Language Model for Cybersecurity | Apr 6, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution | Aug 20, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | Chatbot, Language Modeling | Code Available | 1 |