| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model Evaluation, Language Modeling | Code Available | 1 | 5 |
| Improved training of end-to-end attention models for speech recognition | May 8, 2018 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| AuditWen: An Open-Source Large Language Model for Audit | Oct 9, 2024 | Answer Generation, Language Modeling | Code Available | 1 | 5 |
| CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation | Sep 13, 2021 | Decoder, Denoising | Code Available | 1 | 5 |
| BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model | Apr 8, 2022 | Entity Linking, Language Modeling | Code Available | 1 | 5 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| CPM: A Large-scale Generative Chinese Pre-trained Language Model | Dec 1, 2020 | Cloze Test, Language Modeling | Code Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease Prediction, Language Modeling | Code Available | 1 | 5 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | Articles, Document Classification | Code Available | 1 | 5 |
| Crafting Large Language Models for Enhanced Interpretability | Jul 5, 2024 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Protein Structure Tokenization: Benchmarking and New Recipe | Feb 28, 2025 | Benchmarking, Language Modeling | Code Available | 1 | 5 |
| Coupling Large Language Models with Logic Programming for Robust and General Reasoning from Text | Jul 15, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Improving antibody language models with native pairing | Aug 28, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little Cost | May 1, 2022 | Contrastive Learning, Language Modeling | Code Available | 1 | 5 |
| Counterfactual Data Augmentation for Neural Machine Translation | Jun 1, 2021 | counterfactual, Data Augmentation | Code Available | 1 | 5 |
| Evaluating Language Model Finetuning Techniques for Low-resource Languages | Jun 30, 2019 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8K, Language Modeling | Code Available | 1 | 5 |
| LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention | Oct 2, 2020 | Common Sense Reasoning, Entity Typing | Code Available | 1 | 5 |
| Implicit Language Models are RNNs: Balancing Parallelization and Expressivity | Feb 10, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| LXMERT: Learning Cross-Modality Encoder Representations from Transformers | Aug 20, 2019 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Biomedical Event Extraction with Hierarchical Knowledge Graphs | Sep 20, 2020 | Event Extraction, Language Modeling | Code Available | 1 | 5 |
| Implicit Unlikelihood Training: Improving Neural Text Generation with Reinforcement Learning | Jan 11, 2021 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Evaluating Morphological Alignment of Tokenizers in 70 Languages | Jul 8, 2025 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Supervised Learning and Large Language Model Benchmarks on Mental Health Datasets: Cognitive Distortions and Suicidal Risks in Chinese Social Media | Sep 7, 2023 | Classification, Language Modeling | Code Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RL, Language Modeling | Code Available | 1 | 5 |