| CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference | Jun 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| cosFormer: Rethinking Softmax in Attention | Feb 17, 2022 | D4RLLanguage Modeling | CodeCode Available | 1 | 5 |
| Housekeep: Tidying Virtual Households using Commonsense Reasoning | May 22, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Automated Spinal MRI Labelling from Reports Using a Large Language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DeepStruct: Pretraining of Language Models for Structure Prediction | May 21, 2022 | coreference-resolutionCoreference Resolution | CodeCode Available | 1 | 5 |
| Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling | Oct 22, 2022 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 | 5 |
| HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation | Mar 22, 2022 | Decision MakingLanguage Modeling | CodeCode Available | 1 | 5 |
| In-context Pretraining: Language Modeling Beyond Document Boundaries | Oct 16, 2023 | In-Context LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| How does GPT-2 compute greater-than?: Interpreting mathematical abilities in a pre-trained language model | Apr 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DeLighT: Deep and Light-weight Transformer | Aug 3, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Analysing The Impact of Sequence Composition on Language Model Pre-Training | Feb 21, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| AVocaDo: Strategy for Adapting Vocabulary to Downstream Domain | Oct 26, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| History Matters: Temporal Knowledge Editing in Large Language Model | Dec 9, 2023 | knowledge editingLanguage Modeling | CodeCode Available | 1 | 5 |
| LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction | Dec 19, 2023 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 | 5 |
| Avoiding Inference Heuristics in Few-shot Prompt-based Finetuning | Sep 9, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization | Sep 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| hmBERT: Historical Multilingual Language Models for Named Entity Recognition | May 31, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| EvoMoE: An Evolutional Mixture-of-Experts Training Framework via Dense-To-Sparse Gate | Dec 29, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Describe Anything Model for Visual Question Answering on Text-rich Images | Jul 16, 2025 | DescriptiveLanguage Modeling | CodeCode Available | 1 | 5 |
| HoneyBee: Progressive Instruction Finetuning of Large Language Models for Materials Science | Oct 12, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| DesCo: Learning Object Recognition with Rich Language Descriptions | Jun 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Dependency-based Mixture Language Models | Mar 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Dependency Transformer Grammars: Integrating Dependency Structures into Transformer Language Models | Jul 24, 2024 | ARCInductive Bias | CodeCode Available | 1 | 5 |
| How does the pre-training objective affect what large language models learn about linguistic properties? | Mar 20, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |