| MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration | Nov 14, 2023 | Benchmarking, Language Modeling | Code Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Byte Pair Encoding is Suboptimal for Language Model Pretraining | Apr 7, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| Character-Aware Neural Language Models | Aug 26, 2015 | Language Modeling, Language Modelling | Code Available | 1 |
| Generating Query Focused Summaries from Query-Free Resources | Dec 29, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| CAB: Comprehensive Attention Benchmarking on Long Sequence Modeling | Oct 14, 2022 | Benchmarking, Language Modeling | Code Available | 1 |
| Extracting Definienda in Mathematical Scholarly Articles with Transformers | Nov 21, 2023 | Articles, Language Modeling | Code Available | 1 |
| Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics Simulations | Apr 13, 2025 | Computational Efficiency, Language Modeling | Code Available | 1 |
| Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model | Nov 3, 2022 | Language Modeling, Language Modelling | Code Available | 1 |
| Aligning Large Language Models through Synthetic Feedback | May 23, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Mapping Memes to Words for Multimodal Hateful Meme Classification | Oct 12, 2023 | Hateful Meme Classification, Language Modeling | Code Available | 1 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language Modeling, Language Modelling | Code Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band Gap, Language Modeling | Code Available | 1 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text Summarization, Language Modeling | Code Available | 1 |
| Estimating Contamination via Perplexity: Quantifying Memorisation in Language Model Evaluation | Sep 19, 2023 | Language Model Evaluation, Language Modeling | Code Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | Chatbot, Language Modeling | Code Available | 1 |
| Attention-based Contextual Language Model Adaptation for Speech Recognition | Jun 2, 2021 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |
| MATE-KD: Masked Adversarial TExt, a Companion to Knowledge Distillation | May 12, 2021 | Adversarial Text, Data Augmentation | Code Available | 1 |
| Materials Informatics Transformer: A Language Model for Interpretable Materials Properties Prediction | Aug 30, 2023 | Language Modeling, Language Modelling | Code Available | 1 |
| Aligning Knowledge Concepts to Whole Slide Images for Precise Histopathology Image Analysis | Nov 27, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identification, Language Modeling | Code Available | 1 |
| BOND: BERT-Assisted Open-Domain Named Entity Recognition with Distant Supervision | Jun 28, 2020 | Language Modeling, Language Modelling | Code Available | 1 |
| MBA-RAG: a Bandit Approach for Adaptive Retrieval-Augmented Generation through Question Complexity | Dec 2, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 |