| Estimating the Carbon Footprint of BLOOM, a 176B Parameter Language Model | Nov 3, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| Coherence boosting: When your pretrained language model is not paying enough attention | Oct 15, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Annotation-Efficient Preference Optimization for Language Model Alignment | May 22, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences | Jan 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control | Jul 12, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension | Feb 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| FANformer: Improving Large Language Models Through Effective Periodicity Modeling | Feb 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |