| Lite Transformer with Long-Short Range Attention | Apr 24, 2020 | Abstractive Text SummarizationAutoML | CodeCode Available | 1 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8kLanguage Modeling | CodeCode Available | 1 |
| BioBART: Pretraining and Evaluation of A Biomedical Generative Language Model | Apr 8, 2022 | Entity LinkingLanguage Modeling | CodeCode Available | 1 |
| Evaluating Language Model Context Windows: A "Working Memory" Test and Inference-time Correction | Jul 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Language Models as Synthetic Data Generators | Dec 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |
| Bioformer: an efficient transformer language model for biomedical text mining | Feb 3, 2023 | ArticlesDocument Classification | CodeCode Available | 1 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators | Mar 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Euphemistic Phrase Detection by Masked Language Model | Sep 10, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Exchange-of-Thought: Enhancing Large Language Model Capabilities through Cross-Model Communication | Dec 4, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Espresso: A Fast End-to-end Neural Speech Recognition Toolkit | Sep 18, 2019 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| ESCOXLM-R: Multilingual Taxonomy-driven Pre-training for the Job Market Domain | May 20, 2023 | De-identificationLanguage Modeling | CodeCode Available | 1 |
| ESRL: Efficient Sampling-based Reinforcement Learning for Sequence Generation | Aug 4, 2023 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 1 |
| AttributionBench: How Hard is Automatic Attribution Evaluation? | Feb 23, 2024 | Binary ClassificationLanguage Modeling | CodeCode Available | 1 |
| Adaptive Attention Span in Computer Vision | Apr 18, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EscapeBench: Pushing Language Models to Think Outside the Box | Dec 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Biomedical Event Extraction with Hierarchical Knowledge Graphs | Sep 20, 2020 | Event ExtractionLanguage Modeling | CodeCode Available | 1 |
| Establishing baselines for generative discovery of inorganic crystals | Jan 4, 2025 | Band GapLanguage Modeling | CodeCode Available | 1 |
| Attribution Analysis Meets Model Editing: Advancing Knowledge Correction in Vision Language Models with VisEdit | Aug 19, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 |
| RARR: Researching and Revising What Language Models Say, Using Language Models | Oct 17, 2022 | Few-Shot LearningLanguage Modeling | CodeCode Available | 1 |
| Measuring Progress in Dictionary Learning for Language Model Interpretability with Board Game Models | Jul 31, 2024 | Dictionary LearningLanguage Modeling | CodeCode Available | 1 |