| ELMER: A Non-Autoregressive Pre-trained Language Model for Efficient and Effective Text Generation | Oct 24, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Efficient recurrent architectures through activity sparsity and sparse back-propagation through time | Jun 13, 2022 | Gesture RecognitionLanguage Modeling | CodeCode Available | 1 |
| ARS: Automatic Routing Solver with Large Language Models | Feb 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Generalization through Memorization: Nearest Neighbor Language Models | Nov 1, 2019 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 |
| ELECTRAMed: a new pre-trained language representation model for biomedical NLP | Apr 19, 2021 | Drug–drug Interaction ExtractionLanguage Modeling | CodeCode Available | 1 |
| CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CL-ReLKT: Cross-lingual Language Knowledge Transfer for Multilingual Retrieval Question Answering | Jul 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Citekit: A Modular Toolkit for Large Language Model Citation Generation | Aug 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CloudEval-YAML: A Practical Benchmark for Cloud Configuration Generation | Nov 10, 2023 | BenchmarkingCloud Computing | CodeCode Available | 1 |
| ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators | Mar 23, 2020 | GPULanguage Modeling | CodeCode Available | 1 |