| A context-aware knowledge transferring strategy for CTC-based ASR | Oct 12, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 | 5 |
| GPT-NeoX-20B: An Open-Source Autoregressive Language Model | Apr 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model | Dec 18, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Automated Spinal MRI Labelling from Reports Using a Large Language Model | Oct 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints | May 22, 2023 | DecoderLanguage Modeling | CodeCode Available | 1 | 5 |
| LLaVA-SpaceSGG: Visual Instruct Tuning for Open-vocabulary Scene Graph Generation with Enhanced Spatial Relations | Dec 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| GradInit: Learning to Initialize Neural Networks for Stable and Efficient Training | Feb 16, 2021 | Image ClassificationLanguage Modeling | CodeCode Available | 1 | 5 |
| LLatrieval: LLM-Verified Retrieval for Verifiable Generation | Nov 14, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Jul 22, 2024 | Data AugmentationLanguage Modeling | CodeCode Available | 1 | 5 |