| CPT: Efficient Deep Neural Network Training via Cyclic Precision | Jan 25, 2021 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CREAM: Consistency Regularized Self-Rewarding Language Models | Oct 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Improved training of end-to-end attention models for speech recognition | May 8, 2018 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| "Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Sep 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| AudioBERT: Audio Knowledge Augmented Language Model | Sep 12, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Adaptive Attention Span in Transformers | May 19, 2019 | 8kLanguage Modeling | CodeCode Available | 1 | 5 |
| CPLLM: Clinical Prediction with Large Language Models | Sep 20, 2023 | Disease PredictionLanguage Modeling | CodeCode Available | 1 | 5 |
| Improving antibody language models with native pairing | Aug 28, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Counterfactual Token Generation in Large Language Models | Sep 25, 2024 | Bias Detectioncounterfactual | CodeCode Available | 1 | 5 |
| Align-KD: Distilling Cross-Modal Alignment Knowledge for Mobile Vision-Language Model | Dec 2, 2024 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 1 | 5 |