| Are Intermediate Layers and Labels Really Necessary? A General Language Model Distillation Method | Jun 11, 2023 | Knowledge Distillation, Language Modeling | Code Available | 1 |
| Effective Attention Sheds Light On Interpretability | May 18, 2021 | Language Modeling | Code Available | 1 |
| Asynchronous Local-SGD Training for Language Modeling | Jan 17, 2024 | Distributed Optimization, Language Modeling | Code Available | 1 |
| AlephBERT: A Hebrew Large Pre-Trained Language Model to Start-off your Hebrew NLP Application With | Apr 8, 2021 | Language Modeling | Code Available | 1 |
| Effect of Pre-Training Scale on Intra- and Inter-Domain Full and Few-Shot Transfer Learning for Natural and Medical X-Ray Chest Images | May 31, 2021 | Few-Shot Learning, Image Classification | Code Available | 1 |
| CCC-wav2vec 2.0: Clustering aided Cross Contrastive Self-supervised learning of speech representations | Oct 5, 2022 | Automatic Speech Recognition (ASR), Clustering | Code Available | 1 |
| CoLA: Conditional Dropout and Language-driven Robust Dual-modal Salient Object Detection | Jul 9, 2024 | CoLA, Language Modeling | Code Available | 1 |
| KALA: Knowledge-Augmented Language Model Adaptation | Apr 22, 2022 | Domain Adaptation, General Knowledge | Code Available | 1 |
| ECG-Byte: A Tokenizer for End-to-End Generative Electrocardiogram Language Modeling | Dec 18, 2024 | Language Modeling | Code Available | 1 |
| EasyJudge: an Easy-to-use Tool for Comprehensive Response Evaluation of LLMs | Oct 13, 2024 | Language Modeling | Code Available | 1 |