| PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Wave Network: An Ultra-Small Language Model | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nov 4, 2024 | ChunkingLanguage Modelling | CodeCode Available | 1 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain LabellingIn-Context Learning | CodeCode Available | 0 |
| Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors | Oct 25, 2024 | Reinforcement Learning (RL)Small Language Model | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Methods of improving LLM training stability | Oct 22, 2024 | Small Language Model | —Unverified | 0 |
| Bridging Large Language Models and Graph Structure Learning Models for Robust Representation Learning | Oct 15, 2024 | Graph Representation LearningGraph structure learning | —Unverified | 0 |
| SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments | Oct 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | image-classificationImage Classification | CodeCode Available | 1 |