| PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Wave Network: An Ultra-Small Language Model | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nov 4, 2024 | ChunkingLanguage Modelling | CodeCode Available | 1 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain LabellingIn-Context Learning | CodeCode Available | 0 |
| Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors | Oct 25, 2024 | Reinforcement Learning (RL)Small Language Model | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Methods of improving LLM training stability | Oct 22, 2024 | Small Language Model | —Unverified | 0 |
| Bridging Large Language Models and Graph Structure Learning Models for Robust Representation Learning | Oct 15, 2024 | Graph Representation LearningGraph structure learning | —Unverified | 0 |
| SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments | Oct 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Generating long-horizon stock "buy" signals with a neural language model | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | Oct 4, 2024 | ChatbotCross-Lingual Transfer | —Unverified | 0 |
| Cross-Domain Content Generation with Domain-Specific Small Language Models | Sep 19, 2024 | Small Language Model | —Unverified | 0 |
| Small Language Models are Equation Reasoners | Sep 19, 2024 | Arithmetic ReasoningKnowledge Distillation | —Unverified | 0 |
| Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs | Sep 17, 2024 | Language ModellingSmall Language Model | CodeCode Available | 0 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | AttributeAutoML | CodeCode Available | 1 |
| TinyAgent: Function Calling at the Edge | Sep 1, 2024 | Language ModellingQuantization | CodeCode Available | 3 |
| Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| InkubaLM: A small language model for low-resource African languages | Aug 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Aug 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection | Aug 22, 2024 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Aug 21, 2024 | ChunkingComputational Efficiency | CodeCode Available | 1 |
| VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Jul 28, 2024 | AttributeFairness | CodeCode Available | 0 |