| ReaderLM-v2: Small Language Model for HTML to Markdown and JSON | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bridging the Gap: Enabling Natural Language Queries for NoSQL Databases through Text-to-NoSQL Translation | Feb 16, 2025 | Natural Language QueriesRAG | —Unverified | 0 |
| Towards Achieving Concept Completeness for Textual Concept Bottleneck Models | Feb 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| 3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning | Feb 13, 2025 | Code GenerationScene Understanding | —Unverified | 0 |
| Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Feb 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model | Feb 4, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| From Superficial Patterns to Semantic Understanding: Fine-Tuning Language Models on Contrast Sets | Jan 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Technical Report: Small Language Model for Japanese Clinical and Medicine | Dec 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Dec 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Small Language Model as Data Prospector for Large Language Model | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services | Nov 27, 2024 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? | Nov 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| RadPhi-3: Small Language Models for Radiology | Nov 19, 2024 | 4kLanguage Modeling | —Unverified | 0 |
| Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions | Nov 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SlimLM: An Efficient Small Language Model for On-Device Document Assistance | Nov 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SecEncoder: Logs are All You Need in Security | Nov 12, 2024 | AllLanguage Modelling | —Unverified | 0 |
| Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Wave Network: An Ultra-Small Language Model | Nov 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improving In-Context Learning with Small Language Model Ensembles | Oct 29, 2024 | Domain LabellingIn-Context Learning | CodeCode Available | 0 |
| Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors | Oct 25, 2024 | Reinforcement Learning (RL)Small Language Model | —Unverified | 0 |
| A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | Oct 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Methods of improving LLM training stability | Oct 22, 2024 | Small Language Model | —Unverified | 0 |
| SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments | Oct 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bridging Large Language Models and Graph Structure Learning Models for Robust Representation Learning | Oct 15, 2024 | Graph Representation LearningGraph structure learning | —Unverified | 0 |
| Generating long-horizon stock "buy" signals with a neural language model | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara | Oct 9, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | Oct 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | Oct 4, 2024 | ChatbotCross-Lingual Transfer | —Unverified | 0 |
| Cross-Domain Content Generation with Domain-Specific Small Language Models | Sep 19, 2024 | Small Language Model | —Unverified | 0 |
| Small Language Models are Equation Reasoners | Sep 19, 2024 | Arithmetic ReasoningKnowledge Distillation | —Unverified | 0 |
| Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs | Sep 17, 2024 | Language ModellingSmall Language Model | CodeCode Available | 0 |
| InkubaLM: A small language model for low-resource African languages | Aug 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Aug 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Jul 28, 2024 | AttributeFairness | CodeCode Available | 0 |
| Exploring Domain Robust Lightweight Reward Models based on Router Mechanism | Jul 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Graph-Structured Speculative Decoding | Jul 23, 2024 | Language ModellingSmall Language Model | —Unverified | 0 |
| RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring | Jul 3, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| PDSS: A Privacy-Preserving Framework for Step-by-Step Distillation of Large Language Models | Jun 18, 2024 | DecoderLanguage Modeling | —Unverified | 0 |
| HARE: HumAn pRiors, a key to small language model Efficiency | Jun 17, 2024 | DiversityLanguage Modeling | —Unverified | 0 |
| Efficient Medical Question Answering with Knowledge-Augmented Question Generation | May 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | May 8, 2024 | In-Context LearningLanguage Modeling | CodeCode Available | 0 |
| Plug and Play with Prompts: A Prompt Tuning Approach for Controlling Text Generation | Apr 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Pareto Optimal Throughput in Small Language Model Serving | Apr 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding | Feb 26, 2024 | DecoderInstruction Following | —Unverified | 0 |
| Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement | Feb 21, 2024 | Language ModellingSmall Language Model | —Unverified | 0 |
| Purifying Large Language Models by Ensembling a Small Language Model | Feb 19, 2024 | Data PoisoningLanguage Modeling | —Unverified | 0 |
| Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction | Feb 17, 2024 | Few-Shot LearningLanguage Modelling | CodeCode Available | 0 |
| Small Language Model Meets with Reinforced Vision Vocabulary | Jan 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Small Language Model Can Self-correct | Jan 14, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| PerfRL: A Small Language Model Framework for Efficient Code Optimization | Dec 9, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |