| Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training | Jul 16, 2025 | Code GenerationMath | —Unverified | 0 |
| Domain-Adaptive Small Language Models for Structured Tax Code Prediction | Jul 15, 2025 | DecoderSmall Language Model | —Unverified | 0 |
| Towards Privacy-Preserving and Personalized Smart Homes via Tailored Small Language Models | Jul 10, 2025 | Privacy PreservingSmall Language Model | —Unverified | 0 |
| Counterfactual Influence as a Distributional Quantity | Jun 25, 2025 | counterfactualimage-classification | —Unverified | 0 |
| Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | Jun 25, 2025 | ArticlesContinual Pretraining | —Unverified | 0 |
| Distilling On-device Language Models for Robot Planning with Minimal Human Intervention | Jun 20, 2025 | Small Language Model | —Unverified | 0 |
| Lightweight Relevance Grader in RAG | Jun 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance | Jun 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards a Small Language Model Lifecycle Framework | Jun 9, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction | Jun 6, 2025 | cross-modal alignmentLanguage Modeling | —Unverified | 0 |
| Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Jun 4, 2025 | Small Language Modeltext-classification | CodeCode Available | 1 |
| Adaptive Task Vectors for Large Language Models | Jun 3, 2025 | In-Context LearningSmall Language Model | —Unverified | 0 |
| Zero-Shot Vision Encoder Grafting via LLM Surrogates | May 28, 2025 | DecoderLanguage Modeling | CodeCode Available | 2 |
| A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction | May 27, 2025 | Domain AdaptationHallucination | —Unverified | 0 |
| Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster | May 24, 2025 | Heuristic SearchLanguage Modeling | —Unverified | 0 |
| Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| TinyRS-R1: Compact Multimodal Language Model for Remote Sensing | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing | May 12, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Sadeed: Advancing Arabic Diacritization Through Small Language Model | Apr 30, 2025 | Arabic Text DiacritizationBenchmarking | —Unverified | 0 |
| CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs | Apr 21, 2025 | Claim VerificationLogical Reasoning | CodeCode Available | 0 |
| Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases | Apr 8, 2025 | Data IntegrationLanguage Modeling | —Unverified | 0 |
| Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Apr 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph | Apr 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Mar 27, 2025 | EgoSchemaLanguage Modeling | CodeCode Available | 2 |
| Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object DetectionDistributed Computing | CodeCode Available | 1 |
| Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities | Mar 6, 2025 | Audio captioningLanguage Modeling | —Unverified | 0 |
| ReaderLM-v2: Small Language Model for HTML to Markdown and JSON | Mar 3, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Towards Achieving Concept Completeness for Textual Concept Bottleneck Models | Feb 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bridging the Gap: Enabling Natural Language Queries for NoSQL Databases through Text-to-NoSQL Translation | Feb 16, 2025 | Natural Language QueriesRAG | —Unverified | 0 |
| 3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning | Feb 13, 2025 | Code GenerationScene Understanding | —Unverified | 0 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training | Feb 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model | Feb 4, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | Jan 19, 2025 | In-Context LearningLanguage Modeling | CodeCode Available | 1 |
| From Superficial Patterns to Semantic Understanding: Fine-Tuning Language Models on Contrast Sets | Jan 5, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Technical Report: Small Language Model for Japanese Clinical and Medicine | Dec 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Dec 17, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Embodied CoT Distillation From LLM To Off-the-shelf Agents | Dec 16, 2024 | Decision MakingIn-Context Learning | CodeCode Available | 3 |
| Small Language Model as Data Prospector for Large Language Model | Dec 13, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services | Nov 27, 2024 | Deep Reinforcement LearningLanguage Modeling | —Unverified | 0 |
| Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? | Nov 24, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View Graphs | Nov 21, 2024 | Clone DetectionCode Search | CodeCode Available | 2 |
| RadPhi-3: Small Language Models for Radiology | Nov 19, 2024 | 4kLanguage Modeling | —Unverified | 0 |
| Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions | Nov 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SlimLM: An Efficient Small Language Model for On-Device Document Assistance | Nov 15, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SecEncoder: Logs are All You Need in Security | Nov 12, 2024 | AllLanguage Modelling | —Unverified | 0 |
| Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model | Nov 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |