| TinyLlama: An Open-Source Small Language Model | Jan 4, 2024 | Computational Efficiency, Language Modeling | Code Available | 11 |
| Embodied CoT Distillation From LLM To Off-the-shelf Agents | Dec 16, 2024 | Decision Making, In-Context Learning | Code Available | 3 |
| TinyAgent: Function Calling at the Edge | Sep 1, 2024 | Language Modelling, Quantization | Code Available | 3 |
| LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Aug 24, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Jan 4, 2024 | Language Modeling, Language Modelling | Code Available | 3 |
| Zero-Shot Vision Encoder Grafting via LLM Surrogates | May 28, 2025 | Decoder, Language Modeling | Code Available | 2 |
| Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Mar 27, 2025 | EgoSchema, Language Modeling | Code Available | 2 |
| CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View Graphs | Nov 21, 2024 | Clone Detection, Code Search | Code Available | 2 |
| PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training | Nov 7, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Jun 6, 2024 | Decoder, Inductive Bias | Code Available | 2 |
| Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Jun 4, 2025 | Small Language Model, text-classification | Code Available | 1 |
| Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Apr 7, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object Detection, Distributed Computing | Code Available | 1 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPU, Language Modeling | Code Available | 1 |
| Atla Selene Mini: A General Purpose Evaluation Model | Jan 27, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | Jan 19, 2025 | In-Context Learning, Language Modeling | Code Available | 1 |
| TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Nov 4, 2024 | Chunking, Language Modelling | Code Available | 1 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | image-classification, Image Classification | Code Available | 1 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | Attribute, AutoML | Code Available | 1 |
| SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection | Aug 22, 2024 | Hallucination, Language Modeling | Code Available | 1 |
| Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Aug 21, 2024 | Chunking, Computational Efficiency | Code Available | 1 |
| TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings | Jun 21, 2024 | Attribute, Language Modeling | Code Available | 1 |
| CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | May 1, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment | Feb 21, 2024 | Language Modelling, Question Answering | Code Available | 1 |
| Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models | Jan 24, 2024 | Hateful Meme Classification, Language Modelling | Code Available | 1 |
| PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs | May 21, 2023 | Data Augmentation, Graph Generation | Code Available | 1 |
| Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech Dataset | Dec 3, 2021 | Document Ranking, Language Modeling | Code Available | 1 |
| Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training | Jul 16, 2025 | Code Generation, Math | Unverified | 0 |
| Domain-Adaptive Small Language Models for Structured Tax Code Prediction | Jul 15, 2025 | Decoder, Small Language Model | Unverified | 0 |
| Towards Privacy-Preserving and Personalized Smart Homes via Tailored Small Language Models | Jul 10, 2025 | Privacy Preserving, Small Language Model | Unverified | 0 |
| Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | Jun 25, 2025 | Articles, Continual Pretraining | Unverified | 0 |
| Counterfactual Influence as a Distributional Quantity | Jun 25, 2025 | counterfactual, image-classification | Unverified | 0 |
| Distilling On-device Language Models for Robot Planning with Minimal Human Intervention | Jun 20, 2025 | Small Language Model | Unverified | 0 |
| Lightweight Relevance Grader in RAG | Jun 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance | Jun 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards a Small Language Model Lifecycle Framework | Jun 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction | Jun 6, 2025 | cross-modal alignment, Language Modeling | Unverified | 0 |
| Adaptive Task Vectors for Large Language Models | Jun 3, 2025 | In-Context Learning, Small Language Model | Unverified | 0 |
| A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction | May 27, 2025 | Domain Adaptation, Hallucination | Unverified | 0 |
| Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster | May 24, 2025 | Heuristic Search, Language Modeling | Unverified | 0 |
| Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission | May 17, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| TinyRS-R1: Compact Multimodal Language Model for Remote Sensing | May 17, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing | May 12, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Sadeed: Advancing Arabic Diacritization Through Small Language Model | Apr 30, 2025 | Arabic Text Diacritization, Benchmarking | Unverified | 0 |
| CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs | Apr 21, 2025 | Claim Verification, Logical Reasoning | Code Available | 0 |
| Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases | Apr 8, 2025 | Data Integration, Language Modeling | Unverified | 0 |
| Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph | Apr 2, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Mar 24, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities | Mar 6, 2025 | Audio captioning, Language Modeling | Unverified | 0 |