| Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training | Jul 16, 2025 | Code Generation, Math | Unverified | 0 |
| Domain-Adaptive Small Language Models for Structured Tax Code Prediction | Jul 15, 2025 | Decoder, Small Language Model | Unverified | 0 |
| Towards Privacy-Preserving and Personalized Smart Homes via Tailored Small Language Models | Jul 10, 2025 | Privacy Preserving, Small Language Model | Unverified | 0 |
| Counterfactual Influence as a Distributional Quantity | Jun 25, 2025 | counterfactual, image-classification | Unverified | 0 |
| Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | Jun 25, 2025 | Articles, Continual Pretraining | Unverified | 0 |
| Distilling On-device Language Models for Robot Planning with Minimal Human Intervention | Jun 20, 2025 | Small Language Model | Unverified | 0 |
| Lightweight Relevance Grader in RAG | Jun 17, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance | Jun 15, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Towards a Small Language Model Lifecycle Framework | Jun 9, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction | Jun 6, 2025 | cross-modal alignment, Language Modeling | Unverified | 0 |