SOTAVerified

Small Language Model

Papers

Showing 150 of 109 papers

TitleStatusHype
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training0
Domain-Adaptive Small Language Models for Structured Tax Code Prediction0
Towards Privacy-Preserving and Personalized Smart Homes via Tailored Small Language Models0
Counterfactual Influence as a Distributional Quantity0
Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content0
Distilling On-device Language Models for Robot Planning with Minimal Human Intervention0
Lightweight Relevance Grader in RAGCode0
HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance0
Towards a Small Language Model Lifecycle Framework0
WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction0
Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data AnnotationCode1
Adaptive Task Vectors for Large Language Models0
Zero-Shot Vision Encoder Grafting via LLM SurrogatesCode2
A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction0
Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster0
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language ModelCode0
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing0
MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing0
Sadeed: Advancing Arabic Diacritization Through Small Language Model0
CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMsCode0
Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases0
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM CollaborationCode1
Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph0
Mobile-VideoGPT: Fast and Accurate Video Understanding Language ModelCode2
Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures0
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future DirectionsCode1
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities0
ReaderLM-v2: Small Language Model for HTML to Markdown and JSON0
Towards Achieving Concept Completeness for Textual Concept Bottleneck Models0
Bridging the Gap: Enabling Natural Language Queries for NoSQL Databases through Text-to-NoSQL Translation0
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning0
Small Language Model Makes an Effective Long Text ExtractorCode1
Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training0
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model0
Atla Selene Mini: A General Purpose Evaluation ModelCode1
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language ModelCode1
From Superficial Patterns to Semantic Understanding: Fine-Tuning Language Models on Contrast Sets0
Technical Report: Small Language Model for Japanese Clinical and Medicine0
Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models0
Embodied CoT Distillation From LLM To Off-the-shelf AgentsCode3
Small Language Model as Data Prospector for Large Language Model0
JPPO: Joint Power and Prompt Optimization for Accelerated Large Language Model Services0
Is Training Data Quality or Quantity More Impactful to Small Language Model Performance?Code0
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View GraphsCode2
RadPhi-3: Small Language Models for Radiology0
Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM InteractionsCode0
SlimLM: An Efficient Small Language Model for On-Device Document Assistance0
SecEncoder: Logs are All You Need in Security0
Towards Multi-Modal Mastery: A 4.5B Parameter Truly Multi-Modal Small Language Model0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.