SOTAVerified

Small Language Model

Papers

Showing 2650 of 109 papers

TitleStatusHype
PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMsCode1
Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech DatasetCode1
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training0
Domain-Adaptive Small Language Models for Structured Tax Code Prediction0
Towards Privacy-Preserving and Personalized Smart Homes via Tailored Small Language Models0
Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content0
Counterfactual Influence as a Distributional Quantity0
Distilling On-device Language Models for Robot Planning with Minimal Human Intervention0
Lightweight Relevance Grader in RAGCode0
HypER: Literature-grounded Hypothesis Generation and Distillation with Provenance0
Towards a Small Language Model Lifecycle Framework0
WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction0
Adaptive Task Vectors for Large Language Models0
A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction0
Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster0
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language ModelCode0
Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission0
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing0
MilChat: Introducing Chain of Thought Reasoning and GRPO to a Multimodal Small Language Model for Remote Sensing0
Sadeed: Advancing Arabic Diacritization Through Small Language Model0
CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMsCode0
Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases0
Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph0
Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures0
Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities0
Show:102550
← PrevPage 2 of 5Next →

No leaderboard results yet.