SOTAVerified

Small Language Model

Papers

Showing 1-50 of 109 papers

Title | Status | Hype
TinyLlama: An Open-Source Small Language Model | Code | 11
TinyAgent: Function Calling at the Edge | Code | 3
Embodied CoT Distillation From LLM To Off-the-shelf Agents | Code | 3
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model | Code | 3
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs | Code | 3
CodeSAM: Source Code Representation Learning by Infusing Self-Attention with Multi-Code-View Graphs | Code | 2
Zero-Shot Vision Encoder Grafting via LLM Surrogates | Code | 2
Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model | Code | 2
PhoneLM: an Efficient and Capable Small Language Model Family through Principled Pre-training | Code | 2
Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis | Code | 2
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network | Code | 1
Bilinear MLPs enable weight-based mechanistic interpretability | Code | 1
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model | Code | 1
Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Code | 1
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection | Code | 1
Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models | Code | 1
Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Code | 1
PiVe: Prompting with Iterative Verification Improving Graph-based Generative Capability of LLMs | Code | 1
Collab-RAG: Boosting Retrieval-Augmented Generation for Complex Question Answering via White-Box and Black-Box LLM Collaboration | Code | 1
Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment | Code | 1
Small Language Model Makes an Effective Long Text Extractor | Code | 1
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | Code | 1
Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Code | 1
Atla Selene Mini: A General Purpose Evaluation Model | Code | 1
Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech Dataset | Code | 1
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Code | 1
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings | Code | 1
Is Training Data Quality or Quantity More Impactful to Small Language Model Performance? | Code | 0
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs | Code | 0
XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples | Code | 0
CRAVE: A Conflicting Reasoning Approach for Explainable Claim Verification Using LLMs | Code | 0
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | Code | 0
VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary | Code | 0
The Birth of Bias: A case study on the evolution of gender bias in an English language model | Code | 0
Improving In-Context Learning with Small Language Model Ensembles | Code | 0
SOUL: Towards Sentiment and Opinion Understanding of Language | Code | 0
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction | Code | 0
Preempting Text Sanitization Utility in Resource-Constrained Privacy-Preserving LLM Interactions | Code | 0
Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain | Code | 0
Lightweight Relevance Grader in RAG | Code | 0
Efficient Medical Question Answering with Knowledge-Augmented Question Generation | Code | 0
Domain-Adaptive Small Language Models for Structured Tax Code Prediction | - | 0
Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph | - | 0
Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | - | 0
Distilling On-device Language Models for Robot Planning with Minimal Human Intervention | - | 0
Biomed-Enriched: A Biomedical Dataset Enriched with LLMs for Pretraining and Extracting Rare and Hidden Content | - | 0
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs | - | 0
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models | - | 0
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages | - | 0
PerfRL: A Small Language Model Framework for Efficient Code Optimization | - | 0

No leaderboard results yet.