Small Language Model

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 109 papers

Title	Date	Tasks	Status	Hype
PhoneLM:an Efficient and Capable Small Language Model Family through Principled Pre-training	Nov 7, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Wave Network: An Ultra-Small Language Model	Nov 4, 2024	Language ModelingLanguage Modelling	—Unverified	0
TeleOracle: Fine-Tuned Retrieval-Augmented Generation with Long-Context Support for Network	Nov 4, 2024	ChunkingLanguage Modelling	CodeCode Available	1
Improving In-Context Learning with Small Language Model Ensembles	Oct 29, 2024	Domain LabellingIn-Context Learning	CodeCode Available	0
Humanizing the Machine: Proxy Attacks to Mislead LLM Detectors	Oct 25, 2024	Reinforcement Learning (RL)Small Language Model	—Unverified	0
A Little Help Goes a Long Way: Efficient LLM Training by Leveraging Small LMs	Oct 24, 2024	Language ModelingLanguage Modelling	—Unverified	0
Methods of improving LLM training stability	Oct 22, 2024	Small Language Model	—Unverified	0
Bridging Large Language Models and Graph Structure Learning Models for Robust Representation Learning	Oct 15, 2024	Graph Representation LearningGraph structure learning	—Unverified	0
SHAKTI: A 2.5 Billion Parameter Small Language Model Optimized for Edge AI and Low-Resource Environments	Oct 15, 2024	Language ModelingLanguage Modelling	—Unverified	0
Bilinear MLPs enable weight-based mechanistic interpretability	Oct 10, 2024	image-classificationImage Classification	CodeCode Available	1
Generating long-horizon stock "buy" signals with a neural language model	Oct 9, 2024	Language ModelingLanguage Modelling	—Unverified	0
Personal Intelligence System UniLM: Hybrid On-Device Small Language Model and Server-Based Large Language Model for Malay Nusantara	Oct 9, 2024	Language ModelingLanguage Modelling	—Unverified	0
DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models	Oct 8, 2024	Language ModelingLanguage Modelling	—Unverified	0
Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages	Oct 4, 2024	ChatbotCross-Lingual Transfer	—Unverified	0
Cross-Domain Content Generation with Domain-Specific Small Language Models	Sep 19, 2024	Small Language Model	—Unverified	0
Small Language Models are Equation Reasoners	Sep 19, 2024	Arithmetic ReasoningKnowledge Distillation	—Unverified	0
Small Language Models can Outperform Humans in Short Creative Writing: A Study Comparing SLMs with Humans and LLMs	Sep 17, 2024	Language ModellingSmall Language Model	CodeCode Available	0
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model	Sep 6, 2024	AttributeAutoML	CodeCode Available	1
TinyAgent: Function Calling at the Edge	Sep 1, 2024	Language ModellingQuantization	CodeCode Available	3
Assessing Generative Language Models in Classification Tasks: Performance and Self-Evaluation Capabilities in the Environmental and Climate Change Domain	Aug 30, 2024	Language ModelingLanguage Modelling	CodeCode Available	0
InkubaLM: A small language model for low-resource African languages	Aug 30, 2024	Language ModelingLanguage Modelling	—Unverified	0
LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs	Aug 24, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
SLM Meets LLM: Balancing Latency, Interpretability and Consistency in Hallucination Detection	Aug 22, 2024	HallucinationLanguage Modeling	CodeCode Available	1
Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards	Aug 21, 2024	ChunkingComputational Efficiency	CodeCode Available	1
VersusDebias: Universal Zero-Shot Debiasing for Text-to-Image Models via SLM-Based Prompt Engineering and Generative Adversary	Jul 28, 2024	AttributeFairness	CodeCode Available	0
Exploring Domain Robust Lightweight Reward Models based on Router Mechanism	Jul 24, 2024	Language ModelingLanguage Modelling	—Unverified	0
Graph-Structured Speculative Decoding	Jul 23, 2024	Language ModellingSmall Language Model	—Unverified	0
RDBE: Reasoning Distillation-Based Evaluation Enhances Automatic Essay Scoring	Jul 3, 2024	DecoderLanguage Modeling	—Unverified	0
TinyStyler: Efficient Few-Shot Text Style Transfer with Authorship Embeddings	Jun 21, 2024	AttributeLanguage Modeling	CodeCode Available	1
PDSS: A Privacy-Preserving Framework for Step-by-Step Distillation of Large Language Models	Jun 18, 2024	DecoderLanguage Modeling	—Unverified	0
HARE: HumAn pRiors, a key to small language model Efficiency	Jun 17, 2024	DiversityLanguage Modeling	—Unverified	0
Small-E: Small Language Model with Linear Attention for Efficient Speech Synthesis	Jun 6, 2024	DecoderInductive Bias	CodeCode Available	2
Efficient Medical Question Answering with Knowledge-Augmented Question Generation	May 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	0
XAMPLER: Learning to Retrieve Cross-Lingual In-Context Examples	May 8, 2024	In-Context LearningLanguage Modeling	CodeCode Available	0
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models	May 1, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
Plug and Play with Prompts: A Prompt Tuning Approach for Controlling Text Generation	Apr 8, 2024	Language ModelingLanguage Modelling	—Unverified	0
Towards Pareto Optimal Throughput in Small Language Model Serving	Apr 4, 2024	Language ModelingLanguage Modelling	—Unverified	0
Think Big, Generate Quick: LLM-to-SLM for Fast Autoregressive Decoding	Feb 26, 2024	DecoderInstruction Following	—Unverified	0
Recursive Speculative Decoding: Accelerating LLM Inference via Sampling Without Replacement	Feb 21, 2024	Language ModellingSmall Language Model	—Unverified	0
Cognitive Visual-Language Mapper: Advancing Multimodal Comprehension with Enhanced Visual Knowledge Alignment	Feb 21, 2024	Language ModellingQuestion Answering	CodeCode Available	1
Purifying Large Language Models by Ensembling a Small Language Model	Feb 19, 2024	Data PoisoningLanguage Modeling	—Unverified	0
Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction	Feb 17, 2024	Few-Shot LearningLanguage Modelling	CodeCode Available	0
Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models	Jan 24, 2024	Hateful Meme ClassificationLanguage Modelling	CodeCode Available	1
Small Language Model Meets with Reinforced Vision Vocabulary	Jan 23, 2024	Language ModelingLanguage Modelling	—Unverified	0
Small Language Model Can Self-correct	Jan 14, 2024	Language ModelingLanguage Modelling	—Unverified	0
TinyLlama: An Open-Source Small Language Model	Jan 4, 2024	Computational EfficiencyLanguage Modeling	CodeCode Available	11
LLaVA-Phi: Efficient Multi-Modal Assistant with Small Language Model	Jan 4, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
PerfRL: A Small Language Model Framework for Efficient Code Optimization	Dec 9, 2023	Language ModelingLanguage Modelling	—Unverified	0
Recommendations by Concise User Profiles from Review Text	Nov 2, 2023	Language ModelingLanguage Modelling	—Unverified	0
SOUL: Towards Sentiment and Opinion Understanding of Language	Oct 27, 2023	Language ModellingSentiment Analysis	CodeCode Available	0

Show:10 25 50

← PrevPage 2 of 3Next →

No leaderboard results yet.