| Prompt Candidates, then Distill: A Teacher-Student Framework for LLM-driven Data Annotation | Jun 4, 2025 | Small Language Modeltext-classification | CodeCode Available | 1 |
| CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models | May 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model | Sep 6, 2024 | AttributeAutoML | CodeCode Available | 1 |
| Leveraging Fine-Tuned Retrieval-Augmented Generation with Long-Context Support: For 3GPP Standards | Aug 21, 2024 | ChunkingComputational Efficiency | CodeCode Available | 1 |
| Bilinear MLPs enable weight-based mechanistic interpretability | Oct 10, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Small Language Model Makes an Effective Long Text Extractor | Feb 11, 2025 | GPULanguage Modeling | CodeCode Available | 1 |
| Siamese BERT-based Model for Web Search Relevance Ranking Evaluated on a New Czech Dataset | Dec 3, 2021 | Document RankingLanguage Modeling | CodeCode Available | 1 |
| Biomedical Question Answering via Multi-Level Summarization on a Local Knowledge Graph | Apr 2, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distil-xLSTM: Learning Attention Mechanisms through Recurrent Structures | Mar 24, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distilling On-device Language Models for Robot Planning with Minimal Human Intervention | Jun 20, 2025 | Small Language Model | —Unverified | 0 |