| DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings | Nov 25, 2024 | SentenceSentence Embedding | —Unverified | 0 |
| BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques | Nov 22, 2024 | Hate Speech DetectionKnowledge Distillation | —Unverified | 0 |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Nov 2, 2024 | Line DetectionSemantic Similarity | CodeCode Available | 1 |
| Dialectal and Low-Resource Machine Translation for Aromanian | Oct 23, 2024 | Machine TranslationSentence | —Unverified | 0 |
| GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings | Oct 18, 2024 | Contrastive LearningMTEB Benchmark | CodeCode Available | 0 |
| A new approach for fine-tuning sentence transformers for intent classification and out-of-scope detection tasks | Oct 17, 2024 | Classificationintent-classification | CodeCode Available | 0 |
| Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes | Oct 8, 2024 | ArticlesClassification | CodeCode Available | 1 |
| Black-Box Segmentation of Electronic Medical Records | Sep 29, 2024 | SegmentationSentence | —Unverified | 0 |
| An Effective Approach to Embedding Source Code by Combining Large Language and Sentence Embedding Models | Sep 23, 2024 | Clone DetectionDomain Adaptation | —Unverified | 0 |
| Towards Building Efficient Sentence BERT Models using Layer Pruning | Sep 21, 2024 | Natural Language InferenceSemantic Textual Similarity | —Unverified | 0 |
| Enhancing Unsupervised Sentence Embeddings via Knowledge-Driven Data Augmentation and Gaussian-Decayed Contrastive Learning | Sep 19, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| ConCSE: Unified Contrastive Learning and Augmentation for Code-Switched Embeddings | Aug 28, 2024 | Contrastive LearningNatural Language Inference | CodeCode Available | 0 |
| Practical token pruning for foundation models in few-shot conversational virtual assistant systems | Aug 21, 2024 | ClassificationContrastive Learning | —Unverified | 0 |
| Extracting Sentence Embeddings from Pretrained Transformer Models | Aug 15, 2024 | ClusteringRetrieval-augmented Generation | —Unverified | 0 |
| Sign Language Translation with Sentence Embedding Supervision | Aug 14, 2024 | Gloss-free Sign Language TranslationSentence | CodeCode Available | 0 |
| reCSE: Portable Reshaping Features for Sentence Embedding in Self-supervised Contrastive Learning | Aug 9, 2024 | Contrastive LearningData Augmentation | CodeCode Available | 0 |
| In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation | Aug 1, 2024 | DiversityIn-Context Learning | CodeCode Available | 0 |
| QAEA-DR: A Unified Text Augmentation Framework for Dense Retrieval | Jul 29, 2024 | Answer GenerationEvent Extraction | —Unverified | 0 |
| Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification | Jul 25, 2024 | SentenceSentence Embedding | CodeCode Available | 0 |
| Whitening Not Recommended for Classification Tasks in LLMs | Jul 16, 2024 | ClassificationSentence | —Unverified | 0 |
| Unveiling the Potential of BERTopic for Multilingual Fake News Analysis -- Use Case: Covid-19 | Jul 11, 2024 | ArticlesClustering | —Unverified | 0 |
| Are there identifiable structural parts in the sentence embedding whole? | Jun 24, 2024 | SentenceSentence Embedding | —Unverified | 0 |
| Towards Understanding Domain Adapted Sentence Embeddings for Document Retrieval | Jun 18, 2024 | Domain AdaptationQuestion Answering | —Unverified | 0 |
| Space Decomposition for Sentence Embedding | Jun 5, 2024 | Semantic Textual SimilaritySentence | CodeCode Available | 0 |
| MTEB-French: Resources for French Sentence Embedding Evaluation and Analysis | May 30, 2024 | SentenceSentence Embedding | —Unverified | 0 |