| Sentence Embeddings as an intermediate target in end-to-end summarisation | May 6, 2025 | SentenceSentence Embeddings | CodeCode Available | 0 |
| JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings | May 5, 2025 | Contrastive LearningSentence | CodeCode Available | 0 |
| Scalable Unit Harmonization in Medical Informatics Using Bi-directional Transformers and Bayesian-Optimized BM25 and Sentence Embedding Retrieval | May 1, 2025 | Bayesian OptimizationInformation Retrieval | —Unverified | 0 |
| Information Leakage of Sentence Embeddings via Generative Embedding Inversion Attacks | Apr 23, 2025 | SentenceSentence Embedding | CodeCode Available | 0 |
| Leveraging Language Models for Automated Patient Record Linkage | Apr 21, 2025 | BlockingData Integration | —Unverified | 0 |
| A Framework for Lightweight Responsible Prompting Recommendation | Mar 29, 2025 | SentenceSentence Embeddings | —Unverified | 0 |
| M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP | Mar 28, 2025 | Audio captioningAudio Classification | CodeCode Available | 0 |
| A Retrieval-Based Approach to Medical Procedure Matching in Romanian | Mar 26, 2025 | Medical ProcedureRetrieval | —Unverified | 0 |
| SCI-IDEA: Context-Aware Scientific Ideation Using Token and Sentence Embeddings | Mar 25, 2025 | scientific discoverySentence | —Unverified | 0 |
| CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement | Mar 21, 2025 | Dimensionality ReductionLanguage Modeling | —Unverified | 0 |
| Text Compression for Efficient Language Generation | Mar 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Domain Adaptation for Japanese Sentence Embeddings with Contrastive Learning based on Synthetic Sentence Generation | Mar 12, 2025 | Contrastive LearningDomain Adaptation | CodeCode Available | 0 |
| SINdex: Semantic INconsistency Index for Hallucination Detection in LLMs | Mar 7, 2025 | ClusteringHallucination | —Unverified | 0 |
| Exploring RWKV for Sentence Embeddings: Layer-wise Analysis and Baseline Comparison for Semantic Similarity | Feb 20, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Complex Ontology Matching with Large Language Model Embeddings | Feb 19, 2025 | Graph MatchingLanguage Modeling | —Unverified | 0 |
| Optimizing Sentence Embedding with Pseudo-Labeling and Model Ensembles: A Hierarchical Framework for Enhanced NLP Tasks | Jan 27, 2025 | Data AugmentationPseudo Label | —Unverified | 0 |
| 2-Tier SimCSE: Elevating BERT for Robust Sentence Embeddings | Jan 23, 2025 | Contrastive LearningSemantic Textual Similarity | —Unverified | 0 |
| AgentRec: Agent Recommendation Using Sentence Embeddings Aligned to Human Feedback | Jan 23, 2025 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| FuocChuVIP123 at CoMeDi Shared Task: Disagreement Ranking with XLM-Roberta Sentence Embeddings and Deep Neural Regression | Jan 21, 2025 | SentenceSentence Embeddings | —Unverified | 0 |
| Conversational Text Extraction with Large Language Models Using Retrieval-Augmented Systems | Jan 16, 2025 | Question AnsweringRAG | —Unverified | 0 |
| Enhancing Plagiarism Detection in Marathi with a Weighted Ensemble of TF-IDF and BERT Embeddings for Low-Resource Language Processing | Jan 9, 2025 | Paraphrase IdentificationSentence | CodeCode Available | 0 |
| Multi-label Cross-lingual automatic music genre classification from lyrics with Sentence BERT | Jan 7, 2025 | ClassificationGenre classification | —Unverified | 0 |
| Token Prepending: A Training-Free Approach for Eliciting Better Sentence Embeddings from LLMs | Dec 16, 2024 | Prompt EngineeringSemantic Textual Similarity | —Unverified | 0 |
| An Incremental Clustering Baseline for Event Detection on Twitter | Dec 16, 2024 | ClusteringEvent Detection | CodeCode Available | 0 |
| GEAR: A Simple GENERATE, EMBED, AVERAGE AND RANK Approach for Unsupervised Reverse Dictionary | Dec 9, 2024 | Reverse DictionarySentence | CodeCode Available | 0 |
| LuxEmbedder: A Cross-Lingual Approach to Enhanced Luxembourgish Sentence Embeddings | Dec 4, 2024 | Recommendation SystemsSentence | CodeCode Available | 0 |
| DoubleCCA: Improving Foundation Model Group Robustness with Random Sentence Embeddings | Nov 25, 2024 | SentenceSentence Embedding | —Unverified | 0 |
| HNCSE: Advancing Sentence Embeddings via Hybrid Contrastive Learning with Hard Negatives | Nov 19, 2024 | Contrastive LearningRepresentation Learning | —Unverified | 0 |
| GASE: Generatively Augmented Sentence Encoding | Nov 7, 2024 | Data AugmentationSemantic Textual Similarity | —Unverified | 0 |
| GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings | Oct 18, 2024 | Contrastive LearningMTEB Benchmark | CodeCode Available | 0 |
| Parallel Corpus Augmentation using Masked Language Models | Oct 4, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking | Sep 24, 2024 | Model SelectionSentence | CodeCode Available | 0 |
| Mitigating Semantic Leakage in Cross-lingual Embeddings via Orthogonality Constraint | Sep 24, 2024 | DisentanglementRepresentation Learning | CodeCode Available | 0 |
| Enhancing Unsupervised Sentence Embeddings via Knowledge-Driven Data Augmentation and Gaussian-Decayed Contrastive Learning | Sep 19, 2024 | Contrastive LearningData Augmentation | —Unverified | 0 |
| Exploring syntactic information in sentence embeddings through multilingual subject-verb agreement | Sep 10, 2024 | Multiple-choiceSentence | —Unverified | 0 |
| Exploring Italian sentence embeddings properties through multi-tasking | Sep 10, 2024 | SentenceSentence Embeddings | CodeCode Available | 0 |
| Extracting Sentence Embeddings from Pretrained Transformer Models | Aug 15, 2024 | ClusteringRetrieval-augmented Generation | —Unverified | 0 |
| Sign Language Translation with Sentence Embedding Supervision | Aug 14, 2024 | Gloss-free Sign Language TranslationSentence | CodeCode Available | 0 |
| New Curriculum, New Chance -- Retrieval Augmented Generation for Lesson Planning in Ugandan Secondary Schools. Prototype Quality Evaluation | Aug 14, 2024 | Retrieval-augmented GenerationSentence Embeddings | —Unverified | 0 |
| In-Context Example Selection via Similarity Search Improves Low-Resource Machine Translation | Aug 1, 2024 | DiversityIn-Context Learning | CodeCode Available | 0 |
| Open Sentence Embeddings for Portuguese with the Serafim PT* encoders family | Jul 28, 2024 | ClusteringRetrieval | —Unverified | 0 |
| Tracking linguistic information in transformer-based sentence embeddings through targeted sparsification | Jul 25, 2024 | SentenceSentence Embedding | CodeCode Available | 0 |
| Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment | Jul 20, 2024 | Contrastive LearningMultiple-choice | CodeCode Available | 0 |
| BERTer: The Efficient One | Jul 19, 2024 | Semantic Textual SimilaritySentence | —Unverified | 0 |
| Intelligent Multi-Document Summarisation for Extracting Insights on Racial Inequalities from Maternity Incident Investigation Reports | Jul 11, 2024 | ClusteringSentence | —Unverified | 0 |
| Are there identifiable structural parts in the sentence embedding whole? | Jun 24, 2024 | SentenceSentence Embedding | —Unverified | 0 |
| Towards Understanding Domain Adapted Sentence Embeddings for Document Retrieval | Jun 18, 2024 | Domain AdaptationQuestion Answering | —Unverified | 0 |
| News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation | Jun 18, 2024 | Cross-Lingual TransferDomain Adaptation | CodeCode Available | 0 |
| SparseCL: Sparse Contrastive Learning for Contradiction Retrieval | Jun 15, 2024 | Contrastive LearningFact Checking | —Unverified | 0 |
| Exploring the Correlation between Human and Machine Evaluation of Simultaneous Speech Translation | Jun 14, 2024 | Semantic SimilaritySemantic Textual Similarity | —Unverified | 0 |