| Large Concept Models: Language Modeling in a Sentence Representation Space | Dec 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 7 |
| 2D Matryoshka Sentence Embeddings | Feb 22, 2024 | RAGRepresentation Learning | CodeCode Available | 4 |
| Bridging Language and Items for Retrieval and Recommendation | Mar 6, 2024 | RetrievalSentence | CodeCode Available | 3 |
| Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models | Feb 19, 2025 | Contrastive LearningSentence | CodeCode Available | 2 |
| SONAR: Sentence-Level Multimodal and Language-Agnostic Representations | Aug 22, 2023 | DecoderMachine Translation | CodeCode Available | 2 |
| RetroMAE: Pre-Training Retrieval-oriented Language Models Via Masked Auto-Encoder | May 24, 2022 | DecoderInformation Retrieval | CodeCode Available | 2 |
| PromptBERT: Improving BERT Sentence Embeddings with Prompts | Jan 12, 2022 | Contrastive LearningDenoising | CodeCode Available | 2 |
| FanChuan: A Multilingual and Graph-Structured Benchmark For Parody Detection and Analysis | Feb 23, 2025 | SentenceSentence Embedding | CodeCode Available | 1 |
| CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research | Nov 2, 2024 | Line DetectionSemantic Similarity | CodeCode Available | 1 |
| Efficient Few-shot Learning for Multi-label Classification of Scientific Documents with Many Classes | Oct 8, 2024 | ArticlesClassification | CodeCode Available | 1 |