| 2D Matryoshka Training for Information Retrieval | Nov 26, 2024 | Information RetrievalRetrieval | CodeCode Available | 0 |
| Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image | Nov 16, 2024 | ClassificationDiagnostic | —Unverified | 0 |
| Towards Cross-Modal Text-Molecule Retrieval with Better Modality Alignment | Oct 31, 2024 | Contrastive Learningcross-modal alignment | CodeCode Available | 0 |
| One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks | Oct 30, 2024 | text similarity | —Unverified | 0 |
| GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings | Oct 18, 2024 | Contrastive LearningMTEB Benchmark | CodeCode Available | 0 |
| Starbucks: Improved Training for 2D Matryoshka Embeddings | Oct 17, 2024 | Language Modellingtext similarity | CodeCode Available | 1 |
| SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Oct 12, 2024 | AudioCapsAudio captioning | CodeCode Available | 0 |
| Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation | Oct 11, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| On the Evaluation of Generative Robotic Simulations | Oct 10, 2024 | Diversitytext similarity | —Unverified | 0 |
| Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments | Oct 9, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | —Unverified | 0 |
| VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models | Oct 1, 2024 | Hallucinationtext similarity | —Unverified | 0 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FADImage Captioning | —Unverified | 0 |
| Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? | Sep 4, 2024 | Information RetrievalRetrieval | CodeCode Available | 1 |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Sep 4, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design | Aug 22, 2024 | Information RetrievalReranking | —Unverified | 0 |
| Quantum Algorithms for Compositional Text Processing | Aug 12, 2024 | Question Answeringtext similarity | —Unverified | 0 |
| Judgment2vec: Apply Graph Analytics to Searching and Recommendation of Similar Judgments | Aug 8, 2024 | Information RetrievalRetrieval | —Unverified | 0 |
| Decoding Knowledge Claims: The Evaluation of Scientific Publication Contributions through Semantic Analysis | Jul 26, 2024 | text similarity | —Unverified | 0 |
| Positive Text Reframing under Multi-strategy Optimization | Jul 25, 2024 | Re-RankingText Generation | CodeCode Available | 0 |
| A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions | Jul 23, 2024 | Language Modellingtext similarity | —Unverified | 0 |
| Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment | Jul 20, 2024 | Contrastive LearningMultiple-choice | CodeCode Available | 0 |
| Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation | Jul 2, 2024 | AnatomyClinical Knowledge | CodeCode Available | 1 |
| Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Jun 25, 2024 | cross-modal alignmentImage Classification | CodeCode Available | 2 |
| Extrinsic Evaluation of Cultural Competence in Large Language Models | Jun 17, 2024 | Open-Ended Question AnsweringQuestion Answering | CodeCode Available | 0 |
| In-depth analysis of recall initiators of medical devices with a Machine Learning-Natural language Processing workflow | Jun 14, 2024 | Clusteringtext similarity | —Unverified | 0 |