| ActAlign: Zero-Shot Fine-Grained Video Classification via Language-Guided Sequence Alignment | Jun 28, 2025 | Dynamic Time Warping, Large Language Model | Code Available | 0 |
| Adding simple structure at inference improves Vision-Language Compositionality | Jun 11, 2025 | Attribute, Image-text Retrieval | Code Available | 0 |
| Towards Practical Defect-Focused Automated Code Review | May 23, 2025 | Defect Detection, Text Generation | Unverified | 0 |
| Redemption Score: An Evaluation Framework to Rank Image Captions While Redeeming Image Semantics and Language Pragmatics | May 22, 2025 | Image Captioning, text similarity | Unverified | 0 |
| The power of text similarity in identifying AI-LLM paraphrased documents: The case of BBC news articles and ChatGPT | May 18, 2025 | Articles, Specificity | Unverified | 0 |
| GIE-Bench: Towards Grounded Evaluation for Text-Guided Image Editing | May 16, 2025 | Instruction Following, Multiple-choice | Code Available | 1 |
| A Two-Sample Test of Text Generation Similarity | May 8, 2025 | Text Generation, text similarity | Unverified | 0 |
| JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings | May 5, 2025 | Contrastive Learning, Sentence | Code Available | 0 |
| Can LLMs Generate Tabular Summaries of Science Papers? Rethinking the Evaluation Protocol | Apr 14, 2025 | text similarity | Code Available | 0 |
| LayerFlow: Layer-wise Exploration of LLM Embeddings using Uncertainty-aware Interlinked Projections | Apr 9, 2025 | Dimensionality Reduction, text similarity | Unverified | 0 |
| CoMAC: Conversational Agent for Multi-Source Auxiliary Context with Sparse and Symmetric Latent Interactions | Mar 25, 2025 | Response Generation, text similarity | Code Available | 0 |
| TextInPlace: Indoor Visual Place Recognition in Repetitive Structures with Scene Text Spotting and Verification | Mar 9, 2025 | Robot Navigation, STS | Code Available | 1 |
| TAIL: Text-Audio Incremental Learning | Mar 6, 2025 | AudioCaps, Incremental Learning | Unverified | 0 |
| Interpretable Text Embeddings and Text Similarity Explanation: A Primer | Feb 20, 2025 | Similarity Explanation, text similarity | Unverified | 0 |
| Semantics-aware Test-time Adaptation for 3D Human Pose Estimation | Feb 15, 2025 | 3D human pose and shape estimation, 3D Human Pose Estimation | Unverified | 0 |
| FiLo++: Zero-/Few-Shot Anomaly Detection by Fused Fine-Grained Descriptions and Deformable Localization | Jan 17, 2025 | Anomaly Detection, Image-text matching | Code Available | 2 |
| SHYI: Action Support for Contrastive Learning in High-Fidelity Text-to-Image Generation | Jan 15, 2025 | Contrastive Learning, Image Generation | Unverified | 0 |
| Taxonomy-Aware Evaluation of Vision-Language Models | Jan 1, 2025 | Fine-Grained Image Classification, Language Modeling | Unverified | 0 |
| Unleashing Text-to-Image Diffusion Prior for Zero-Shot Image Captioning | Dec 31, 2024 | Caption Generation, Decoder | Unverified | 0 |
| DeepCRCEval: Revisiting the Evaluation of Code Review Comment Generation | Dec 24, 2024 | Comment Generation, Few-Shot Learning | Unverified | 0 |
| SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval | Dec 16, 2024 | Form, Information Retrieval | Code Available | 1 |
| Vulnerability of Text-Matching in ML/AI Conference Reviewer Assignments to Collusions | Dec 9, 2024 | Text Matching, text similarity | Code Available | 0 |
| TOBUGraph: Knowledge Graph-Based Retrieval for Enhanced LLM Performance Beyond RAG | Dec 6, 2024 | Chunking, Hallucination | Unverified | 0 |
| Detecting Redundant Health Survey Questions Using Language-agnostic BERT Sentence Embedding (LaBSE) | Dec 5, 2024 | Computational Efficiency, Question Similarity | Unverified | 0 |
| Patent-publication pairs for the detection of knowledge transfer from research to industry: reducing ambiguities with word embeddings and references | Dec 1, 2024 | text similarity, Transfer Learning | Unverified | 0 |
| 2D Matryoshka Training for Information Retrieval | Nov 26, 2024 | Information Retrieval, Retrieval | Code Available | 0 |
| Diagnostic Text-guided Representation Learning in Hierarchical Classification for Pathological Whole Slide Image | Nov 16, 2024 | Classification, Diagnostic | Unverified | 0 |
| Towards Cross-Modal Text-Molecule Retrieval with Better Modality Alignment | Oct 31, 2024 | Contrastive Learning, cross-modal alignment | Code Available | 0 |
| One Prompt to Verify Your Models: Black-Box Text-to-Image Models Verification via Non-Transferable Adversarial Attacks | Oct 30, 2024 | text similarity | Unverified | 0 |
| GenEOL: Harnessing the Generative Power of LLMs for Training-Free Sentence Embeddings | Oct 18, 2024 | Contrastive Learning, MTEB Benchmark | Code Available | 0 |
| Starbucks: Improved Training for 2D Matryoshka Embeddings | Oct 17, 2024 | Language Modelling, text similarity | Code Available | 1 |
| SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Oct 12, 2024 | AudioCaps, Audio captioning | Code Available | 0 |
| Calibrated Cache Model for Few-Shot Vision-Language Model Adaptation | Oct 11, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| On the Evaluation of Generative Robotic Simulations | Oct 10, 2024 | Diversity, text similarity | Unverified | 0 |
| Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments | Oct 9, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Unverified | 0 |
| VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models | Oct 1, 2024 | Hallucination, text similarity | Unverified | 0 |
| Bridging Paintings and Music -- Exploring Emotion based Music Generation through Paintings | Sep 12, 2024 | FAD, Image Captioning | Unverified | 0 |
| Pooling And Attention: What Are Effective Designs For LLM-Based Embedding Models? | Sep 4, 2024 | Information Retrieval, Retrieval | Code Available | 1 |
| What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations | Sep 4, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |
| The Russian-focused embedders' exploration: ruMTEB benchmark and Russian embedding model design | Aug 22, 2024 | Information Retrieval, Reranking | Unverified | 0 |
| Quantum Algorithms for Compositional Text Processing | Aug 12, 2024 | Question Answering, text similarity | Unverified | 0 |
| Judgment2vec: Apply Graph Analytics to Searching and Recommendation of Similar Judgments | Aug 8, 2024 | Information Retrieval, Retrieval | Unverified | 0 |
| Decoding Knowledge Claims: The Evaluation of Scientific Publication Contributions through Semantic Analysis | Jul 26, 2024 | text similarity | Unverified | 0 |
| Positive Text Reframing under Multi-strategy Optimization | Jul 25, 2024 | Re-Ranking, Text Generation | Code Available | 0 |
| A Comparative Study on Patient Language across Therapeutic Domains for Effective Patient Voice Classification in Online Health Discussions | Jul 23, 2024 | Language Modelling, text similarity | Unverified | 0 |
| Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment | Jul 20, 2024 | Contrastive Learning, Multiple-choice | Code Available | 0 |
| Towards a Holistic Framework for Multimodal Large Language Models in Three-dimensional Brain CT Report Generation | Jul 2, 2024 | Anatomy, Clinical Knowledge | Code Available | 1 |
| Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Jun 25, 2024 | cross-modal alignment, Image Classification | Code Available | 2 |
| Extrinsic Evaluation of Cultural Competence in Large Language Models | Jun 17, 2024 | Open-Ended Question Answering, Question Answering | Code Available | 0 |
| In-depth analysis of recall initiators of medical devices with a Machine Learning-Natural language Processing workflow | Jun 14, 2024 | Clustering, text similarity | Unverified | 0 |