Label Smoothing for Text Mining Oct 1, 2022 Retrieval text-classification
— Unverified 00 LaMP: Language-Motion Pretraining for Motion Generation, Retrieval, and Captioning Oct 9, 2024 Large Language Model Motion Captioning
— Unverified 00 LaT: Latent Translation with Cycle-Consistency for Video-Text Retrieval Jul 11, 2022 Representation Learning Retrieval
— Unverified 00 Learning Audio-guided Video Representation with Gated Attention for Video-Text Retrieval Apr 3, 2025 Information Retrieval Representation Learning
— Unverified 00 Learning Context-Adapted Video-Text Retrieval by Attending to User Comments Sep 29, 2021 Retrieval Text Retrieval
— Unverified 00 Learning Joint Visual Semantic Matching Embeddings for Language-guided Retrieval Aug 1, 2020 Image Retrieval Retrieval
— Unverified 00 Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm Jun 3, 2020 cross-modal alignment General Classification
— Unverified 00 Learning to embed semantic similarity for joint image-text retrieval Oct 7, 2022 Image-text Retrieval Metric Learning
— Unverified 00 Learning with Noisy Correspondence Apr 13, 2024 Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence
— Unverified 00 Lecture Presentations Multimodal Dataset: Towards Understanding Multimodality in Educational Videos Jan 1, 2023 Attribute Retrieval
— Unverified 00 Leveraging Generative Language Models for Weakly Supervised Sentence Component Analysis in Video-Language Joint Learning Dec 10, 2023 Language Modeling Language Modelling
— Unverified 00 Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships May 29, 2024 Adversarial Defense Adversarial Robustness
— Unverified 00 Lifelong learning for text retrieval and recognition in historical handwritten document collections Dec 11, 2019 Deep Learning Lifelong learning
— Unverified 00 LightCLIP: Learning Multi-Level Interaction for Lightweight Vision-Language Models Dec 1, 2023 image-classification Image Classification
— Unverified 00 Linq-Embed-Mistral Technical Report Dec 4, 2024 Retrieval Text Retrieval
— Unverified 00 LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Mar 4, 2025 Contrastive Learning Image-text Retrieval
— Unverified 00 Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models Nov 17, 2017 Cross-Modal Retrieval Image-text Retrieval
— Unverified 00 LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval Mar 10, 2022 Image-text Retrieval Retrieval
— Unverified 00 LSTM-based Selective Dense Text Retrieval Guided by Sparse Lexical Retrieval Feb 15, 2025 Retrieval Text Retrieval
— Unverified 00 LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival Mar 16, 2024 Caption Generation Image-text Retrieval
— Unverified 00 LV-MAE: Learning Long Video Representations through Masked-Embedding Autoencoders Apr 4, 2025 Self-Supervised Learning Text Retrieval
— Unverified 00 M2D2: Exploring General-purpose Audio-Language Representations Beyond CLAP Mar 28, 2025 Audio captioning Audio Classification
— Unverified 00 Mamba Retriever: Utilizing Mamba for Effective and Efficient Dense Retrieval Aug 15, 2024 Information Retrieval Mamba
— Unverified 00 MAMO: Masked Multimodal Modeling for Fine-Grained Vision-Language Representation Learning Oct 9, 2022 Image-text Retrieval multimodal interaction
— Unverified 00 Masked Contrastive Pre-Training for Efficient Video-Text Retrieval Dec 2, 2022 Image-text Retrieval Retrieval
— Unverified 00 Mask to reconstruct: Cooperative Semantics Completion for Video-text Retrieval May 13, 2023 Retrieval Text Retrieval
— Unverified 00 MASS: Overcoming Language Bias in Image-Text Matching Jan 20, 2025 Image-text matching Image-text Retrieval
— Unverified 00 Maximal Matching Matters: Preventing Representation Collapse for Robust Cross-Modal Retrieval Jun 26, 2025 Cross-Modal Retrieval Image-text Retrieval
— Unverified 00 MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval Oct 30, 2023 cross-modal alignment Image-text Retrieval
— Unverified 00 Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts Feb 21, 2025 Contrastive Learning Decision Making
— Unverified 00 Memory Enhanced Embedding Learning for Cross-Modal Video-Text Retrieval Mar 29, 2021 Retrieval Text Retrieval
— Unverified 00 MeSH-based dataset for measuring the relevance of text retrieval Jul 1, 2018 Information Retrieval Retrieval
— Unverified 00 mGTE: Generalized Long-Context Text Representation and Reranking Models for Multilingual Text Retrieval Jul 29, 2024 Contrastive Learning Reranking
— Unverified 00 MIaS: Math-Aware Retrieval in Digital Mathematical Libraries Aug 28, 2018 Information Retrieval Math
— Unverified 00 MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction Apr 19, 2024 Image Reconstruction Text Retrieval
— Unverified 00 MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval May 26, 2025 Image Retrieval Large Language Model
— Unverified 00 MM-Embed: Universal Multimodal Retrieval with Multimodal LLMs Nov 4, 2024 Cross-Modal Retrieval Information Retrieval
— Unverified 00 MORE: Multi-mOdal REtrieval Augmented Generative Commonsense Reasoning Feb 21, 2024 Retrieval Text Generation
— Unverified 00 MSLKANet: A Multi-Scale Large Kernel Attention Network for Scene Text Removal Nov 12, 2022 Retrieval Text Retrieval
— Unverified 00 M-SpeechCLIP: Leveraging Large-Scale, Pre-Trained Models for Multilingual Speech to Image Retrieval Nov 2, 2022 Image Retrieval Retrieval
— Unverified 00 Multi-Head Attention Driven Dynamic Visual-Semantic Embedding for Enhanced Image-Text Matching Dec 26, 2024 Image-text matching Text Matching
— Unverified 00 Multilateral Semantic Relations Modeling for Image Text Retrieval Jan 1, 2023 Image-text Retrieval Retrieval
— Unverified 00 Multilingual Diversity Improves Vision-Language Representations May 27, 2024 Diversity Text Retrieval
— Unverified 00 Multimodal Learned Sparse Retrieval for Image Suggestion Feb 12, 2024 Image Captioning Retrieval
— Unverified 00 Multimodal Misinformation Detection using Large Vision-Language Models Jul 19, 2024 Fact Checking Fact Verification
— Unverified 00 Multiscale Matching Driven by Cross-Modal Similarity Consistency for Audio-Text Retrieval Mar 15, 2024 AudioCaps Contrastive Learning
— Unverified 00 Multi-Scale Temporal Difference Transformer for Video-Text Retrieval Jun 23, 2024 Retrieval Text Retrieval
— Unverified 00 Multi-Task Contrastive Learning for 8192-Token Bilingual Text Embeddings Feb 26, 2024 Contrastive Learning Multi-Task Learning
— Unverified 00 Named Entity and Relation Extraction with Multi-Modal Retrieval Dec 3, 2022 Mixture-of-Experts Multi-modal Named Entity Recognition
— Unverified 00 NAVERO: Unlocking Fine-Grained Semantics for Video-Language Compositionality Aug 18, 2024 Retrieval Text Retrieval
— Unverified 00