BM25S: Orders of magnitude faster lexical search via eager sparse scoring Jul 4, 2024 Passage Retrieval Retrieval
Code Code Available 5Neurocache: Efficient Vector Retrieval for Long-range Language Modeling Jul 2, 2024 Few-Shot Learning Language Modeling
Code Code Available 0SignCLIP: Connecting Text and Sign Language by Contrastive Learning Jul 1, 2024 Contrastive Learning Retrieval
Code Code Available 1Memory^3: Language Modeling with Explicit Memory Jul 1, 2024 Language Modeling Language Modelling
— Unverified 0CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding Evaluation Jul 1, 2024 Image-text Retrieval Question Answering
Code Code Available 1PathAlign: A vision-language model for whole slide images in histopathology Jun 27, 2024 Diagnostic Image Retrieval
— Unverified 0Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning Jun 26, 2024 Contrastive Learning Cross-Modal Retrieval
Code Code Available 0ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling Jun 25, 2024 Cross-Modal Retrieval Natural Language Queries
— Unverified 0Multi-Scale Temporal Difference Transformer for Video-Text Retrieval Jun 23, 2024 Retrieval Text Retrieval
— Unverified 0Evaluating D-MERIT of Partial-annotation on Information Retrieval Jun 23, 2024 Information Retrieval Passage Retrieval
— Unverified 0RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation Jun 20, 2024 Information Retrieval Retrieval
— Unverified 0Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2024 Jun 18, 2024 Ensemble Learning Multi-Instance Retrieval
Code Code Available 0News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation Jun 18, 2024 Cross-Lingual Transfer Domain Adaptation
Code Code Available 0Unifying Multimodal Retrieval via Document Screenshot Embedding Jun 17, 2024 Language Modelling Natural Questions
— Unverified 0Composing Object Relations and Attributes for Image-Text Matching Jun 17, 2024 Attribute Graph Attention
Code Code Available 1BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval Jun 14, 2024 Image Retrieval Image to text
Code Code Available 0Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI Jun 13, 2024 In-Context Learning Information Retrieval
— Unverified 0Towards Vision-Language Geo-Foundation Model: A Survey Jun 13, 2024 Earth Observation Image Captioning
Code Code Available 2Which Country Is This? Automatic Country Ranking of Street View Photos Jun 11, 2024 Retrieval Text Retrieval
Code Code Available 0Bridging Language Gaps in Audio-Text Retrieval Jun 11, 2024 AudioCaps Retrieval
Code Code Available 1RWKV-CLIP: A Robust Vision-Language Representation Learner Jun 11, 2024 Image-text Retrieval Representation Learning
Code Code Available 2Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval Jun 9, 2024 Image-text Retrieval Person Retrieval
— Unverified 0Diving Deep into the Motion Representation of Video-Text Models Jun 7, 2024 Retrieval Text Retrieval
Code Code Available 0A Bi-metric Framework for Fast Similarity Search Jun 5, 2024 MTEB Benchmark Re-Ranking
Code Code Available 0HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model Jun 1, 2024 Action Recognition Activity Recognition
— Unverified 0Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training May 30, 2024 Image-text Retrieval Language Modeling
— Unverified 0Uncertainty-aware sign language video retrieval with probability distribution modeling May 30, 2024 Retrieval Sign Language Retrieval
— Unverified 0Jina CLIP: Your CLIP Model Is Also Your Text Retriever May 30, 2024 Information Retrieval Retrieval
— Unverified 0Transcending Fusion: A Multi-Scale Alignment Method for Remote Sensing Image-Text Retrieval May 29, 2024 cross-modal alignment Image-text Retrieval
Code Code Available 1Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships May 29, 2024 Adversarial Defense Adversarial Robustness
— Unverified 0LDMol: Text-to-Molecule Diffusion Model with Structurally Informative Latent Space May 28, 2024 Contrastive Learning Decoder
Code Code Available 1Multilingual Diversity Improves Vision-Language Representations May 27, 2024 Diversity Text Retrieval
— Unverified 0Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration May 26, 2024 Information Retrieval Retrieval
Code Code Available 1Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning May 26, 2024 Image to text Image-to-Text Retrieval
— Unverified 0Accelerating Transformers with Spectrum-Preserving Token Merging May 25, 2024 image-classification Image Classification
Code Code Available 2An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval May 25, 2024 Retrieval Text Retrieval
— Unverified 0Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples May 25, 2024 Active Learning Image-text Retrieval
— Unverified 0ProtT3: Protein-to-Text Generation for Text-based Protein Understanding May 21, 2024 Property Prediction Question Answering
Code Code Available 2PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning May 16, 2024 Image-text Retrieval Representation Learning
Code Code Available 1Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation May 16, 2024 AudioCaps Event Detection
Code Code Available 1Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval May 14, 2024 Cross-Modal Retrieval Cross-Modal Retrieval on RSITMD
— Unverified 0RETTA: Retrieval-Enhanced Test-Time Adaptation for Zero-Shot Video Captioning May 11, 2024 Image-text matching Retrieval
— Unverified 0Explaining Text Similarity in Transformer Models May 10, 2024 Information Retrieval Retrieval
Code Code Available 0ProCIS: A Benchmark for Proactive Retrieval in Conversations May 10, 2024 Retrieval Text Retrieval
Code Code Available 0Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning May 7, 2024 Benchmarking Contrastive Learning
Code Code Available 0Exploiting Positional Bias for Query-Agnostic Generative Content in Search May 1, 2024 Position Text Retrieval
Code Code Available 0Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation May 1, 2024 Retrieval Text Augmentation
— Unverified 0Efficient Inverted Indexes for Approximate Retrieval over Learned Sparse Representations Apr 29, 2024 Retrieval Text Retrieval
Code Code Available 2Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment Apr 28, 2024 Cross-Modal Retrieval Image Retrieval
Code Code Available 2VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations Apr 25, 2024 Image to text Sensitivity
Code Code Available 0