Towards a text-based quantitative and explainable histopathology image analysis | Jul 10, 2024 | image-classification Image Classification | Code available
CosmoCLIP: Generalizing Large Vision-Language Models for Astronomical Imaging | Jul 10, 2024 | Contrastive Learning Image-text Retrieval | Unverified
EA-VTR: Event-Aware Video-Text Retrieval | Jul 10, 2024 | Action Recognition Contrastive Learning | Unverified
CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding | Jul 9, 2024 | Contrastive Learning Domain Adaptation | Unverified
Neurocache: Efficient Vector Retrieval for Long-range Language Modeling | Jul 2, 2024 | Few-Shot Learning Language Modeling | Code available
Memory^3: Language Modeling with Explicit Memory | Jul 1, 2024 | Language Modeling Language Modelling | Unverified
PathAlign: A vision-language model for whole slide images in histopathology | Jun 27, 2024 | Diagnostic Image Retrieval | Unverified
Improving the Consistency in Cross-Lingual Cross-Modal Retrieval with 1-to-K Contrastive Learning | Jun 26, 2024 | Contrastive Learning Cross-Modal Retrieval | Code available
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling | Jun 25, 2024 | Cross-Modal Retrieval Natural Language Queries | Unverified
Evaluating D-MERIT of Partial-annotation on Information Retrieval | Jun 23, 2024 | Information Retrieval Passage Retrieval | Unverified
Multi-Scale Temporal Difference Transformer for Video-Text Retrieval | Jun 23, 2024 | Retrieval Text Retrieval | Unverified
RE-AdaptIR: Improving Information Retrieval through Reverse Engineered Adaptation | Jun 20, 2024 | Information Retrieval Retrieval | Unverified
Symmetric Multi-Similarity Loss for EPIC-KITCHENS-100 Multi-Instance Retrieval Challenge 2024 | Jun 18, 2024 | Ensemble Learning Multi-Instance Retrieval | Code available
News Without Borders: Domain Adaptation of Multilingual Sentence Embeddings for Cross-lingual News Recommendation | Jun 18, 2024 | Cross-Lingual Transfer Domain Adaptation | Code available
Unifying Multimodal Retrieval via Document Screenshot Embedding | Jun 17, 2024 | Language Modelling Natural Questions | Unverified
BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval | Jun 14, 2024 | Image Retrieval Image to text | Code available
Enhancing Knowledge Retrieval with In-Context Learning and Semantic Search through Generative AI | Jun 13, 2024 | In-Context Learning Information Retrieval | Unverified
Which Country Is This? Automatic Country Ranking of Street View Photos | Jun 11, 2024 | Retrieval Text Retrieval | Code available
Beat: Bi-directional One-to-Many Embedding Alignment for Text-based Person Retrieval | Jun 9, 2024 | Image-text Retrieval Person Retrieval | Unverified
Diving Deep into the Motion Representation of Video-Text Models | Jun 7, 2024 | Retrieval Text Retrieval | Code available
A Bi-metric Framework for Fast Similarity Search | Jun 5, 2024 | MTEB Benchmark Re-Ranking | Code available
HENASY: Learning to Assemble Scene-Entities for Egocentric Video-Language Model | Jun 1, 2024 | Action Recognition Activity Recognition | Unverified
Jina CLIP: Your CLIP Model Is Also Your Text Retriever | May 30, 2024 | Information Retrieval Retrieval | Unverified
Uncertainty-aware sign language video retrieval with probability distribution modeling | May 30, 2024 | Retrieval Sign Language Retrieval | Unverified
Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training | May 30, 2024 | Image-text Retrieval Language Modeling | Unverified
Multimodal Adversarial Defense for Vision-Language Models by Leveraging One-To-Many Relationships | May 29, 2024 | Adversarial Defense Adversarial Robustness | Unverified
Multilingual Diversity Improves Vision-Language Representations | May 27, 2024 | Diversity Text Retrieval | Unverified
Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning | May 26, 2024 | Image to text Image-to-Text Retrieval | Unverified
Active Learning for Finely-Categorized Image-Text Retrieval by Selecting Hard Negative Unpaired Samples | May 25, 2024 | Active Learning Image-text Retrieval | Unverified
An Empirical Study of Excitation and Aggregation Design Adaptions in CLIP4Clip for Video-Text Retrieval | May 25, 2024 | Retrieval Text Retrieval | Unverified
Global–Local Information Soft-Alignment for Cross-Modal Remote-Sensing Image–Text Retrieval | May 14, 2024 | Cross-Modal Retrieval Cross-Modal Retrieval on RSITMD | Unverified
RETTA: Retrieval-Enhanced Test-Time Adaptation for Zero-Shot Video Captioning | May 11, 2024 | Image-text matching Retrieval | Unverified
Explaining Text Similarity in Transformer Models | May 10, 2024 | Information Retrieval Retrieval | Code available
ProCIS: A Benchmark for Proactive Retrieval in Conversations | May 10, 2024 | Retrieval Text Retrieval | Code available
Refining Joint Text and Source Code Embeddings for Retrieval Task with Parameter-Efficient Fine-Tuning | May 7, 2024 | Benchmarking Contrastive Learning | Code available
Exploiting Positional Bias for Query-Agnostic Generative Content in Search | May 1, 2024 | Position Text Retrieval | Code available
Distance Sampling-based Paraphraser Leveraging ChatGPT for Text Data Manipulation | May 1, 2024 | Retrieval Text Augmentation | Unverified
VISLA Benchmark: Evaluating Embedding Sensitivity to Semantic and Lexical Alterations | Apr 25, 2024 | Image to text Sensitivity | Code available
UrbanCross: Enhancing Satellite Image-Text Retrieval with Cross-Domain Adaptation | Apr 22, 2024 | Diversity Domain Adaptation | Unverified
MindTuner: Cross-Subject Visual Decoding with Visual Fingerprint and Semantic Correction | Apr 19, 2024 | Image Reconstruction Text Retrieval | Unverified
FecTek: Enhancing Term Weight in Lexicon-Based Retrieval with Feature Context and Term-level Knowledge | Apr 18, 2024 | Contrastive Learning Retrieval | Unverified
TEXT2TASTE: A Versatile Egocentric Vision System for Intelligent Reading Assistance Using Large Language Model | Apr 14, 2024 | Language Modeling Language Modelling | Unverified
Learning with Noisy Correspondence | Apr 13, 2024 | Cross-Modal Retrieval Cross-modal retrieval with noisy correspondence | Unverified
HaVTR: Improving Video-Text Retrieval Through Augmentation Using Large Foundation Models | Apr 7, 2024 | Hallucination Representation Learning | Unverified
Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement | Apr 6, 2024 | Image-text Retrieval object-detection | Unverified
Shallow Cross-Encoders for Low-Latency Retrieval | Mar 29, 2024 | CPU GPU | Code available
Denoising Table-Text Retrieval for Open-Domain Question Answering | Mar 26, 2024 | Denoising Open-Domain Question Answering | Code available
Improving Retrieval for RAG based Question Answering Models on Financial Documents | Mar 23, 2024 | Chunking Question Answering | Unverified
Refining Knowledge Transfer on Audio-Image Temporal Agreement for Audio-Text Cross Retrieval | Mar 16, 2024 | Image Retrieval Retrieval | Unverified
Improving Adversarial Transferability of Vision-Language Pre-training Models through Collaborative Multimodal Interaction | Mar 16, 2024 | Adversarial Robustness Image-text Retrieval | Unverified