Towards a multimodal framework for remote sensing image change retrieval and captioning Jun 19, 2024 Change Detection Contrastive Learning
Code Code Available 0Accurate and Fast Pixel Retrieval with Spatial and Uncertainty Aware Hypergraph Diffusion Jun 17, 2024 Content-Based Image Retrieval Image Retrieval
— Unverified 0ARNet: Self-Supervised FG-SBIR with Unified Sample Feature Alignment and Multi-Scale Token Recycling Jun 17, 2024 Image Retrieval Retrieval
Code Code Available 0Multimodal Needle in a Haystack: Benchmarking Long-Context Capability of Multimodal Large Language Models Jun 17, 2024 Benchmarking
Code Code Available 2They're All Doctors: Synthesizing Diverse Counterfactuals to Mitigate Associative Bias Jun 17, 2024 All counterfactual
— Unverified 0Annotation Cost-Efficient Active Learning for Deep Metric Learning Driven Remote Sensing Image Retrieval Jun 14, 2024 Active Learning Content-Based Image Retrieval
— Unverified 0BiVLC: Extending Vision-Language Compositionality Evaluation with Text-to-Image Retrieval Jun 14, 2024 Image Retrieval Image to text
Code Code Available 0DenoiseRep: Denoising Model for Representation Learning Jun 13, 2024 Denoising Fine-Grained Image Classification
Code Code Available 1An Efficient Post-hoc Framework for Reducing Task Discrepancy of Text Encoders for Composed Image Retrieval Jun 13, 2024 Contrastive Learning Image Retrieval
Code Code Available 2Enhancing Diagnostic Accuracy in Rare and Common Fundus Diseases with a Knowledge-Rich Vision-Language Model Jun 13, 2024 Diagnostic Image Retrieval
Code Code Available 2ConceptHash: Interpretable Fine-Grained Hashing via Concept Discovery Jun 12, 2024 Image Retrieval
Code Code Available 0FakeInversion: Learning to Detect Images from Unseen Text-to-Image Models by Inverting Stable Diffusion Jun 12, 2024 Image Retrieval
— Unverified 0Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning Jun 11, 2024 Benchmarking Contrastive Learning
Code Code Available 0Fetch-A-Set: A Large-Scale OCR-Free Benchmark for Historical Document Retrieval Jun 11, 2024 Image Retrieval Image to text
— Unverified 0Image Textualization: An Automatic Framework for Creating Accurate and Detailed Image Descriptions Jun 11, 2024 Hallucination Image Description
Code Code Available 2TIGeR: Unifying Text-to-Image Generation and Retrieval with Large Multimodal Models Jun 9, 2024 counterfactual Image Generation
— Unverified 0PQPP: A Joint Benchmark for Text-to-Image Prompt and Query Performance Prediction Jun 7, 2024 Image Generation Image Retrieval
Code Code Available 0VISTA: Visualized Text Embedding For Universal Multi-Modal Retrieval Jun 6, 2024 Image Retrieval Retrieval
Code Code Available 0Interactive Text-to-Image Retrieval with Large Language Models: A Plug-and-Play Approach Jun 5, 2024 Image Retrieval Instruction Following
Code Code Available 1No Captions, No Problem: Captionless 3D-CLIP Alignment with Hard Negatives via CLIP Knowledge and LLMs Jun 4, 2024 3D Classification Cross-Modal Retrieval
— Unverified 0Scale-Free Image Keypoints Using Differentiable Persistent Homology Jun 3, 2024 Image Retrieval Keypoint Detection
Code Code Available 1Decomposing and Interpreting Image Representations via Text in ViTs Beyond CLIP Jun 3, 2024 Image Retrieval
Code Code Available 1Reverse Image Retrieval Cues Parametric Memory in Multimodal LLMs May 29, 2024 Image Retrieval Question Answering
Code Code Available 1ContextBLIP: Doubly Contextual Alignment for Contrastive Image Retrieval from Linguistically Complex Descriptions May 29, 2024 Image Retrieval Retrieval
— Unverified 0SketchTriplet: Self-Supervised Scenarized Sketch-Text-Image Triplet Generation May 29, 2024 Image Generation Image Retrieval
— Unverified 0Multi-Modal Generative Embedding Model May 29, 2024 Caption Generation Cross-Modal Retrieval
— Unverified 0CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval May 29, 2024 Cross-Modal Retrieval Image Retrieval
Code Code Available 1AdapNet: Adaptive Noise-Based Network for Low-Quality Image Retrieval May 28, 2024 Image Retrieval Re-Ranking
— Unverified 0Composed Image Retrieval for Remote Sensing May 24, 2024 Composed Image Retrieval (CoIR) Descriptive
Code Code Available 2Self-distilled Dynamic Fusion Network for Language-based Fashion Retrieval May 24, 2024 Image Retrieval Retrieval
— Unverified 0EMR-Merging: Tuning-Free High-Performance Model Merging May 23, 2024 Image Classification Image Retrieval
Code Code Available 2Thesis: Document Summarization with applications to Keyword extraction and Image Retrieval May 20, 2024 Articles Document Summarization
— Unverified 0FFF: Fixing Flawed Foundations in contrastive pre-training results in very strong Vision-Language models May 16, 2024 Diversity Image Retrieval
— Unverified 0Content-Based Image Retrieval for Multi-Class Volumetric Radiology Images: A Benchmark Study May 15, 2024 Content-Based Image Retrieval Image Retrieval
— Unverified 0HybridHash: Hybrid Convolutional and Self-Attention Deep Hashing for Image Retrieval May 13, 2024 Deep Hashing Image Retrieval
Code Code Available 1Leveraging Medical Foundation Model Features in Graph Neural Network-Based Retrieval of Breast Histopathology Images May 7, 2024 Contrastive Learning Diagnostic
— Unverified 0Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval May 6, 2024 Image Retrieval Language Modeling
— Unverified 0A New Robust Partial p-Wasserstein-Based Metric for Comparing Distributions May 6, 2024 Image Retrieval Sensitivity
— Unverified 0Knowledge-aware Text-Image Retrieval for Remote Sensing Images May 6, 2024 Diversity Earth Observation
— Unverified 0iSEARLE: Improving Textual Inversion for Zero-Shot Composed Image Retrieval May 5, 2024 Benchmarking Composed Image Retrieval (CoIR)
Code Code Available 2What matters when building vision-language models? May 3, 2024 1 Image, 2*2 Stitching Image Retrieval
— Unverified 0Spherical Linear Interpolation and Text-Anchoring for Zero-shot Composed Image Retrieval May 1, 2024 Image Retrieval Retrieval
— Unverified 0Large Language Model Informed Patent Image Retrieval Apr 30, 2024 Image Retrieval Language Modeling
— Unverified 0Enhancing Interactive Image Retrieval With Query Rewriting Using Large Language Models and Vision Language Models Apr 29, 2024 Image Retrieval Language Modeling
— Unverified 0Dual-Modal Prompting for Sketch-Based Image Retrieval Apr 29, 2024 Image Retrieval Retrieval
— Unverified 0Semantic Line Combination Detector Apr 29, 2024 Image Retrieval Retrieval
Code Code Available 1Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment Apr 28, 2024 Cross-Modal Retrieval Image Retrieval
Code Code Available 2Learning text-to-video retrieval from image captioning Apr 26, 2024 Image Captioning Image Retrieval
— Unverified 0CriSp: Leveraging Tread Depth Maps for Enhanced Crime-Scene Shoeprint Matching Apr 25, 2024 Benchmarking Data Augmentation
Code Code Available 0Revisiting Relevance Feedback for CLIP-based Interactive Image Retrieval Apr 25, 2024 Image Retrieval Metric Learning
— Unverified 0