Fine-tuning CLIP Text Encoders with Two-step Paraphrasing | Feb 23, 2024 | Image Captioning; Image Retrieval | Unverified 0
Weatherproofing Retrieval for Localization with Generative AI and Geometric Consistency | Feb 14, 2024 | Image Retrieval; Retrieval | Unverified 0
Large Language Models for Captioning and Retrieving Remote Sensing Images | Feb 9, 2024 | Cross-Modal Retrieval; Decoder | Unverified 0
Measuring Machine Learning Harms from Stereotypes Requires Understanding Who Is Harmed by Which Errors in What Ways | Feb 6, 2024 | Fairness; Image Retrieval | Unverified 0
Learning Semantic Proxies from Visual Prompts for Parameter-Efficient Fine-Tuning in Deep Metric Learning | Feb 4, 2024 | Image Retrieval; Metric Learning | Code Available 0
Region-Based Representations Revisited | Feb 4, 2024 | Image Retrieval; Retrieval | Code Available 1
Zero-shot sketch-based remote sensing image retrieval based on multi-level and attention-guided tokenization | Feb 3, 2024 | Cross-Modal Retrieval; Image Retrieval | Code Available 0
PICS: Pipeline for Image Captioning and Search | Feb 1, 2024 | Asset Management; Image Captioning | Unverified 0
Approximate Nearest Neighbor Search with Window Filters | Feb 1, 2024 | Image Retrieval | Code Available 1
Local Feature Matching Using Deep Learning: A Survey | Jan 31, 2024 | 3D Reconstruction; Deep Learning | Code Available 2
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Jan 29, 2024 | Decoder; Image Generation | Unverified 0
Cross-Modal Coordination Across a Diverse Set of Input Modalities | Jan 29, 2024 | Cross-Modal Retrieval; Image Retrieval | Unverified 0
Transformer-based Clipped Contrastive Quantization Learning for Unsupervised Image Retrieval | Jan 27, 2024 | Contrastive Learning; Image Retrieval | Unverified 0
Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode | Jan 24, 2024 | Image Retrieval; Information Retrieval | Unverified 0
PlaceFormer: Transformer-based Visual Place Recognition using Multi-Scale Patch Selection and Fusion | Jan 23, 2024 | Computational Efficiency; Image Retrieval | Unverified 0
CBVS: A Large-Scale Chinese Image-Text Benchmark for Real-World Short Video Search Scenarios | Jan 19, 2024 | Common Sense Reasoning; Image Retrieval | Code Available 1
Cross-Modality Perturbation Synergy Attack for Person Re-identification | Jan 18, 2024 | Image Retrieval; Person Re-Identification | Unverified 0
Siamese Content-based Search Engine for a More Transparent Skin and Breast Cancer Diagnosis through Histological Imaging | Jan 16, 2024 | Image Retrieval; Retrieval | Unverified 0
Exploring Masked Autoencoders for Sensor-Agnostic Image Retrieval in Remote Sensing | Jan 15, 2024 | Content-Based Image Retrieval; Image Retrieval | Code Available 1
HiHPQ: Hierarchical Hyperbolic Product Quantization for Unsupervised Image Retrieval | Jan 14, 2024 | Contrastive Learning; Image Retrieval | Code Available 1
On Image Search in Histopathology | Jan 14, 2024 | Image Retrieval; Prognosis | Unverified 0
Modality-Aware Representation Learning for Zero-shot Sketch-based Image Retrieval | Jan 10, 2024 | Image Retrieval; Representation Learning | Code Available 0
Analysis and Validation of Image Search Engines in Histopathology | Jan 6, 2024 | Image Retrieval; Prognosis | Code Available 0
Benchmarking PathCLIP for Pathology Image Analysis | Jan 5, 2024 | Benchmarking; Decision Making | Unverified 0
BEV-TSR: Text-Scene Retrieval in BEV Space for Autonomous Driving | Jan 2, 2024 | Autonomous Driving; Caption Generation | Unverified 0
Language-only Training of Zero-shot Composed Image Retrieval | Jan 1, 2024 | Image Retrieval; Retrieval | Code Available 2
Attribute-Guided Pedestrian Retrieval: Bridging Person Re-ID with Internal Attribute Variability | Jan 1, 2024 | Attribute; Image Retrieval | Unverified 0
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment | Jan 1, 2024 | cross-modal alignment; Cross-Modal Retrieval | Code Available 2
Leveraging Camera Triplets for Efficient and Accurate Structure-from-Motion | Jan 1, 2024 | Image Retrieval | Unverified 0
Characteristics Matching Based Hash Codes Generation for Efficient Fine-grained Image Retrieval | Jan 1, 2024 | Image Retrieval; Retrieval | Unverified 0
D3still: Decoupled Differential Distillation for Asymmetric Image Retrieval | Jan 1, 2024 | Image Retrieval; Retrieval | Code Available 2
ConCon-Chi: Concept-Context Chimera Benchmark for Personalized Vision-Language Tasks | Jan 1, 2024 | Image Retrieval | Code Available 0
Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval | Dec 31, 2023 | Face Image Retrieval; Image Retrieval | Code Available 1
Recursive Distillation for Open-Set Distributed Robot Localization | Dec 26, 2023 | Continual Learning; Image Retrieval | Unverified 0
BEV-CV: Birds-Eye-View Transform for Cross-View Geo-Localisation | Dec 23, 2023 | Camera Localization; Cross-View Geo-Localisation | Code Available 1
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks | Dec 21, 2023 | Image Retrieval; Image-to-Text Retrieval | Code Available 1
Gemini: A Family of Highly Capable Multimodal Models | Dec 19, 2023 | 1 Image, 2*2 Stitching; Arithmetic Reasoning | Code Available 1
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering | Dec 19, 2023 | Image Retrieval; Question Answering | Code Available 0
Advancing Image Retrieval with Few-Shot Learning and Relevance Feedback | Dec 18, 2023 | Binary Classification; Classification | Code Available 0
Symmetrical Bidirectional Knowledge Alignment for Zero-Shot Sketch-Based Image Retrieval | Dec 16, 2023 | Image Retrieval; Knowledge Distillation | Code Available 0
Data-Efficient Multimodal Fusion on a Single GPU | Dec 15, 2023 | GPU; Image Retrieval | Code Available 1
Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval | Dec 15, 2023 | All; Image Retrieval | Code Available 1
Training-free Zero-shot Composed Image Retrieval with Local Concept Reranking | Dec 14, 2023 | Image Retrieval; Reranking | Unverified 0
Advancements in Content-Based Image Retrieval: A Comprehensive Survey of Relevance Feedback Techniques | Dec 13, 2023 | Active Learning; Content-Based Image Retrieval | Unverified 0
C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation | Dec 13, 2023 | Image Retrieval; Pose Estimation | Unverified 0
Contextually Affinitive Neighborhood Refinery for Deep Clustering | Dec 12, 2023 | Clustering; Deep Clustering | Code Available 1
Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval | Dec 12, 2023 | Adversarial Defense; Image Retrieval | Code Available 1
Dynamic Weighted Combiner for Mixed-Modal Image Retrieval | Dec 11, 2023 | Image Retrieval; Retrieval | Code Available 0
Lite-Mind: Towards Efficient and Robust Brain Representation Network | Dec 6, 2023 | Brain Decoding; Image Retrieval | Code Available 1
FreestyleRet: Retrieving Images from Style-Diversified Queries | Dec 5, 2023 | Image Retrieval; Retrieval | Code Available 1