AnyLoc: Towards Universal Visual Place Recognition Aug 1, 2023 Image Retrieval Visual Place Recognition
Code Code Available 2RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing Jun 20, 2023 Cross-Modal Retrieval Image Retrieval
Code Code Available 2Generating Images with Multimodal Language Models May 26, 2023 Decoder Image Generation
Code Code Available 2InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning May 11, 2023 1 Image, 2*2 Stitching Diversity
Code Code Available 2Unicom: Universal and Compact Representation Learning for Image Retrieval Apr 12, 2023 Image Classification Image Retrieval
Code Code Available 2MixVPR: Feature Mixing for Visual Place Recognition Mar 3, 2023 Autonomous Driving Image Retrieval
Code Code Available 2Grounding Language Models to Images for Multimodal Inputs and Outputs Jan 31, 2023 Image Retrieval In-Context Learning
Code Code Available 2Text2Poster: Laying out Stylized Texts on Retrieved Images Jan 6, 2023 Image Retrieval Layout Design
Code Code Available 2Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark Nov 24, 2022 2D Object Detection Image Retrieval
Code Code Available 2CLEAR: A Fully User-side Image Search System Jun 17, 2022 Image Retrieval Privacy Preserving
Code Code Available 2Investigating the Role of Image Retrieval for Visual Localization -- An exhaustive benchmark May 31, 2022 Autonomous Driving Camera Pose Estimation
Code Code Available 2Fine-grained Image Captioning with CLIP Reward May 26, 2022 Caption Generation Descriptive
Code Code Available 2Exploring a Fine-Grained Multiscale Method for Cross-Modal Remote Sensing Image Retrieval Apr 21, 2022 Cross-Modal Retrieval Image Retrieval
Code Code Available 2Rethinking Visual Geo-localization for Large-Scale Applications Apr 5, 2022 Contrastive Learning geo-localization
Code Code Available 2WIT: Wikipedia-based Image Text Dataset for Multimodal Multilingual Machine Learning Mar 2, 2021 BIG-bench Machine Learning Image Retrieval
Code Code Available 2Objectron: A Large Scale Dataset of Object-Centric Videos in the Wild with Pose Annotations Dec 18, 2020 3D Object Detection 3D Object Tracking
Code Code Available 2FastReID: A Pytorch Toolbox for General Instance Re-identification Jun 4, 2020 Face Recognition GPU
Code Code Available 2PyRetri: A PyTorch-based Library for Unsupervised Image Retrieval by Deep Convolutional Neural Networks May 2, 2020 Content-Based Image Retrieval Deep Learning
Code Code Available 2Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks Apr 13, 2020 Cross-Modal Retrieval Image Captioning
Code Code Available 2RadiomicsRetrieval: A Customizable Framework for Medical Image Retrieval Using Radiomics Features Jul 11, 2025 Contrastive Learning Image Retrieval
Code Code Available 1ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval May 27, 2025 Image Retrieval Retrieval
Code Code Available 1Visualized Text-to-Image Retrieval May 26, 2025 Image Retrieval Question Answering
Code Code Available 1One Surrogate to Fool Them All: Universal, Transferable, and Targeted Adversarial Attacks with CLIP May 26, 2025 All Image Retrieval
Code Code Available 1Focus on Local: Finding Reliable Discriminative Regions for Visual Place Recognition Apr 14, 2025 Computational Efficiency Image Retrieval
Code Code Available 1LOCORE: Image Re-ranking with Long-Context Sequence Modeling Mar 27, 2025 Image Retrieval Re-Ranking
Code Code Available 1FineCIR: Explicit Parsing of Fine-Grained Modification Semantics for Composed Image Retrieval Mar 27, 2025 Image Retrieval Retrieval
Code Code Available 1CoLLM: A Large Language Model for Composed Image Retrieval Mar 25, 2025 Image Retrieval Language Modeling
Code Code Available 1Fine-grained Textual Inversion Network for Zero-Shot Composed Image Retrieval Mar 25, 2025 Attribute Image Retrieval
Code Code Available 1Missing Target-Relevant Information Prediction with World Model for Accurate Zero-Shot Composed Image Retrieval Mar 21, 2025 Attribute Image Retrieval
Code Code Available 1Scale Efficient Training for Large Datasets Mar 17, 2025 geo-localization Image Retrieval
Code Code Available 1ImageScope: Unifying Language-Guided Image Retrieval via Large Multimodal Model Collective Reasoning Mar 13, 2025 Image Retrieval Retrieval
Code Code Available 1VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Mar 13, 2025 Image Retrieval Math
Code Code Available 1ILIAS: Instance-Level Image retrieval At Scale Feb 17, 2025 Benchmarking Image Retrieval
Code Code Available 1Composite Sketch+Text Queries for Retrieving Objects with Elusive Names and Complex Interactions Feb 12, 2025 Contrastive Learning Image Retrieval
Code Code Available 1A Flexible Plug-and-Play Module for Generating Variable-Length Dec 12, 2024 Deep Hashing Image Retrieval
Code Code Available 1IMPACT: A Large-scale Integrated Multimodal Patent Analysis and Creation Dataset for Design Patents Dec 10, 2024 Cross-Modal Retrieval Image Classification
Code Code Available 1Composed Image Retrieval for Training-Free Domain Conversion Dec 4, 2024 Image Retrieval Language Modeling
Code Code Available 1Image Generation Diversity Issues and How to Tame Them Nov 25, 2024 Diversity Image Generation
Code Code Available 1Globally Correlation-Aware Hard Negative Generation Nov 20, 2024 Image Retrieval Metric Learning
Code Code Available 1Nearest Neighbor Normalization Improves Multimodal Retrieval Oct 31, 2024 Cross-Modal Retrieval Image Captioning
Code Code Available 1CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Oct 2, 2024 Astronomy Image Quality Assessment
Code Code Available 1Integrating Visual and Textual Inputs for Searching Large-Scale Map Collections with CLIP Oct 2, 2024 Image Retrieval
Code Code Available 1VLAD-BuFF: Burst-aware Fast Feature Aggregation for Visual Place Recognition Sep 28, 2024 Image Retrieval Visual Localization
Code Code Available 1SpaGBOL: Spatial-Graph-Based Orientated Localisation Sep 23, 2024 Camera Localization Cross-View Geo-Localisation
Code Code Available 1Efficient and Discriminative Image Feature Extraction for Universal Image Retrieval Sep 20, 2024 Image Retrieval Metric Learning
Code Code Available 1NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval Sep 4, 2024 Image Retrieval RAG
Code Code Available 1UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation Aug 21, 2024 Image Generation Image Retrieval
Code Code Available 1AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval Aug 6, 2024 Image Retrieval Re-Ranking
Code Code Available 1Visual Haystacks: A Vision-Centric Needle-In-A-Haystack Benchmark Jul 18, 2024 GPU Image Retrieval
Code Code Available 1No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations Jul 15, 2024 All Image Retrieval
Code Code Available 1