| Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning | Oct 17, 2020 | RetrievalTransfer Learning | CodeCode Available | 1 | 5 |
| Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval | Dec 12, 2023 | Adversarial DefenseImage Retrieval | CodeCode Available | 1 | 5 |
| Adaptive Offline Quintuplet Loss for Image-Text Matching | Mar 7, 2020 | Image-text matchingText Matching | CodeCode Available | 1 | 5 |
| Complementary Patch for Weakly Supervised Semantic Segmentation | Aug 9, 2021 | SegmentationSemantic Segmentation | CodeCode Available | 1 | 5 |
| Communicative Message Passing for Inductive Relation Reasoning | Dec 16, 2020 | Inductive BiasInductive Relation Prediction | CodeCode Available | 1 | 5 |
| A Unified Object Motion and Affinity Model for Online Multi-Object Tracking | Mar 25, 2020 | Metric LearningMulti-Object Tracking | CodeCode Available | 1 | 5 |
| Conditional Similarity Networks | Mar 25, 2016 | Triplet | CodeCode Available | 1 | 5 |
| Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection | Jan 1, 2021 | Common Sense ReasoningGraph Generation | CodeCode Available | 1 | 5 |
| Automatic Prosody Annotation with Pre-Trained Text-Speech Model | Jun 16, 2022 | Speech Synthesistext-to-speech | CodeCode Available | 1 | 5 |
| FaceNet: A Unified Embedding for Face Recognition and Clustering | Mar 12, 2015 | ClusteringDisguised Face Verification | CodeCode Available | 1 | 5 |