| Multi-level Distance Regularization for Deep Metric Learning | Feb 8, 2021 | Metric LearningRetrieval | CodeCode Available | 1 |
| Blind Speech Separation and Dereverberation using Neural Beamforming | Mar 24, 2021 | Speaker IdentificationSpeaker Separation | CodeCode Available | 1 |
| MSCMNet: Multi-scale Semantic Correlation Mining for Visible-Infrared Person Re-Identification | Nov 24, 2023 | Person Re-IdentificationTriplet | CodeCode Available | 1 |
| Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model | May 20, 2023 | DiversityHuman-Object Interaction Detection | CodeCode Available | 1 |
| Neural-Logic Human-Object Interaction Detection | Nov 16, 2023 | DecoderHuman-Object Interaction Detection | CodeCode Available | 1 |
| No Fuss Distance Metric Learning using Proxies | Mar 21, 2017 | Metric LearningSemantic Similarity | CodeCode Available | 1 |
| Adjacency List Oriented Relational Fact Extraction via Adaptive Multi-task Learning | Jun 3, 2021 | Multi-Task LearningRelation Extraction | CodeCode Available | 1 |
| Deep Animation Video Interpolation in the Wild | Apr 6, 2021 | Optical Flow EstimationTriplet | CodeCode Available | 1 |
| One-shot Scene Graph Generation | Feb 22, 2022 | Graph GenerationScene Graph Generation | CodeCode Available | 1 |
| On Metric Learning for Audio-Text Cross-Modal Retrieval | Mar 29, 2022 | AudioCapsCross-Modal Retrieval | CodeCode Available | 1 |
| Open-Vocabulary Point-Cloud Object Detection without 3D Annotation | Apr 3, 2023 | 3D Object Detection3D Open-Vocabulary Object Detection | CodeCode Available | 1 |
| A Robustly Optimized BMRC for Aspect Sentiment Triplet Extraction | Jul 1, 2022 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |
| Deep Attention-guided Graph Clustering with Dual Self-supervision | Nov 10, 2021 | ClusteringDeep Attention | CodeCode Available | 1 |
| PADS: Policy-Adapted Sampling for Visual Similarity Learning | Mar 24, 2020 | Metric LearningReinforcement Learning | CodeCode Available | 1 |
| Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement | Jan 6, 2022 | DisentanglementDomain Adaptation | CodeCode Available | 1 |
| Patent Image Retrieval Using Cross-entropy-based Metric Learning | Feb 20, 2023 | Image RetrievalMetric Learning | CodeCode Available | 1 |
| ByteCover: Cover Song Identification via Multi-Loss Training | Oct 27, 2020 | Cover song identificationTriplet | CodeCode Available | 1 |
| CaLa: Complementary Association Learning for Augmenting Composed Image Retrieval | May 29, 2024 | Cross-Modal RetrievalImage Retrieval | CodeCode Available | 1 |
| Adversarial Attack and Defense in Deep Ranking | Jun 7, 2021 | Adversarial AttackAdversarial Robustness | CodeCode Available | 1 |
| Candidate Set Re-ranking for Composed Image Retrieval with Dual Multi-modal Encoder | May 25, 2023 | Composed Image Retrieval (CoIR)Image Retrieval | CodeCode Available | 1 |
| PointNetVLAD: Deep Point Cloud Based Retrieval for Large-Scale Place Recognition | Apr 10, 2018 | 3D Place RecognitionPoint Cloud Retrieval | CodeCode Available | 1 |
| Polar R-CNN: End-to-End Lane Detection with Fewer Anchors | Nov 3, 2024 | Autonomous DrivingLane Detection | CodeCode Available | 1 |
| Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet Datasets | Apr 11, 2022 | Action Triplet RecognitionBenchmarking | CodeCode Available | 1 |
| DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images | Apr 28, 2025 | Generative Adversarial Networkparameter-efficient fine-tuning | CodeCode Available | 1 |
| Deep Cosine Metric Learning for Person Re-Identification | Dec 2, 2018 | General ClassificationMetric Learning | CodeCode Available | 1 |
| CenterNet: Keypoint Triplets for Object Detection | Apr 17, 2019 | Objectobject-detection | CodeCode Available | 1 |
| Cross-Modal Retrieval for Motion and Text via DopTriple Loss | May 7, 2023 | Cross-Modal RetrievalRetrieval | CodeCode Available | 1 |
| RankDNN: Learning to Rank for Few-shot Learning | Nov 28, 2022 | Few-Shot Learningimage-classification | CodeCode Available | 1 |
| Realistic Website Fingerprinting By Augmenting Network Trace | Sep 18, 2023 | Self-Supervised LearningTriplet | CodeCode Available | 1 |
| Change detection needs change information: improving deep 3D point cloud change detection | Apr 25, 2023 | Change DetectionTriplet | CodeCode Available | 1 |
| REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Mar 10, 2025 | Instruction FollowingKeypoint Detection | CodeCode Available | 1 |
| Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement | Dec 21, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| CholecTriplet2021: A benchmark challenge for surgical action triplet recognition | Apr 10, 2022 | Action DetectionAction Triplet Recognition | CodeCode Available | 1 |
| CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection | Feb 13, 2023 | Action Triplet DetectionAction Triplet Recognition | CodeCode Available | 1 |
| CIDEr: Consensus-based Image Description Evaluation | Nov 20, 2014 | Action RecognitionAttribute | CodeCode Available | 1 |
| Circle Loss: A Unified Perspective of Pair Similarity Optimization | Feb 25, 2020 | Face RecognitionFace Verification | CodeCode Available | 1 |
| RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition | Apr 24, 2021 | Image CaptioningObject Recognition | CodeCode Available | 1 |
| Rendezvous: Attention Mechanisms for the Recognition of Surgical Action Triplets in Endoscopic Videos | Sep 7, 2021 | Action Triplet RecognitionTriplet | CodeCode Available | 1 |
| AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Oct 8, 2023 | Re-RankingTriplet | CodeCode Available | 1 |
| Instruct-ReID: A Multi-purpose Person Re-identification Task with Instructions | Jun 13, 2023 | Person Re-IdentificationTriplet | CodeCode Available | 1 |
| Revisiting Deep Audio-Text Retrieval Through the Lens of Transportation | May 16, 2024 | AudioCapsEvent Detection | CodeCode Available | 1 |
| Rotate to Attend: Convolutional Triplet Attention Module | Oct 6, 2020 | image-classificationImage Classification | CodeCode Available | 1 |
| SCALE: Synergized Collaboration of Asymmetric Language Translation Engines | Sep 29, 2023 | Continual LearningTranslation | CodeCode Available | 1 |
| Cluster-level Feature Alignment for Person Re-identification | Aug 15, 2020 | Person Re-IdentificationTriplet | CodeCode Available | 1 |
| ClusterLLM: Large Language Models as a Guide for Text Clustering | May 24, 2023 | ClusteringLanguage Modelling | CodeCode Available | 1 |
| Self-Generated Defocus Blur Detection via Dual Adversarial Discriminators | Jun 19, 2021 | Defocus Blur DetectionTriplet | CodeCode Available | 1 |
| Compositional Feature Augmentation for Unbiased Scene Graph Generation | Aug 13, 2023 | DiversityGraph Generation | CodeCode Available | 1 |
| Self-Supervised Transformer for Sparse and Irregularly Sampled Multivariate Clinical Time-Series | Jul 29, 2021 | ImputationMortality Prediction | CodeCode Available | 1 |
| Semi-Supervised Keypoint Detector and Descriptor for Retinal Image Matching | Jul 16, 2022 | Image RegistrationTriplet | CodeCode Available | 1 |
| CuratorNet: Visually-aware Recommendation of Art Images | Sep 9, 2020 | Triplet | CodeCode Available | 1 |