| Blind Speech Separation and Dereverberation using Neural Beamforming | Mar 24, 2021 | Speaker IdentificationSpeaker Separation | CodeCode Available | 1 |
| DenserNet: Weakly Supervised Visual Localization Using Multi-scale Feature Aggregation | Dec 4, 2020 | Image RetrievalRetrieval | CodeCode Available | 1 |
| Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics | Feb 1, 2022 | Human-Object Interaction DetectionObject | CodeCode Available | 1 |
| Differentiable Registration of Images and LiDAR Point Clouds with VoxelPoint-to-Pixel Matching | Dec 7, 2023 | Triplet | CodeCode Available | 1 |
| DistilProtBert: A distilled protein language model used to distinguish between real proteins and their randomly shuffled counterparts | May 10, 2022 | Dimensionality ReductionKnowledge Distillation | CodeCode Available | 1 |
| Domain-invariant Similarity Activation Map Contrastive Learning for Retrieval-based Long-term Visual Localization | Sep 16, 2020 | Autonomous DrivingContrastive Learning | CodeCode Available | 1 |
| Collapse-Aware Triplet Decoupling for Adversarially Robust Image Retrieval | Dec 12, 2023 | Adversarial DefenseImage Retrieval | CodeCode Available | 1 |
| Attribute-aware Identity-hard Triplet Loss for Video-based Person Re-identification | Jun 13, 2020 | AttributeMetric Learning | CodeCode Available | 1 |
| 3D-CSL: self-supervised 3D context similarity learning for Near-Duplicate Video Retrieval | Nov 10, 2022 | RetrievalSelf-Supervised Learning | CodeCode Available | 1 |
| EndoViT: pretraining vision transformers on a large collection of endoscopic images | Apr 3, 2024 | Action Triplet RecognitionSegmentation | CodeCode Available | 1 |
| Audio-based Near-Duplicate Video Retrieval with Audio Similarity Learning | Oct 17, 2020 | RetrievalTransfer Learning | CodeCode Available | 1 |
| Enhanced Multi-Channel Graph Convolutional Network for Aspect Sentiment Triplet Extraction | May 1, 2022 | Aspect Sentiment Triplet ExtractionRelation | CodeCode Available | 1 |
| Adaptive Offline Quintuplet Loss for Image-Text Matching | Mar 7, 2020 | Image-text matchingText Matching | CodeCode Available | 1 |
| Event-level Knowledge Editing | Feb 20, 2024 | knowledge editingTriplet | CodeCode Available | 1 |
| ClusterLLM: Large Language Models as a Guide for Text Clustering | May 24, 2023 | ClusteringLanguage Modelling | CodeCode Available | 1 |
| A Unified Object Motion and Affinity Model for Online Multi-Object Tracking | Mar 25, 2020 | Metric LearningMulti-Object Tracking | CodeCode Available | 1 |
| CoLLM: A Large Language Model for Composed Image Retrieval | Mar 25, 2025 | Image RetrievalLanguage Modeling | CodeCode Available | 1 |
| Feature Re-Learning with Data Augmentation for Video Relevance Prediction | Apr 8, 2020 | Data AugmentationRetrieval | CodeCode Available | 1 |
| Automatic Prosody Annotation with Pre-Trained Text-Speech Model | Jun 16, 2022 | Speech Synthesistext-to-speech | CodeCode Available | 1 |
| Few-Shot Text Classification with Triplet Networks, Data Augmentation, and Curriculum Learning | Mar 12, 2021 | ClassificationData Augmentation | CodeCode Available | 1 |
| Compositional Feature Augmentation for Unbiased Scene Graph Generation | Aug 13, 2023 | DiversityGraph Generation | CodeCode Available | 1 |
| Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet Datasets | Apr 11, 2022 | Action Triplet RecognitionBenchmarking | CodeCode Available | 1 |
| AMC-Loss: Angular Margin Contrastive Loss for Improved Explainability in Image Classification | Apr 21, 2020 | General Classificationimage-classification | CodeCode Available | 1 |
| CholecTriplet2022: Show me a tool and tell me the triplet -- an endoscopic vision challenge for surgical action triplet detection | Feb 13, 2023 | Action Triplet DetectionAction Triplet Recognition | CodeCode Available | 1 |
| A Robustly Optimized BMRC for Aspect Sentiment Triplet Extraction | Jul 1, 2022 | Aspect-Based Sentiment AnalysisAspect-Based Sentiment Analysis (ABSA) | CodeCode Available | 1 |