| GraFT: Gradual Fusion Transformer for Multimodal Re-Identification | Oct 25, 2023 | Network Pruning, Representation Learning | Unverified | 0 |
| CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction | Oct 24, 2023 | Aspect Sentiment Triplet Extraction, Aspect Term Extraction and Sentiment Classification | Code Available | 1 |
| Large Language Models are Temporal and Causal Reasoners for Video Question Answering | Oct 24, 2023 | Natural Language Understanding, Question Answering | Code Available | 1 |
| Player Re-Identification Using Body Part Appearences | Oct 23, 2023 | Pose Estimation, Triplet | Unverified | 0 |
| Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval | Oct 20, 2023 | Cross-Modal Retrieval, Retrieval | Unverified | 0 |
| Free-text Keystroke Authentication using Transformers: A Comparative Study of Architectures and Loss Functions | Oct 18, 2023 | Triplet, User Identification | Unverified | 0 |
| LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation | Oct 16, 2023 | Few-Shot Learning, Large Language Model | Code Available | 1 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choice, Triplet | Code Available | 0 |
| ESA: External Space Attention Aggregation for Image-Text Retrieval | Oct 10, 2023 | Image-text Retrieval, Retrieval | Code Available | 1 |
| Distillation Improves Visual Place Recognition for Low Quality Images | Oct 10, 2023 | Knowledge Distillation, Quantization | Code Available | 0 |
| AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Oct 8, 2023 | Re-Ranking, Triplet | Code Available | 1 |
| Music- and Lyrics-driven Dance Synthesis | Sep 30, 2023 | Triplet | Code Available | 0 |
| SCALE: Synergized Collaboration of Asymmetric Language Translation Engines | Sep 29, 2023 | Continual Learning, Translation | Code Available | 1 |
| Video-adverb retrieval with compositional adverb-action embeddings | Sep 26, 2023 | Triplet, Video-Adverb Retrieval | Code Available | 0 |
| A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention | Sep 23, 2023 | Blocking, Data Augmentation | Code Available | 1 |
| Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning | Sep 23, 2023 | Contrastive Learning, Recommendation Systems | Code Available | 1 |
| Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching | Sep 22, 2023 | Cross-modal retrieval with noisy correspondence, Memorization | Unverified | 0 |
| Bridging Sensor Gaps via Attention Gated Tuning for Hyperspectral Image Classification | Sep 22, 2023 | Computational Efficiency, Hyperspectral Image Classification | Code Available | 0 |
| Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? | Sep 22, 2023 | Semantic Similarity, Semantic Textual Similarity | Unverified | 0 |
| Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning | Sep 20, 2023 | Contrastive Learning, Retrieval | Code Available | 1 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question Answering, Chart Understanding | Code Available | 2 |
| Realistic Website Fingerprinting By Augmenting Network Trace | Sep 18, 2023 | Self-Supervised Learning, Triplet | Code Available | 1 |
| Ugly Ducklings or Swans: A Tiered Quadruplet Network with Patient-Specific Mining for Improved Skin Lesion Classification | Sep 18, 2023 | Lesion Classification, Metric Learning | Unverified | 0 |
| Unsupervised Contrast-Consistent Ranking with Language Models | Sep 13, 2023 | Language Modelling, Negation | Code Available | 0 |
| MaskRenderer: 3D-Infused Multi-Mask Realistic Face Reenactment | Sep 10, 2023 | Face Reenactment, Triplet | Unverified | 0 |
| A Probabilistic Semi-Supervised Approach with Triplet Markov Chains | Sep 7, 2023 | Bayesian Inference, Triplet | Unverified | 0 |
| Zero-Shot Scene Graph Generation via Triplet Calibration and Reduction | Sep 7, 2023 | Graph Generation, Scene Graph Generation | Code Available | 1 |
| Gene-induced Multimodal Pre-training for Image-omic Classification | Sep 6, 2023 | Classification, Triplet | Code Available | 0 |
| Graph Self-Contrast Representation Learning | Sep 5, 2023 | Contrastive Learning, Graph Representation Learning | Unverified | 0 |
| ConCur: Self-supervised graph representation based on contrastive learning with curriculum negative sampling | Sep 1, 2023 | Contrastive Learning, Graph Representation Learning | Code Available | 0 |
| Patent image retrieval using transformer-based deep metric learning | Sep 1, 2023 | Image Retrieval, Metric Learning | Code Available | 1 |
| Attention-based CT Scan Interpolation for Lesion Segmentation of Colorectal Liver Metastases | Aug 30, 2023 | Computed Tomography (CT), Lesion Segmentation | Unverified | 0 |
| CoVR-2: Automatic Data Construction for Composed Video Retrieval | Aug 28, 2023 | Composed Image Retrieval (CoIR), Composed Video Retrieval (CoVR) | Code Available | 1 |
| Towards Privacy-Supporting Fall Detection via Deep Unsupervised RGB2Depth Adaptation | Aug 23, 2023 | Domain Adaptation, Triplet | Code Available | 0 |
| Age Prediction From Face Images Via Contrastive Learning | Aug 23, 2023 | Contrastive Learning, MORPH | Unverified | 0 |
| TeD-SPAD: Temporal Distinctiveness for Self-supervised Privacy-preservation for video Anomaly Detection | Aug 21, 2023 | Anomaly Detection, Attribute | Code Available | 1 |
| Rethinking Person Re-identification from a Projection-on-Prototypes Perspective | Aug 21, 2023 | Person Re-Identification, Person Retrieval | Unverified | 0 |
| CoNe: Contrast Your Neighbours for Supervised Image Classification | Aug 21, 2023 | Classification, image-classification | Code Available | 0 |
| Noisy-Correspondence Learning for Text-to-Image Person Re-identification | Aug 19, 2023 | Person Re-Identification, Text based Person Retrieval | Code Available | 1 |
| Ranking-aware Uncertainty for Text-guided Image Retrieval | Aug 16, 2023 | Diversity, Image Retrieval | Unverified | 0 |
| One-shot lip-based biometric authentication: extending behavioral features with authentication phrase information | Aug 14, 2023 | One-Shot Learning, Triplet | Unverified | 0 |
| Compositional Feature Augmentation for Unbiased Scene Graph Generation | Aug 13, 2023 | Diversity, Graph Generation | Code Available | 1 |
| Leveraging multi-view data without annotations for prostate MRI segmentation: A contrastive approach | Aug 12, 2023 | Contrastive Learning, MRI segmentation | Unverified | 0 |
| Your Negative May not Be True Negative: Boosting Image-Text Matching with False Negative Elimination | Aug 8, 2023 | Image-text matching, Representation Learning | Code Available | 1 |
| RoadScan: A Novel and Robust Transfer Learning Framework for Autonomous Pothole Detection in Roads | Aug 7, 2023 | Computational Efficiency, object-detection | Unverified | 0 |
| T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images | Aug 4, 2023 | Change Detection, Decoder | Code Available | 1 |
| Sequential and Shared-Memory Parallel Algorithms for Partitioned Local Depths | Jul 31, 2023 | CPU, Triplet | Unverified | 0 |
| Learning Multi-modal Representations by Watching Hundreds of Surgical Video Lectures | Jul 27, 2023 | Automatic Speech Recognition, Contrastive Learning | Code Available | 1 |
| GeoTransformer: Fast and Robust Point Cloud Registration with Geometric Transformer | Jul 25, 2023 | Image to Point Cloud Registration, Point Cloud Registration | Unverified | 0 |
| Jina Embeddings: A Novel Set of High-Performance Sentence Embedding Models | Jul 20, 2023 | Negation, Retrieval | Unverified | 0 |