| GraFT: Gradual Fusion Transformer for Multimodal Re-Identification | Oct 25, 2023 | Network PruningRepresentation Learning | —Unverified | 0 |
| CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction | Oct 24, 2023 | Aspect Sentiment Triplet ExtractionAspect Term Extraction and Sentiment Classification | CodeCode Available | 1 |
| Large Language Models are Temporal and Causal Reasoners for Video Question Answering | Oct 24, 2023 | Natural Language UnderstandingQuestion Answering | CodeCode Available | 1 |
| Player Re-Identification Using Body Part Appearences | Oct 23, 2023 | Pose EstimationTriplet | —Unverified | 0 |
| Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval | Oct 20, 2023 | Cross-Modal RetrievalRetrieval | —Unverified | 0 |
| Free-text Keystroke Authentication using Transformers: A Comparative Study of Architectures and Loss Functions | Oct 18, 2023 | TripletUser Identification | —Unverified | 0 |
| LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation | Oct 16, 2023 | Few-Shot LearningLarge Language Model | CodeCode Available | 1 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choiceTriplet | CodeCode Available | 0 |
| ESA: External Space Attention Aggregation for Image-Text Retrieval | Oct 10, 2023 | Image-text RetrievalRetrieval | CodeCode Available | 1 |
| Distillation Improves Visual Place Recognition for Low Quality Images | Oct 10, 2023 | Knowledge DistillationQuantization | CodeCode Available | 0 |
| AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Oct 8, 2023 | Re-RankingTriplet | CodeCode Available | 1 |
| Music- and Lyrics-driven Dance Synthesis | Sep 30, 2023 | Triplet | CodeCode Available | 0 |
| SCALE: Synergized Collaboration of Asymmetric Language Translation Engines | Sep 29, 2023 | Continual LearningTranslation | CodeCode Available | 1 |
| Video-adverb retrieval with compositional adverb-action embeddings | Sep 26, 2023 | TripletVideo-Adverb Retrieval | CodeCode Available | 0 |
| A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention | Sep 23, 2023 | BlockingData Augmentation | CodeCode Available | 1 |
| Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning | Sep 23, 2023 | Contrastive LearningRecommendation Systems | CodeCode Available | 1 |
| Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching | Sep 22, 2023 | Cross-modal retrieval with noisy correspondenceMemorization | —Unverified | 0 |
| Bridging Sensor Gaps via Attention Gated Tuning for Hyperspectral Image Classification | Sep 22, 2023 | Computational EfficiencyHyperspectral Image Classification | CodeCode Available | 0 |
| Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? | Sep 22, 2023 | Semantic SimilaritySemantic Textual Similarity | —Unverified | 0 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question AnsweringChart Understanding | CodeCode Available | 2 |
| Dual-Modal Attention-Enhanced Text-Video Retrieval with Triplet Partial Margin Contrastive Learning | Sep 20, 2023 | Contrastive LearningRetrieval | CodeCode Available | 1 |
| Realistic Website Fingerprinting By Augmenting Network Trace | Sep 18, 2023 | Self-Supervised LearningTriplet | CodeCode Available | 1 |
| Ugly Ducklings or Swans: A Tiered Quadruplet Network with Patient-Specific Mining for Improved Skin Lesion Classification | Sep 18, 2023 | Lesion ClassificationMetric Learning | —Unverified | 0 |
| Unsupervised Contrast-Consistent Ranking with Language Models | Sep 13, 2023 | Language ModellingNegation | CodeCode Available | 0 |
| MaskRenderer: 3D-Infused Multi-Mask Realistic Face Reenactment | Sep 10, 2023 | Face ReenactmentTriplet | —Unverified | 0 |