| GraFT: Gradual Fusion Transformer for Multimodal Re-Identification | Oct 25, 2023 | Network PruningRepresentation Learning | —Unverified | 0 |
| CONTRASTE: Supervised Contrastive Pre-training With Aspect-based Prompts For Aspect Sentiment Triplet Extraction | Oct 24, 2023 | Aspect Sentiment Triplet ExtractionAspect Term Extraction and Sentiment Classification | CodeCode Available | 1 |
| Large Language Models are Temporal and Causal Reasoners for Video Question Answering | Oct 24, 2023 | Natural Language UnderstandingQuestion Answering | CodeCode Available | 1 |
| Player Re-Identification Using Body Part Appearences | Oct 23, 2023 | Pose EstimationTriplet | —Unverified | 0 |
| Two-Stage Triplet Loss Training with Curriculum Augmentation for Audio-Visual Retrieval | Oct 20, 2023 | Cross-Modal RetrievalRetrieval | —Unverified | 0 |
| Free-text Keystroke Authentication using Transformers: A Comparative Study of Architectures and Loss Functions | Oct 18, 2023 | TripletUser Identification | —Unverified | 0 |
| LLM4SGG: Large Language Models for Weakly Supervised Scene Graph Generation | Oct 16, 2023 | Few-Shot LearningLarge Language Model | CodeCode Available | 1 |
| KGQuiz: Evaluating the Generalization of Encoded Knowledge in Large Language Models | Oct 15, 2023 | Multiple-choiceTriplet | CodeCode Available | 0 |
| ESA: External Space Attention Aggregation for Image-Text Retrieval | Oct 10, 2023 | Image-text RetrievalRetrieval | CodeCode Available | 1 |
| Distillation Improves Visual Place Recognition for Low Quality Images | Oct 10, 2023 | Knowledge DistillationQuantization | CodeCode Available | 0 |