| AsyReC: A Multimodal Graph-based Framework for Spatio-Temporal Asymmetric Dyadic Relationship Classification | Apr 7, 2025 | Graph Neural Network, Triplet | Code Available | 0 |
| Scaling Prompt Instructed Zero Shot Composed Image Retrieval with Image-Only Data | Apr 1, 2025 | Image Retrieval, Retrieval | Unverified | 0 |
| LANID: LLM-assisted New Intent Discovery | Mar 31, 2025 | Intent Discovery, Task-Oriented Dialogue Systems | Code Available | 0 |
| Enhancing Learnable Descriptive Convolutional Vision Transformer for Face Anti-Spoofing | Mar 29, 2025 | Descriptive, Domain Generalization | Code Available | 0 |
| Tune It Up: Music Genre Transfer and Prediction | Mar 27, 2025 | Music Genre Transfer, Music Style Transfer | Code Available | 0 |
| Learnable Sequence Augmenter for Triplet Contrastive Learning in Sequential Recommendation | Mar 26, 2025 | Contrastive Learning, Self-Supervised Learning | Unverified | 0 |
| fine-CLIP: Enhancing Zero-Shot Fine-Grained Surgical Action Recognition with Vision-Language Models | Mar 25, 2025 | Action Recognition, Surgical phase recognition | Unverified | 0 |
| A-MESS: Anchor based Multimodal Embedding with Semantic Synchronization for Multimodal Intent Recognition | Mar 25, 2025 | Contrastive Learning, Intent Recognition | Unverified | 0 |
| CoSimGen: Controllable Diffusion Model for Simultaneous Image and Mask Generation | Mar 25, 2025 | Triplet | Unverified | 0 |
| EMPLACE: Self-Supervised Urban Scene Change Detection | Mar 22, 2025 | Change Detection, Scene Change Detection | Code Available | 0 |