| AANet: Aggregation and Alignment Network with Semi-hard Positive Sample Mining for Hierarchical Place Recognition | Oct 8, 2023 | Re-RankingTriplet | CodeCode Available | 1 |
| Music- and Lyrics-driven Dance Synthesis | Sep 30, 2023 | Triplet | CodeCode Available | 0 |
| SCALE: Synergized Collaboration of Asymmetric Language Translation Engines | Sep 29, 2023 | Continual LearningTranslation | CodeCode Available | 1 |
| Video-adverb retrieval with compositional adverb-action embeddings | Sep 26, 2023 | TripletVideo-Adverb Retrieval | CodeCode Available | 0 |
| A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention | Sep 23, 2023 | BlockingData Augmentation | CodeCode Available | 1 |
| Generative Retrieval with Semantic Tree-Structured Item Identifiers via Contrastive Learning | Sep 23, 2023 | Contrastive LearningRecommendation Systems | CodeCode Available | 1 |
| Learning From Noisy Correspondence With Tri-Partition for Cross-Modal Matching | Sep 22, 2023 | Cross-modal retrieval with noisy correspondenceMemorization | —Unverified | 0 |
| Bridging Sensor Gaps via Attention Gated Tuning for Hyperspectral Image Classification | Sep 22, 2023 | Computational EfficiencyHyperspectral Image Classification | CodeCode Available | 0 |
| Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? | Sep 22, 2023 | Semantic SimilaritySemantic Textual Similarity | —Unverified | 0 |
| StructChart: On the Schema, Metric, and Augmentation for Visual Chart Understanding | Sep 20, 2023 | Chart Question AnsweringChart Understanding | CodeCode Available | 2 |