| BotTriNet: A Unified and Efficient Embedding for Social Bots Detection via Metric Learning | Apr 6, 2023 | Metric LearningSentence | —Unverified | 0 |
| Open-Vocabulary Point-Cloud Object Detection without 3D Annotation | Apr 3, 2023 | 3D Object Detection3D Open-Vocabulary Object Detection | CodeCode Available | 1 |
| AUDIT: Audio Editing by Following Instructions with Latent Diffusion Models | Apr 3, 2023 | DenoisingSuper-Resolution | —Unverified | 0 |
| SPAN: Learning Similarity between Scene Graphs and Images with Transformers | Apr 2, 2023 | Contrastive LearningGraph Generation | CodeCode Available | 1 |
| Whether and When does Endoscopy Domain Pretraining Make Sense? | Mar 30, 2023 | Action Triplet DetectionSurgical phase recognition | CodeCode Available | 1 |
| Joint embedding in Hierarchical distance and semantic representation learning for link prediction | Mar 28, 2023 | Graph EmbeddingKnowledge Graph Embedding | —Unverified | 0 |
| Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation | Mar 28, 2023 | SentenceTriplet | —Unverified | 0 |
| Cross-View Visual Geo-Localization for Outdoor Augmented Reality | Mar 28, 2023 | geo-localizationPose Estimation | —Unverified | 0 |
| Improving Contextualized Topic Models with Negative Sampling | Mar 27, 2023 | DiversityTopic Models | CodeCode Available | 0 |
| Exploiting Unlabelled Photos for Stronger Fine-Grained SBIR | Mar 24, 2023 | Image RetrievalKnowledge Distillation | —Unverified | 0 |