| Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation | Dec 20, 2022 | Machine TranslationMultimodal Machine Translation | CodeCode Available | 0 |
| DDIPNet and DDIPNet+: Discriminant Deep Image Prior Networks for Remote Sensing Image Classification | Dec 20, 2022 | Classificationimage-classification | —Unverified | 0 |
| SrTR: Self-reasoning Transformer with Visual-linguistic Knowledge for Scene Graph Generation | Dec 19, 2022 | DecoderGraph Generation | —Unverified | 0 |
| A Better Choice: Entire-space Datasets for Aspect Sentiment Triplet Extraction | Dec 18, 2022 | Aspect Sentiment Triplet ExtractionExtract Aspect | CodeCode Available | 0 |
| Resolving Semantic Confusions for Improved Zero-Shot Detection | Dec 12, 2022 | Generalized Zero-Shot Object DetectionObject Detection | CodeCode Available | 1 |
| VASR: Visual Analogies of Situation Recognition | Dec 8, 2022 | Common Sense ReasoningTriplet | CodeCode Available | 0 |
| Rendezvous in Time: An Attention-based Temporal Fusion approach for Surgical Triplet Recognition | Nov 30, 2022 | Action Triplet RecognitionTriplet | CodeCode Available | 1 |
| A Revenue Function for Comparison-Based Hierarchical Clustering | Nov 29, 2022 | ClusteringOpen-Ended Question Answering | CodeCode Available | 0 |
| Generalized Face Anti-Spoofing via Multi-Task Learning and One-Side Meta Triplet Loss | Nov 29, 2022 | Depth EstimationFace Anti-Spoofing | —Unverified | 0 |
| Renmin University of China at TRECVID 2022: Improving Video Search by Feature Fusion and Negation Understanding | Nov 28, 2022 | Ad-hoc video searchNegation | —Unverified | 0 |