| Bridging Language and Items for Retrieval and Recommendation | Mar 6, 2024 | Retrieval, Sentence | Code Available | 3 | 5 |
| Active Retrieval Augmented Generation | May 11, 2023 | Retrieval, Retrieval-augmented Generation | Code Available | 2 | 5 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | Sentence, Speech-to-Speech Translation | Code Available | 2 | 5 |
| Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Apr 4, 2024 | Contrastive Learning, Referring Expression | Code Available | 2 | 5 |
| CLUE: A Chinese Language Understanding Evaluation Benchmark | Apr 13, 2020 | General Classification, Machine Reading Comprehension | Code Available | 2 | 5 |
| Compositional Entailment Learning for Hyperbolic Vision-Language Models | Oct 9, 2024 | Language Modelling, Representation Learning | Code Available | 2 | 5 |
| Comprehending and Ordering Semantics for Image Captioning | Jun 14, 2022 | Cross-Modal Retrieval, Image Captioning | Code Available | 2 | 5 |
| Correlation-Guided Query-Dependency Calibration for Video Temporal Grounding | Nov 15, 2023 | Highlight Detection, Moment Retrieval | Code Available | 2 | 5 |
| CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing | Feb 21, 2022 | Few-Shot Learning, Sentence | Code Available | 2 | 5 |
| BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Jul 17, 2023 | Instruction Following, Sentence | Code Available | 2 | 5 |