| Compositional Entailment Learning for Hyperbolic Vision-Language Models | Oct 9, 2024 | Language Modelling, Representation Learning | Code Available | 2 |
| Compositional Visual Generation with Composable Diffusion Models | Jun 3, 2022 | Sentence | Code Available | 2 |
| BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric | Dec 16, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2 |
| CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Jan 11, 2022 | Sentence, Speech-to-Speech Translation | Code Available | 2 |
| Decoupling Static and Hierarchical Motion Perception for Referring Video Segmentation | Apr 4, 2024 | Contrastive Learning, Referring Expression | Code Available | 2 |
| Deduplicating Training Data Makes Language Models Better | Jul 14, 2021 | Language Modeling, Language Modelling | Code Available | 2 |
| Advancing Large Language Models to Capture Varied Speaking Styles and Respond Properly in Spoken Conversations | Feb 20, 2024 | Sentence | Code Available | 2 |
| AgentRE: An Agent-Based Framework for Navigating Complex Information Landscapes in Relation Extraction | Sep 3, 2024 | Relation, Relation Extraction | Code Available | 2 |
| DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement | Jun 18, 2025 | Graph Generation, Hallucination | Code Available | 2 |
| MedCPT: Contrastive Pre-trained Transformers with Large-scale PubMed Search Logs for Zero-shot Biomedical Information Retrieval | Jul 2, 2023 | Biomedical Information Retrieval, Contrastive Learning | Code Available | 2 |