| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation | May 24, 2023 | Formal LogicGPU | CodeCode Available | 1 |
| CMOT: Cross-modal Mixup via Optimal Transport for Speech Translation | May 24, 2023 | Machine TranslationTranslation | CodeCode Available | 1 |
| SAMScore: A Content Structural Similarity Metric for Image Translation Evaluation | May 24, 2023 | Semantic SimilaritySemantic Textual Similarity | CodeCode Available | 1 |
| Sāmayik: A Benchmark and Dataset for English-Sanskrit Translation | May 23, 2023 | Machine TranslationTranslation | CodeCode Available | 1 |
| BigVideo: A Large-scale Video Subtitle Translation Dataset for Multimodal Machine Translation | May 23, 2023 | Contrastive LearningMachine Translation | CodeCode Available | 1 |
| Gloss-Free End-to-End Sign Language Translation | May 22, 2023 | Gloss-free Sign Language TranslationSign Language Translation | CodeCode Available | 1 |
| Communication Efficient Federated Learning for Multilingual Neural Machine Translation with Adapter | May 21, 2023 | ClusteringFederated Learning | CodeCode Available | 1 |
| Scene Graph as Pivoting: Inference-time Image-free Unsupervised Multimodal Machine Translation with Visual Scene Hallucination | May 20, 2023 | HallucinationMachine Translation | CodeCode Available | 1 |
| DUB: Discrete Unit Back-translation for Speech Translation | May 19, 2023 | Machine TranslationSpeech-to-Text | CodeCode Available | 1 |