| XLAVS-R: Cross-Lingual Audio-Visual Speech Representation Learning for Noise-Robust Speech Perception | Mar 21, 2024 | Audio-Visual Speech Recognition, Representation Learning | Unverified | 0 |
| Isometric Neural Machine Translation using Phoneme Count Ratio Reward-based Reinforcement Learning | Mar 20, 2024 | Automatic Speech Recognition (ASR) | Unverified | 0 |
| SumTra: A Differentiable Pipeline for Few-Shot Cross-Lingual Summarization | Mar 20, 2024 | Language Modelling, Translation | Code Available | 0 |
| Enhancing Fingerprint Image Synthesis with GANs, Diffusion Models, and Style Transfer Techniques | Mar 20, 2024 | Diversity, Image Generation | Unverified | 0 |
| Multi-Dimensional Machine Translation Evaluation: Model Evaluation and Resource for Korean | Mar 19, 2024 | Machine Translation, Sentence | Code Available | 0 |
| MSLM-S2ST: A Multitask Speech Language Model for Textless Speech-to-Speech Translation with Speaker Style Preservation | Mar 19, 2024 | Decoder, Language Modeling | Unverified | 0 |
| FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Mar 19, 2024 | Translation, valid | Code Available | 4 |
| Generalized Consistency Trajectory Models for Image Manipulation | Mar 19, 2024 | Denoising, Image Manipulation | Code Available | 1 |
| Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation | Mar 19, 2024 | Gloss-free Sign Language Translation, Language Modeling | Code Available | 1 |
| Self-generated Replay Memories for Continual Neural Machine Translation | Mar 19, 2024 | Decoder, Machine Translation | Code Available | 0 |