| FBSDiff: Plug-and-Play Frequency Band Substitution of Diffusion Features for Highly Controllable Text-Driven Image Translation | Aug 2, 2024 | Image Generation, Image-to-Image Translation | Code Available | 1 |
| Cross-Domain Separable Translation Network for Multimodal Image Change Detection | Jul 23, 2024 | Change Detection, Translation | Code Available | 1 |
| LLaST: Improved End-to-end Speech Translation System Leveraged by Large Language Models | Jul 22, 2024 | Data Augmentation, Language Modeling | Code Available | 1 |
| Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models | Jul 5, 2024 | Adversarial Attack, Automatic Speech Recognition | Code Available | 1 |
| Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization | Jul 5, 2024 | GPU, Image-to-Image Translation | Code Available | 1 |
| Frequency-Controlled Diffusion Model for Versatile Text-Guided Image-to-Image Translation | Jul 3, 2024 | Image-to-Image Translation, Translation | Code Available | 1 |
| SignSpeak: Open-Source Time Series Classification for ASL Translation | Jun 27, 2024 | Time Series, Time Series Classification | Code Available | 1 |
| Leveraging Pre-trained Models for FF-to-FFPE Histopathological Image Translation | Jun 26, 2024 | Diagnostic, Language Modelling | Code Available | 1 |
| ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs | Jun 26, 2024 | ArzEn Code-switched Translation to ara, ArzEn Code-switched Translation to eng | Code Available | 1 |
| LLMs Are Zero-Shot Context-Aware Simultaneous Translators | Jun 19, 2024 | Machine Translation, Translation | Code Available | 1 |