| Low-Rank and Sparse Model Merging for Multi-Lingual Speech Recognition and Translation | Feb 24, 2025 | Automatic Speech RecognitionDiversity | —Unverified | 0 |
| Low-Resource Speech-to-Text Translation | Mar 24, 2018 | DecoderMachine Translation | —Unverified | 0 |
| M3ST: Mix at Three Levels for Speech Translation | Dec 7, 2022 | Data AugmentationDiversity | —Unverified | 0 |
| MAM: Masked Acoustic Modeling for End-to-End Speech-to-Text Translation | Oct 22, 2020 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MinMo: A Multimodal Large Language Model for Seamless Voice Interaction | Jan 10, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition | Nov 29, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Modular Speech-to-Text Translation for Zero-Shot Cross-Modal Transfer | Oct 5, 2023 | Speech-to-TextSpeech-to-Text Translation | —Unverified | 0 |
| Multi-Discriminator Sobolev Defense-GAN Against Adversarial Attacks for End-to-End Speech Systems | Mar 15, 2021 | Speech-to-Text | —Unverified | 0 |
| Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search | Oct 31, 2022 | Emotion RecognitionNeural Architecture Search | —Unverified | 0 |
| Multilingual Speech Translation from Efficient Finetuning of Pretrained Models | Aug 1, 2021 | DecoderSpeech-to-Text | —Unverified | 0 |