SOTAVerified

Speech-to-Text

Papers

Showing 2650 of 403 papers

TitleStatusHype
Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech TranslationCode1
Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented GenerationCode1
Kosp2e: Korean Speech to English Translation CorpusCode1
End-to-End Single-Channel Speaker-Turn Aware Conversational Speech TranslationCode1
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMsCode1
Learning Shared Semantic Space for Speech-to-Text TranslationCode1
IESTAC: English-Italian Parallel Corpus for End-to-End Speech-to-Text Machine TranslationCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
Clotho: An Audio Captioning DatasetCode1
Indoor Air Quality Dataset with Activities of Daily Living in Low to Middle-income CommunitiesCode1
Brilla AI: AI Contestant for the National Science and Maths QuizCode1
A Large-Scale Chinese Multimodal NER Dataset with Speech CluesCode1
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text TranslationCode1
CoVoST: A Diverse Multilingual Speech-To-Text Translation CorpusCode1
Cross-modal Contrastive Learning for Speech TranslationCode1
Audio Jailbreak Attacks: Exposing Vulnerabilities in SpeechGPT in a White-Box FrameworkCode1
Cross Attention Augmented Transducer Networks for Simultaneous TranslationCode1
Benchmarking Large Multimodal Models against Common CorruptionsCode1
Denial-of-Service Poisoning Attacks against Large Language ModelsCode1
Fine-tuning Whisper on Low-Resource Languages for Real-World ApplicationsCode1
Automatic speech recognition for the Nepali language using CNN, bidirectional LSTM and ResNetCode1
Back Translation for Speech-to-text Translation Without TranscriptsCode1
DuplexMamba: Enhancing Real-time Speech Conversations with Duplex and Streaming CapabilitiesCode1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and EditingCode1
CoVoST 2 and Massively Multilingual Speech-to-Text TranslationCode1
Show:102550
← PrevPage 2 of 17Next →

No leaderboard results yet.