SOTAVerified

Speech-to-Text

Papers

Showing 5175 of 403 papers

TitleStatusHype
Denial-of-Service Poisoning Attacks against Large Language ModelsCode1
Stacked DeBERT: All Attention in Incomplete Data for Text ClassificationCode1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and EditingCode1
Benchmarking Large Multimodal Models against Common CorruptionsCode1
CoVoST 2 and Massively Multilingual Speech-to-Text TranslationCode1
Towards Automatic Speech to Sign Language GenerationCode1
Cross-modal Contrastive Learning for Speech TranslationCode1
WhiSPA: Semantically and Psychologically Aligned Whisper with Self-Supervised Contrastive and Student-Teacher LearningCode1
DUB: Discrete Unit Back-translation for Speech TranslationCode1
Common Voice: A Massively-Multilingual Speech CorpusCode1
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMsCode1
FlexiBO: A Decoupled Cost-Aware Multi-Objective Optimization Approach for Deep Neural NetworksCode1
LeaPformer: Enabling Linear Transformers for Autoregressive and Simultaneous Tasks via Learned ProportionsCode1
Cross Attention Augmented Transducer Networks for Simultaneous TranslationCode1
Revisiting Interpolation Augmentation for Speech-to-Text GenerationCode1
Deep Reinforcement Learning For Sequence to Sequence ModelsCode1
Challenges and Opportunities of Speech Recognition for Bengali Language0
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning0
Can We Achieve High-quality Direct Speech-to-Speech Translation without Parallel Speech Data?0
Application of Audio Fingerprinting Techniques for Real-Time Scalable Speech Retrieval and Speech Clusterization0
A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks0
CoSTA: Code-Switched Speech Translation using Aligned Speech-Text Interleaving0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
Application-Agnostic Language Modeling for On-Device ASR0
Bridging the Modality Gap for Speech-to-Text Translation0
Show:102550
← PrevPage 3 of 17Next →

No leaderboard results yet.