SOTAVerified

Speech-to-Text

Papers

Showing 176-200 of 403 papers

| Title | Status | Hype |
| --- | --- | --- |
| MuAViC: A Multilingual Audio-Visual Corpus for Robust Speech Recognition and Robust Speech-to-Text Translation | Code | 2 |
| Improving Medical Speech-to-Text Accuracy with Vision-Language Pre-training Model | | 0 |
| PATCorrect: Non-autoregressive Phoneme-augmented Transformer for ASR Error Correction | | 0 |
| Characterizing Financial Market Coverage using Artificial Intelligence | | 0 |
| PSST! Prosodic Speech Segmentation with Transformers | Code | 1 |
| Pre-training for Speech Translation: CTC Meets Optimal Transport | Code | 1 |
| Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition | | 0 |
| Pushing the performances of ASR models on English and Spanish accents | | 0 |
| WACO: Word-Aligned Contrastive Learning for Speech Translation | Code | 0 |
| M3ST: Mix at Three Levels for Speech Translation | | 0 |
| MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition | Code | 0 |
| Handling and extracting key entities from customer conversations using Speech recognition and Named Entity recognition | | 0 |
| Multilingual Speech Emotion Recognition With Multi-Gating Mechanism and Neural Architecture Search | | 0 |
| Phonemic Representation and Transcription for Speech to Text Applications for Under-resourced Indigenous African Languages: The Case of Kiswahili | | 0 |
| Efficient Speech Translation with Dynamic Latent Perceivers | Code | 0 |
| Don't Discard Fixed-Window Audio Segmentation in Speech-to-Text Translation | Code | 0 |
| Information-Transport-based Policy for Simultaneous Translation | Code | 1 |
| Named Entity Detection and Injection for Direct Speech Translation | | 0 |
| Improving Semi-supervised End-to-end Automatic Speech Recognition using CycleGAN and Inter-domain Losses | | 0 |
| Simple and Effective Unsupervised Speech Translation | | 0 |
| Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy | Code | 0 |
| CTC Alignments Improve Autoregressive Translation | | 0 |
| SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training | Code | 0 |
| JoeyS2T: Minimalistic Speech-to-Text Modeling with JoeyNMT | Code | 1 |
| Speech-to-Text and Evaluation of Multiple Machine Translation Systems | | 0 |

No leaderboard results yet.