SOTAVerified

Speech-to-Text

Papers

Showing 76-100 of 403 papers

| Title | Status | Hype |
|---|---|---|
| Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach | Code | 0 |
| OWSM-CTC: An Open Encoder-Only Speech Foundation Model for Speech Recognition, Translation, and Language Identification | Code | 0 |
| An Empirical Study of Consistency Regularization for End-to-End Speech-to-Text Translation | Code | 0 |
| A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture | Code | 0 |
| M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation | Code | 0 |
| Measuring the Effect of Transcription Noise on Downstream Language Understanding Tasks | Code | 0 |
| MMSpeech: Multi-modal Multi-task Encoder-Decoder Pre-training for Speech Recognition | Code | 0 |
| Automatic Quality Assessment for Speech Translation Using Joint ASR and MT Features | Code | 0 |
| Let's Give a Voice to Conversational Agents in Virtual Reality | Code | 0 |
| Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Code | 0 |
| CoVoSwitch: Machine Translation of Synthetic Code-Switched Text Based on Intonation Units | Code | 0 |
| Augmenting Librispeech with French Translations: A Multimodal Corpus for Direct Speech Translation Evaluation | Code | 0 |
| LibriS2S: A German-English Speech-to-Speech Translation Corpus | Code | 0 |
| Investigating Zero-Shot Generalizability on Mandarin-English Code-Switched ASR and Speech-to-text Translation of Recent Foundation Models with Self-Supervision and Weak Supervision | Code | 0 |
| A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Code | 0 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Code | 0 |
| Joint CTC-Attention based End-to-End Speech Recognition using Multi-task Learning | Code | 0 |
| A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Code | 0 |
| Contextualized Translation of Automatically Segmented Speech | Code | 0 |
| Audio Adversarial Examples: Targeted Attacks on Speech-to-Text | Code | 0 |
| Infusing Future Information into Monotonic Attention Through Language Models | Code | 0 |
| Listen and Speak Fairly: A Study on Semantic Gender Bias in Speech Integrated Large Language Models | Code | 0 |
| Attentively Embracing Noise for Robust Latent Representation in BERT | Code | 0 |
| Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | Code | 0 |
| Fleurs-SLU: A Massively Multilingual Benchmark for Spoken Language Understanding | Code | 0 |

No leaderboard results yet.