SOTAVerified

Speech-to-Text

Papers

Showing 76100 of 403 papers

TitleStatusHype
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models0
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy0
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR0
Development of Natural Language Processing Tools for Cook Islands M\=aori0
Adversarial Attacks against Neural Networks in Audio Domain: Exploiting Principal Components0
BCN2BRNO: ASR System Fusion for Albayzin 2020 Speech to Text Challenge0
Balancing Speech Understanding and Generation Using Continual Pre-training for Codec-based Speech LLM0
An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting0
Design of a novel Korean learning application for efficient pronunciation correction0
Data Efficient Direct Speech-to-Text Translation with Modality Agnostic Meta-Learning0
A Dutch Dysarthric Speech Database for Individualized Speech Therapy Research0
An Empirical Evaluation of AI-Powered Non-Player Characters' Perceived Realism and Performance in Virtual Reality Environments0
A Voice Controlled E-Commerce Web Application0
A combined approach to the analysis of speech conversations in a contact center domain0
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect0
Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network0
Dialetto, ma Quanto Dialetto? Transcribing and Evaluating Dialects on a Continuum0
Multilingual Speech Translation with Efficient Finetuning of Pretrained Models0
Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing0
CTC Alignments Improve Autoregressive Translation0
CUIfy the XR: An Open-Source Package to Embed LLM-powered Conversational Agents in XR0
Cross-modal Contrastive Learning for Speech Translation0
DARTS: Dialectal Arabic Transcription System0
Analyzing Utility of Visual Context in Multimodal Speech Recognition Under Noisy Conditions0
Deep Learning Based Natural Language Processing for End to End Speech Translation0
Show:102550
← PrevPage 4 of 17Next →

No leaderboard results yet.