SOTAVerified

Speech-to-Text

Papers

Showing 251300 of 403 papers

TitleStatusHype
Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation0
NAIST Simultaneous Speech-to-Text Translation System for IWSLT 20220
The AISP-SJTU Simultaneous Translation System for IWSLT 20220
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation0
LibriS2S: A German-English Speech-to-Speech Translation CorpusCode0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation0
A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems0
Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents0
The MIT Voice Name System0
A Dataset for Speech Emotion Recognition in Greek Theatrical PlaysCode0
XTREME-S: Evaluating Cross-lingual Speech Representations0
A combined approach to the analysis of speech conversations in a contact center domain0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems0
Which French speech recognition system for assistant robots?0
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning EnvironmentsCode0
Punctuation restoration in Swedish through fine-tuned KB-BERT0
Semantic-aware Speech to Text Transmission with Redundancy Removal0
Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility0
A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architectureCode0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene RecognitionCode0
Cross-modal Contrastive Learning for Speech Translation0
Training end-to-end speech-to-text models on mobile phones0
Impact of Microphone position Measurement Error on Multi Channel Distant Speech Recognition & Intelligibility0
Improve Sinhala Speech Recognition Through e2e LF-MMI Model0
An Experiment on Speech-to-Text Translation Systems for Manipuri to English on Low Resource Setting0
Scribosermo: Fast Speech-to-Text models for German and other LanguagesCode0
Decision Attentive Regularization to Improve Simultaneous Speech Translation Systems0
Comparison of SVD and factorized TDNN approaches for speech to text0
A Comparative Study on Non-Autoregressive Modelings for Speech-to-Text Generation0
Automated Testing of AI Models0
Challenges and Opportunities of Speech Recognition for Bengali Language0
Audio Interval Retrieval using Convolutional Neural Networks0
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition0
Infusing Future Information into Monotonic Attention Through Language ModelsCode0
With One Voice: Composing a Travel Voice Assistant from Re-purposed Models0
Multilingual Speech Translation from Efficient Finetuning of Pretrained Models0
Corpus Creation and Evaluation for Speech-to-Text and Speech Translation0
BTS: Back TranScription for Speech-to-Text Post-Processor using Text-to-Speech-to-Text0
Improving Speech Translation by Understanding and Learning from the Auxiliary Text Translation Task0
The USTC-NELSLIP Systems for Simultaneous Speech Translation Task at IWSLT 20210
Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling0
Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR0
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS0
On the Design of Strategic Task Recommendations for Sustainable Crowdsourcing-Based Content Moderation0
Findings of the Second Workshop on Automatic Simultaneous Translation0
Worldly Wise (WoW) - Cross-Lingual Knowledge Fusion for Fact-based Visual Spoken-Question Answering0
What shall we do with an hour of data? Speech recognition for the un- and under-served languages of Common Voice0
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect0
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models0
Show:102550
← PrevPage 6 of 9Next →

No leaderboard results yet.