SOTAVerified

Speech-to-Text

Papers

Showing 201-250 of 403 papers

Title | Status | Hype
Kencorpus: A Kenyan Language Corpus of Swahili, Dholuo and Luhya for Natural Language Processing Tasks | | 0
Improving Hypernasality Estimation with Automatic Speech Recognition in Cleft Palate Speech | | 0
Extending RNN-T-based speech recognition systems with emotion and language classification | | 0
RSD-GAN: Regularized Sobolev Defense GAN Against Speech-to-Text Adversarial Attacks | | 0
M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation | Code | 0
Language Model Augmented Monotonic Attention for Simultaneous Translation | | 0
System Description on Automatic Simultaneous Translation Workshop | | 0
Findings of the Third Workshop on Automatic Simultaneous Translation | | 0
Swiss German Speech to Text system evaluation | | 0
Finstreder: Simple and fast Spoken Language Understanding with Finite State Transducers using modern Speech-to-Text models | Code | 0
Developing a Speech Recognition System for Recognizing Tonal Speech Signals Using a Convolutional Neural Network | | 0
Revisiting End-to-End Speech-to-Text Translation From Scratch | | 0
The Nós Project: Opening routes for the Galician language in the field of language technologies | | 0
Towards Large Vocabulary Kazakh-Russian Sign Language Dataset: KRSL-OnlineSchool | | 0
A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking | | 0
Clinical Dialogue Transcription Error Correction using Seq2Seq Models | | 0
Semantic-preserved Communication System for Highly Efficient Speech Transmission | | 0
PaddleSpeech: An Easy-to-Use All-in-One Speech Toolkit | Code | 6
SAMU-XLSR: Semantically-Aligned Multimodal Utterance-level Cross-Lingual Speech Representation | | 0
Hearing voices at the National Library -- a speech corpus and acoustic model for the Swedish language | | 0
Cross-modal Contrastive Learning for Speech Translation | Code | 1
Design of a novel Korean learning application for efficient pronunciation correction | | 0
Wav2Seq: Pre-training Speech-to-Text Encoder-Decoder Models Using Pseudo Languages | Code | 1
Learning Adaptive Segmentation Policy for End-to-End Simultaneous Translation | | 0
NAIST Simultaneous Speech-to-Text Translation System for IWSLT 2022 | | 0
The HW-TSC’s Simultaneous Speech Translation System for IWSLT 2022 Evaluation | | 0
The AISP-SJTU Simultaneous Translation System for IWSLT 2022 | | 0
LibriS2S: A German-English Speech-to-Speech Translation Corpus | Code | 0
WaBERT: A Low-resource End-to-end Model for Spoken Language Understanding and Speech-to-BERT Alignment | | 0
Enhanced Direct Speech-to-Speech Translation Using Self-supervised Pre-training and Data Augmentation | | 0
A Study of Gender Impact in Self-supervised Models for Speech-to-Text Systems | | 0
Deep Speech Based End-to-End Automated Speech Recognition (ASR) for Indian-English Accents | | 0
The MIT Voice Name System | | 0
A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Code | 0
XTREME-S: Evaluating Cross-lingual Speech Representations | | 0
STEMM: Self-learning with Speech-text Manifold Mixup for Speech Translation | Code | 1
A^3T: Alignment-Aware Acoustic and Text Pretraining for Speech Synthesis and Editing | Code | 1
A combined approach to the analysis of speech conversations in a contact center domain | | 0
Attacks as Defenses: Designing Robust Audio CAPTCHAs Using Attacks on Automatic Speech Recognition Systems | | 0
Which French speech recognition system for assistant robots? | | 0
Spanish and English Phoneme Recognition by Training on Simulated Classroom Audio Recordings of Collaborative Learning Environments | Code | 0
Punctuation restoration in Swedish through fine-tuned KB-BERT | | 0
Semantic-aware Speech to Text Transmission with Redundancy Removal | | 0
Optimization of a Real-Time Wavelet-Based Algorithm for Improving Speech Intelligibility | | 0
CVSS Corpus and Massively Multilingual Speech-to-Speech Translation | Code | 2
A wearable sensor vest for social humanoid robots with GPGPU, IoT, and modular software architecture | Code | 0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Code | 0
Regularizing End-to-End Speech Translation with Triangular Decomposition Agreement | Code | 1
Cross-modal Contrastive Learning for Speech Translation | | 0
X-Vector based voice activity detection for multi-genre broadcast speech-to-text | Code | 1

No leaderboard results yet.