SOTAVerified

Automatic Speech Recognition

Papers

Showing 28262850 of 3174 papers

TitleStatusHype
Tag and correct: high precision post-editing approach to correction of speech recognition errors0
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline0
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval0
Targeted Adversarial Examples for Black Box Audio Systems0
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS0
Task-aware Warping Factors in Mask-based Speech Enhancement0
Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC100
Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine0
Teach an all-rounder with experts in different domains0
Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques0
Technology-Augmented Multilingual Communication Models: New Interaction Paradigms, Shifts in the Language Services Industry, and Implications for Training Programs0
TED-LIUM: an Automatic Speech Recognition dedicated corpus0
Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR0
Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation0
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models0
Text Injection for Neural Contextual Biasing0
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis0
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator0
Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition0
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages0
The 2015 Sheffield System for Transcription of Multi-Genre Broadcast Media0
Show:102550
← PrevPage 114 of 127Next →

No leaderboard results yet.