SOTAVerified

Automatic Speech Recognition

Papers

Showing 22012250 of 3174 papers

TitleStatusHype
SynthASR: Unlocking Synthetic Data for Speech Recognition0
Synthesising Audio Adversarial Examples for Automatic Speech Recognition0
Synthesizing Dysarthric Speech Using Multi-talker TTS for Dysarthric Speech Recognition0
Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition0
Synt++: Utilizing Imperfect Synthetic Data to Improve Speech Recognition0
Tag and correct: high precision post-editing approach to correction of speech recognition errors0
TALCS: An Open-Source Mandarin-English Code-Switching Corpus and a Speech Recognition Baseline0
Talk, Don't Write: A Study of Direct Speech-Based Image Retrieval0
Targeted Adversarial Examples for Black Box Audio Systems0
TASK AWARE MULTI-TASK LEARNING FOR SPEECH TO TEXT TASKS0
Task-aware Warping Factors in Mask-based Speech Enhancement0
Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC100
Task-Specific Audio Coding for Machines: Machine-Learned Latent Features Are Codes for That Machine0
Teach an all-rounder with experts in different domains0
Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques0
Technology-Augmented Multilingual Communication Models: New Interaction Paradigms, Shifts in the Language Services Industry, and Implications for Training Programs0
TED-LIUM: an Automatic Speech Recognition dedicated corpus0
Temporal Order Preserved Optimal Transport-based Cross-modal Knowledge Transfer Learning for ASR0
Tencent-MVSE: A Large-Scale Benchmark Dataset for Multi-Modal Video Similarity Evaluation0
Teochew-Wild: The First In-the-wild Teochew Dataset with Orthographic Annotations0
Text Generation with Speech Synthesis for ASR Data Augmentation0
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models0
Text Injection for Neural Contextual Biasing0
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis0
Text-only domain adaptation for end-to-end ASR using integrated text-to-mel-spectrogram generator0
Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation0
Text-To-Speech Data Augmentation for Low Resource Speech Recognition0
Thank you for Attention: A survey on Attention-based Artificial Neural Networks for Automatic Speech Recognition0
That Sounds Familiar: an Analysis of Phonetic Representations Transfer Across Languages0
The 2015 Sheffield System for Transcription of Multi-Genre Broadcast Media0
The AFRL IWSLT 2018 Systems: What Worked, What Didn’t0
The AFRL IWSLT 2020 Systems: Work-From-Home Edition0
The Balancing Act: Unmasking and Alleviating ASR Biases in Portuguese0
The CHiME-7 Challenge: System Description and Performance of NeMo Team's DASR System0
The CHiME-7 DASR Challenge: Distant Meeting Transcription with Multiple Devices in Diverse Scenarios0
The CHiME-8 DASR Challenge for Generalizable and Array Agnostic Distant Automatic Speech Recognition and Diarization0
The CUHK-TENCENT speaker diarization system for the ICASSP 2022 multi-channel multi-party meeting transcription challenge0
The DIRHA Portuguese Corpus: A Comparison of Home Automation Command Detection and Recognition in Simulated and Real Data.0
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR0
The Esethu Framework: Reimagining Sustainable Dataset Governance and Curation for Low-Resource Languages0
The ETAPE speech processing evaluation0
The evaluation of a code-switched Sepedi-English automatic speech recognition system0
The Faetar Benchmark: Speech Recognition in a Very Under-Resourced Language0
SoK: The Faults in our ASRs: An Overview of Attacks against Automatic Speech Recognition and Speaker Identification Systems0
The fifth 'CHiME' Speech Separation and Recognition Challenge: Dataset, task and baselines0
The Gift of Feedback: Improving ASR Model Quality by Learning from User Corrections through Federated Learning0
The HW-TSC's Offline Speech Translation Systems for IWSLT 2021 Evaluation0
The HW-TSC’s Offline Speech Translation System for IWSLT 2022 Evaluation0
The IBM 2016 Speaker Recognition System0
The IBM Speaker Recognition System: Recent Advances and Error Analysis0
Show:102550
← PrevPage 45 of 64Next →

No leaderboard results yet.