SOTAVerified

Automatic Speech Recognition

Papers

Showing 1501-1550 of 3174 papers

Title | Status | Hype
Conversational Speech Recognition Needs Data? Experiments with Austrian German |  | 0
Towards a Unified ASR System for the Armenian Standards |  | 0
Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition |  | 0
Generating Synthetic Clinical Speech Data through Simulated ASR Deletion Error |  | 0
SSR7000: A Synchronized Corpus of Ultrasound Tongue Imaging for End-to-End Silent Speech Recognition | Code | 0
Building Open-source Speech Technology for Low-resource Minority Languages with SáMi as an Example – Tools, Methods and Experiments |  | 0
Evaluation of Off-the-shelf Speech Recognizers on Different Accents in a Dialogue Domain |  | 0
Development of Automatic Speech Recognition for the Documentation of Cook Islands Māori |  | 0
Towards an Open-Source Dutch Speech Recognition System for the Healthcare Domain |  | 0
Samrómur Children: An Icelandic Speech Corpus |  | 0
ParlaSpeech-HR - a Freely Available ASR Dataset for Croatian Bootstrapped from the ParlaMint Corpus |  | 0
A Semi-Automated Live Interlingual Communication Workflow Featuring Intralingual Respeaking: Evaluation and Benchmarking |  | 0
ParlamentParla: A Speech Corpus of Catalan Parliamentary Sessions |  | 0
BEA-Base: A Benchmark for ASR of Spontaneous Hungarian |  | 0
Mesures linguistiques automatiques pour l’évaluation des systèmes de Reconnaissance Automatique de la Parole (Automated linguistic measures for automatic speech recognition systems’ evaluation) |  | 0
Snow Mountain: Dataset of Audio Recordings of The Bible in Low Resource Languages |  | 0
Adversarial synthesis based data-augmentation for code-switched spoken language identification |  | 0
Adaptive Activation Network For Low Resource Multilingual Speech Recognition |  | 0
Acoustic-to-articulatory Speech Inversion with Multi-task Learning |  | 0
Punctuation Restoration in Spanish Customer Support Transcripts using Transfer Learning |  | 0
Contextual Adapters for Personalized Speech Recognition in Neural Transducers |  | 0
Clinical Dialogue Transcription Error Correction using Seq2Seq Models |  | 0
Joint Training of Speech Enhancement and Self-supervised Model for Noise-robust ASR |  | 0
On Building Spoken Language Understanding Systems for Low Resourced Languages |  | 0
Investigating Lexical Replacements for Arabic-English Code-Switched Data Augmentation |  | 0
Improving CTC-based ASR Models with Gated Interlayer Collaboration |  | 0
Heterogeneous Reservoir Computing Models for Persian Speech Recognition |  | 0
FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech | Code | 0
An Investigation on Applying Acoustic Feature Conversion to ASR of Adult and Child Speech |  | 0
Multi-Level Modeling Units for End-to-End Mandarin Speech Recognition |  | 0
Calibrate and Refine! A Novel and Agile Framework for ASR-error Robust Intent Detection |  | 0
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners | Code | 1
Self-Supervised Speech Representation Learning: A Review |  | 0
Automatic Spoken Language Identification using a Time-Delay Neural Network |  | 0
Insights on Neural Representations for End-to-End Speech Recognition |  | 0
Streaming Noise Context Aware Enhancement For Automatic Speech Recognition in Multi-Talker Environments |  | 0
Deploying self-supervised learning in the wild for hybrid automatic speech recognition |  | 0
Improved Consistency Training for Semi-Supervised Sequence-to-Sequence ASR via Speech Chain Reconstruction and Self-Transcribing |  | 0
Pretraining Approaches for Spoken Language Recognition: TalTech Submission to the OLR 2021 Challenge |  | 0
Personalized Adversarial Data Augmentation for Dysarthric and Elderly Speech Recognition |  | 0
Unified Modeling of Multi-Domain Multi-Device ASR Systems |  | 0
Who Are We Talking About? Handling Person Names in Speech Translation |  | 0
A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection |  | 0
End-to-End Multi-Person Audio/Visual Automatic Speech Recognition |  | 0
Best of Both Worlds: Multi-task Audio-Visual Automatic Speech Recognition and Active Speaker Detection |  | 0
Speaker Reinforcement Using Target Source Extraction for Robust Automatic Speech Recognition |  | 0
Vietnamese Automatic Speech Recognition using Wav2vec 2.0 | Code | 1
Transformer-Based Multi-Aspect Multi-Granularity Non-Native English Speaker Pronunciation Assessment | Code | 1
A Conformer-based Waveform-domain Neural Acoustic Echo Canceller Optimized for ASR Accuracy |  | 0
Speaker Recognition in the Wild | Code | 1
Page 31 of 64

No leaderboard results yet.