SOTAVerified

Automatic Speech Recognition

Papers

Showing 26512700 of 3174 papers

TitleStatusHype
Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural NetworksCode0
Towards Online End-to-end Transformer Automatic Speech Recognition0
ESPnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech ToolkitCode0
Analyzing the impact of speaker localization errors on speech separation for automatic speech recognitionCode0
Pre-training in Deep Reinforcement Learning for Automatic Speech Recognition0
Recognizing long-form speech using streaming end-to-end models0
Analyzing ASR pretraining for low-resource speech-to-text translation0
RNN based Incremental Online Spoken Language Understanding0
A practical two-stage training strategy for multi-stream end-to-end speech recognition0
Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model0
Word-level Embeddings for Cross-Task Transfer Learning in Speech ProcessingCode0
G2G: TTS-Driven Pronunciation Learning for Graphemic Hybrid ASR0
Robust Neural Machine Translation for Clean and Noisy Speech Transcripts0
AeGAN: Time-Frequency Speech Denoising via Generative Adversarial Networks0
Neuro-SERKET: Development of Integrative Cognitive System through the Composition of Deep Probabilistic Generative Models0
Multi-Talker MVDR Beamforming Based on Extended Complex Gaussian Mixture Model0
Transformer ASR with Contextual Block Processing0
Lead2Gold: Towards exploiting the full potential of noisy transcriptions for speech recognition0
Analyzing Large Receptive Field Convolutional Networks for Distant Speech Recognition0
VAIS ASR: Building a conversational speech recognition system using language model combination0
Query-by-example on-device keyword spotting0
Hear "No Evil", See "Kenansville": Efficient and Transferable Black-Box Attacks on Speech Recognition and Voice Identification Systems0
One-To-Many Multilingual End-to-end Speech Translation0
Adapting a FrameNet Semantic Parser for Spoken Language Understanding Using Adversarial Learning0
A Case Study on Combining ASR and Visual Features for Generating Instructional Video Captions0
Modeling Confidence in Sequence-to-Sequence Models0
Neural Zero-Inflated Quality Estimation Model For Automatic Speech Recognition System0
From Senones to Chenones: Tied Context-Dependent Graphemes for Hybrid Speech Recognition0
Multilingual End-to-End Speech Translation0
使用生成對抗網路於強健式自動語音辨識的應用(Exploiting Generative Adversarial Network for Robustness Automatic Speech Recognition)0
End-to-End Code-Switching ASR for Low-Resourced Language Pairs0
Improving RNN Transducer Modeling for End-to-End Speech RecognitionCode0
Improved Training Techniques for Online Neural Machine Translation0
Generating Robust Audio Adversarial Examples using Iterative Proportional Clipping0
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training0
Understanding Semantics from Speech Through Pre-training0
Improving OOV Detection and Resolution with External Language Models in Acoustic-to-Word ASR0
Code-Switched Language Models Using Neural Based Synthetic Data from Parallel Sentences0
Simultaneous Speech Recognition and Speaker Diarization for Monaural Dialogue Recordings with Target-Speaker Acoustic Models0
Integrating Source-channel and Attention-based Sequence-to-sequence Models for Speech Recognition0
NeMo: a toolkit for building AI applications using Neural Modules0
An Investigation Into On-device Personalization of End-to-end Automatic Speech Recognition Models0
Harnessing Indirect Training Data for End-to-End Automatic Speech Translation: Tricks of the Trade0
A Comparative Study on Transformer vs RNN in Speech ApplicationsCode0
Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model0
Neural Network-Based Modeling of Phonetic Durations0
Towards Accurate Text Verbalization for ASR Based on Audio Alignment0
Semantic Language Model for Tunisian Dialect0
Dialect-Specific Models for Automatic Speech Recognition of African American Vernacular English0
Human-Informed Speakers and Interpreters Analysis in the WAW Corpus and an Automatic Method for Calculating Interpreters' D\'ecalage0
Show:102550
← PrevPage 54 of 64Next →

No leaderboard results yet.