SOTAVerified

Automatic Speech Recognition

Papers

Showing 901-950 of 3174 papers

Title | Status | Hype
Detecting Speech Abnormalities with a Perceiver-based Sequence Classifier that Leverages a Universal Speech Model | | 0
Developing Acoustic Models for Automatic Speech Recognition in Swedish | | 0
Attention Enhanced Citrinet for Speech Recognition | | 0
Developing ASR for Indonesian-English Bilingual Language Teaching | | 0
Developing Automatic Speech Recognition for Scottish Gaelic | | 0
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability | | 0
Correction of Automatic Speech Recognition with Transformer Sequence-to-sequence Model | | 0
Development of Automatic Speech Recognition for the Documentation of Cook Islands Māori | | 0
Correction Focused Language Model Training for Speech Recognition | | 0
Device Directedness with Contextual Cues for Spoken Dialog Systems | | 0
Device-directed Utterance Detection | | 0
DFSMN-SAN with Persistent Memory Model for Automatic Speech Recognition | | 0
Attention-based Wav2Text with Feature Transfer Learning | | 0
A Likelihood Ratio based Domain Adaptation Method for E2E Models | | 0
A Curriculum Learning Method for Improved Noise Robustness in Automatic Speech Recognition | | 0
Dialect Adaptation and Data Augmentation for Low-Resource ASR: TalTech Systems for the MADASR 2023 Challenge | | 0
Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models | | 0
Corpus Phonetics Tutorial | | 0
Dialect Identification through Adversarial Learning and Knowledge Distillation on Romanian BERT | | 0
Dialect-Specific Models for Automatic Speech Recognition of African American Vernacular English | | 0
Dialog act guided contextual adapter for personalized speech recognition | | 0
Dialogue Act Segmentation for Vietnamese Human-Human Conversational Texts | | 0
Attention based on-device streaming speech recognition with large speech corpus | | 0
Diarization-Aware Multi-Speaker Automatic Speech Recognition via Large Language Models | | 0
Corpus Generation for Voice Command in Smart Home and the Effect of Speech Synthesis on End-to-End SLU | | 0
Corpora for Cross-Language Information Retrieval in Six Less-Resourced Languages | | 0
Align, Write, Re-order: Explainable End-to-End Speech Translation via Operation Sequence Generation | | 0
CORILGA: a Galician Multilevel Annotated Speech Corpus for Linguistic Analysis | | 0
Digits micro-model for accurate and secure transactions | | 0
Dilated U-net based approach for multichannel speech enhancement from First-Order Ambisonics recordings | | 0
Attention based end to end Speech Recognition for Voice Search in Hindi and English | | 0
Directed Speech Separation for Automatic Speech Recognition of Long Form Conversational Speech | | 0
Directional ASR: A New Paradigm for E2E Multi-Speaker Speech Recognition with Source Localization | | 0
Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments | | 0
Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition | | 0
Direct Speech to Speech Translation: A Review | | 0
Align With Purpose: Optimize Desired Properties in CTC Models with a General Plug-and-Play Framework | | 0
Disappeared Command: Spoofing Attack On Automatic Speech Recognition Systems with Sound Masking | | 0
Activity focused Speech Recognition of Preschool Children in Early Childhood Classrooms | | 0
A Closer Look at Audio-Visual Multi-Person Speech Recognition and Active Speaker Selection | | 0
Convolutional Variational Autoencoders for Spectrogram Compression in Automatic Speech Recognition | | 0
Convolutional Speech Recognition with Pitch and Voice Quality Features | | 0
Convoifilter: A case study of doing cocktail party speech recognition | | 0
Conversational Speech Recognition Needs Data? Experiments with Austrian German | | 0
Attention-based ASR with Lightweight and Dynamic Convolutions | | 0
Discriminative Speech Recognition Rescoring with Pre-trained Language Models | | 0
Discriminative training of RNNLMs with the average word error criterion | | 0
Disentangled-Transformer: An Explainable End-to-End Automatic Speech Recognition Model with Speech Content-Context Separation | | 0
Disentangleing Content and Fine-grained Prosody Information via Hybrid ASR Bottleneck Features for Voice Conversion | | 0
Alignment Restricted Streaming Recurrent Neural Network Transducer | | 0
Page 19 of 64

No leaderboard results yet.