SOTAVerified

Automatic Speech Recognition

Papers

Showing 2351–2400 of 3174 papers

| Title | Status | Hype |
| --- | --- | --- |
| Training Speech Enhancement Systems with Noisy Speech Datasets | | 0 |
| Training variance and performance evaluation of neural networks in speech | | 0 |
| Train your classifier first: Cascade Neural Networks Training from upper layers to lower layers | | 0 |
| Transcending Controlled Environments: Assessing the Transferability of ASR-Robust NLU Models to Real-World Applications | | 0 |
| TranscRater: a Tool for Automatic Speech Recognition Quality Estimation | | 0 |
| Transcribe, Align and Segment: Creating speech datasets for low-resource languages | | 0 |
| Transcribe-to-Diarize: Neural Speaker Diarization for Unlimited Number of Speakers using End-to-End Speaker-Attributed ASR | | 0 |
| Transcribing and Translating, Fast and Slow: Joint Speech Translation and Recognition | | 0 |
| Transcribing Educational Videos Using Whisper: A preliminary study on using AI for transcribing educational videos | | 0 |
| Transcription-Free Fine-Tuning of Speech Separation Models for Noisy and Reverberant Multi-Speaker Automatic Speech Recognition | | 0 |
| Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation | | 0 |
| Transducer-Llama: Integrating LLMs into Streamable Transducer-based Speech Recognition | | 0 |
| Transducers with Pronunciation-aware Embeddings for Automatic Speech Recognition | | 0 |
| Transferable Adversarial Attacks against ASR | | 0 |
| Transferable and Configurable Audio Adversarial Attack from Low-Level Features | | 0 |
| Transfer Learning Approaches for Streaming End-to-End Speech Recognition System | | 0 |
| Transfer Learning-Based Deep Residual Learning for Speech Recognition in Clean and Noisy Environments | | 0 |
| Transfer Learning for British Sign Language Modelling | | 0 |
| Transfer Learning for Less-Resourced Semitic Languages Speech Recognition: the Case of Amharic | | 0 |
| Transfer Learning for Robust Low-Resource Children's Speech ASR with Transformers and Source-Filter Warping | | 0 |
| Transfer Learning from Adult to Children for Speech Recognition: Evaluation, Analysis and Recommendations | | 0 |
| Transfer Learning from Pre-trained Language Models Improves End-to-End Speech Summarization | | 0 |
| Transfer Learning from Whisper for Microscopic Intelligibility Prediction | | 0 |
| Transferring Knowledge from a RNN to a DNN | | 0 |
| Transformer ASR with Contextual Block Processing | | 0 |
| Transformer-based ASR Incorporating Time-reduction Layer and Fine-tuning with Self-Knowledge Distillation | | 0 |
| Transformer-based Automatic Speech Recognition of Formal and Colloquial Czech in MALACH Project | | 0 |
| Transformer-based Model for ASR N-Best Rescoring and Rewriting | | 0 |
| Transformer-based Online CTC/attention End-to-End Speech Recognition Architecture | | 0 |
| Transformer-based Streaming ASR with Cumulative Attention | | 0 |
| Transformer-Based Video Front-Ends for Audio-Visual Speech Recognition for Single and Multi-Person Video | | 0 |
| Transformers in Speech Processing: A Survey | | 0 |
| Transformer-Transducers for Code-Switched Speech Recognition | | 0 |
| Transformer with Bidirectional Decoder for Speech Recognition | | 0 |
| Transforming NLU with Babylon: A Case Study in Development of Real-time, Edge-Efficient, Multi-Intent Translation System for Automated Drive-Thru Ordering | | 0 |
| Transliterated Zero-Shot Domain Adaptation for Automatic Speech Recognition | | 0 |
| TranUSR: Phoneme-to-word Transcoder Based Unified Speech Representation Learning for Cross-lingual Speech Recognition | | 0 |
| Tree-constrained Pointer Generator for End-to-end Contextual Speech Recognition | | 0 |
| Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition | | 0 |
| Tropical Modeling of Weighted Transducer Algorithms on Graphs | | 0 |
| TRScore: A Novel GPT-based Readability Scorer for ASR Segmentation and Punctuation model evaluation and selection | | 0 |
| Trustera: A Live Conversation Redaction System | | 0 |
| t-SOT FNT: Streaming Multi-talker ASR with Text-only Domain Adaptation Capability | | 0 |
| TTS Skins: Speaker Conversion via ASR | | 0 |
| TUKE-BNews-SK: Slovak Broadcast News Corpus Construction and Evaluation | | 0 |
| Tutorial Proposal: End-to-End Speech Translation | | 0 |
| Two Front-Ends, One Model: Fusing Heterogeneous Speech Features for Low Resource ASR with Multilingual Pre-Training | | 0 |
| Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems | | 0 |
| Two-Stage Augmentation and Adaptive CTC Fusion for Improved Robustness of Multi-Stream End-to-End ASR | | 0 |
| Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews | | 0 |

No leaderboard results yet.