SOTAVerified

Automatic Speech Recognition

Papers

Showing 301–350 of 3174 papers

| Title | Status | Hype |
|---|---|---|
| Mamba for Streaming ASR Combined with Unimodal Aggregation | Code | 1 |
| Confidence Estimation for Attention-based Sequence-to-sequence Models for Speech Recognition | Code | 1 |
| Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models | Code | 1 |
| When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP | Code | 1 |
| Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition | Code | 1 |
| ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context | Code | 1 |
| CORAA: a large corpus of spontaneous and prepared speech manually validated for speech recognition in Brazilian Portuguese | Code | 1 |
| Continuous speech separation: dataset and analysis | Code | 1 |
| Controlling Whisper: Universal Acoustic Adversarial Attacks to Control Speech Foundation Models | Code | 1 |
| A Reference-less Quality Metric for Automatic Speech Recognition via Contrastive-Learning of a Multi-Language Model with Self-Supervision | Code | 1 |
| A context-aware knowledge transferring strategy for CTC-based ASR | Code | 1 |
| Cross-modal information fusion for voice spoofing detection | Code | 1 |
| Cross Attention Augmented Transducer Networks for Simultaneous Translation | Code | 1 |
| Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition | Code | 1 |
| Towards Improved Room Impulse Response Estimation for Speech Recognition | Code | 1 |
| CTC-synchronous Training for Monotonic Attention Model | Code | 1 |
| A Comparison of Adaptation Techniques and Recurrent Neural Network Architectures | Code | 0 |
| Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions | Code | 0 |
| A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC Videos | Code | 0 |
| Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling | Code | 0 |
| Addressing Pitfalls in Auditing Practices of Automatic Speech Recognition Technologies: A Case Study of People with Aphasia | Code | 0 |
| Learning to adapt: a meta-learning approach for speaker adaptation | Code | 0 |
| A Comparative Study on Transformer vs RNN in Speech Applications | Code | 0 |
| A Dataset for Speech Emotion Recognition in Greek Theatrical Plays | Code | 0 |
| Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling | Code | 0 |
| DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | Code | 0 |
| An Automatic Speech Recognition System for Bengali Language based on Wav2Vec2 and Transfer Learning | Code | 0 |
| Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning | Code | 0 |
| LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Code | 0 |
| Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition | Code | 0 |
| Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Code | 0 |
| Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Code | 0 |
| Analyzing the impact of speaker localization errors on speech separation for automatic speech recognition | Code | 0 |
| Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition | Code | 0 |
| Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Code | 0 |
| Adapting the adapters for code-switching in multilingual ASR | Code | 0 |
| Analyzing Robustness of End-to-End Neural Models for Automatic Speech Recognition | Code | 0 |
| Language Identification Using Deep Convolutional Recurrent Neural Networks | Code | 0 |
| Analyzing Phonetic and Graphemic Representations in End-to-End Automatic Speech Recognition | Code | 0 |
| Analyzing Hidden Representations in End-to-End Automatic Speech Recognition Systems | Code | 0 |
| Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation | Code | 0 |
| Iterative Pseudo-Labeling for Speech Recognition | Code | 0 |
| Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Code | 0 |
| Advancing African-Accented Speech Recognition: Epistemic Uncertainty-Driven Data Selection for Generalizable ASR Models | Code | 0 |
| Intrinsic evaluation of language models for code-switching | Code | 0 |
| Investigating the Effects of Word Substitution Errors on Sentence Embeddings | Code | 0 |
| Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training | Code | 0 |
| Analysis of EEG frequency bands for Envisioned Speech Recognition | Code | 0 |
| Improving Voice Separation by Incorporating End-to-end Speech Recognition | Code | 0 |
| Improving RNN Transducer Modeling for End-to-End Speech Recognition | Code | 0 |
Page 7 of 64

No leaderboard results yet.