SOTAVerified

Automatic Speech Recognition

Papers

Showing 351-400 of 3174 papers

Title | Status | Hype
LIP-RTVE: An Audiovisual Database for Continuous Spanish in the Wild | Code | 0
Let SSMs be ConvNets: State-space Modeling with Optimal Tensor Contractions | Code | 0
Analysis of EEG frequency bands for Envisioned Speech Recognition | Code | 0
Leveraging Broadcast Media Subtitle Transcripts for Automatic Speech Recognition and Subtitling | Code | 0
Learning from Past Mistakes: Improving Automatic Speech Recognition Output via Noisy-Clean Phrase Context Modeling | Code | 0
Learning to adapt: a meta-learning approach for speaker adaptation | Code | 0
DistriBlock: Identifying adversarial audio samples by leveraging characteristics of the output distribution | Code | 0
LASER: Learning by Aligning Self-supervised Representations of Speech for Improving Content-related Tasks | Code | 0
Large-Scale End-to-End Multilingual Speech Recognition and Language Identification with Multi-Task Learning | Code | 0
Language-Universal Adapter Learning with Knowledge Distillation for End-to-End Multilingual Speech Recognition | Code | 0
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization | Code | 0
A comparative analysis between Conformer-Transducer, Whisper, and wav2vec2 for improving the child speech recognition | Code | 0
Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training | Code | 0
A Method to Reveal Speaker Identity in Distributed ASR Training, and How to Counter It | Code | 0
Kurdish (Sorani) Speech to Text: Presenting an Experimental Dataset | Code | 0
Key Frame Mechanism For Efficient Conformer Based End-to-end Speech Recognition | Code | 0
AdaCS: Adaptive Normalization for Enhanced Code-Switching ASR | Code | 0
Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information | Code | 0
Joint Automatic Speech Recognition And Structure Learning For Better Speech Understanding | Code | 0
Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn't | Code | 0
Iterative pseudo-forced alignment by acoustic CTC loss for self-supervised ASR domain adaptation | Code | 0
Investigating the Effects of Word Substitution Errors on Sentence Embeddings | Code | 0
Iterative Pseudo-Labeling for Speech Recognition | Code | 0
Attention-based Multi-hypothesis Fusion for Speech Summarization | Code | 0
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition | Code | 0
Intrinsic evaluation of language models for code-switching | Code | 0
Attentional Speech Recognition Models Misbehave on Out-of-domain Utterances | Code | 0
Improving Voice Separation by Incorporating End-to-end Speech Recognition | Code | 0
Improving LSTM-CTC based ASR performance in domains with limited training data | Code | 0
A Theory of Unsupervised Speech Recognition | Code | 0
Improving RNN Transducer Modeling for End-to-End Speech Recognition | Code | 0
Language Identification Using Deep Convolutional Recurrent Neural Networks | Code | 0
NeMo Inverse Text Normalization: From Development To Production | Code | 0
Imperceptible, Robust, and Targeted Adversarial Examples for Automatic Speech Recognition | Code | 0
HydraFormer: One Encoder For All Subsampling Rates | Code | 0
Hybrid ASR for Resource-Constrained Robots: HMM - Deep Learning Fusion | Code | 0
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism | Code | 0
HuBERT-EE: Early Exiting HuBERT for Efficient Speech Recognition | Code | 0
A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Code | 0
Human Transcription Quality Improvement | Code | 0
Hybrid phonetic-neural model for correction in speech recognition systems | Code | 0
How Phonotactics Affect Multilingual and Zero-shot ASR Performance | Code | 0
Guided Source Separation Meets a Strong ASR Backend: Hitachi/Paderborn University Joint Investigation for Dinner Party ASR | Code | 0
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Code | 0
Guiding Frame-Level CTC Alignments Using Self-knowledge Distillation | Code | 0
Graph Neural Networks for Contextual ASR with the Tree-Constrained Pointer Generator | Code | 0
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Code | 0
Attentively Embracing Noise for Robust Latent Representation in BERT | Code | 0
Growing Trees on Sounds: Assessing Strategies for End-to-End Dependency Parsing of Speech | Code | 0
How You Say It Matters: Measuring the Impact of Verbal Disfluency Tags on Automated Dementia Detection | Code | 0

No leaderboard results yet.