SOTAVerified

Automatic Speech Recognition

Papers

Showing 901950 of 3174 papers

TitleStatusHype
Convoifilter: A case study of doing cocktail party speech recognition0
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model0
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition0
Indonesian Automatic Speech Recognition with XLSR-530
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction0
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals0
Accurate synthesis of Dysarthric Speech for ASR data augmentation0
Improving CTC-AED model with integrated-CTC and auxiliary loss regularization0
End-to-End Open Vocabulary Keyword Search With Multilingual Neural RepresentationsCode0
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models0
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion EncoderCode1
Using Text Injection to Improve Recognition of Personal Identifiers in Speech0
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for ConversationsCode0
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition0
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss0
A Novel Self-training Approach for Low-resource Speech Recognition0
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel AudioCode0
Comparative Analysis of the wav2vec 2.0 Feature Extractor0
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data GenerationCode1
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism0
ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging0
Federated Representation Learning for Automatic Speech Recognition0
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification0
Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text0
ÌròyìnSpeech: A multi-purpose Yorùbá Speech CorpusCode1
Learning Multi-modal Representations by Watching Hundreds of Surgical Video LecturesCode1
Cascaded Cross-Modal Transformer for Request and Complaint Detection0
CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition0
On-Device Speaker Anonymization of Acoustic Embeddings for ASR based onFlexible Location Gradient Reversal Layer0
Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training0
Adaptation of Whisper models to child speech recognitionCode1
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision QuantizationCode0
Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNNCode0
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition0
Boosting Punctuation Restoration with Data Generation and Reinforcement Learning0
Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation0
A meta learning scheme for fast accent domain expansion in Mandarin speech recognition0
Prompting Large Language Models with Speech Recognition Abilities0
A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality ConversionCode0
Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic InformationCode0
A Deep Dive into the Disparity of Word Error Rates Across Thousands of NPTEL MOOC VideosCode0
OxfordVGG Submission to the EGO4D AV Transcription ChallengeCode6
Model Adaptation for ASR in low-resource Indian Languages0
Ed-Fed: A generic federated learning framework with resource-aware client selection for edge devices0
Replay to Remember: Continual Layer-Specific Fine-tuning for German Speech Recognition0
Representation Learning With Hidden Unit Clustering For Low Resource Speech Applications0
Exploring the Integration of Large Language Models into Automatic Speech Recognition Systems: An Empirical Study0
Writer adaptation for offline text recognition: An exploration of neural network-based methodsCode0
Speech Diarization and ASR with GMM0
Token-Level Serialized Output Training for Joint Streaming ASR and ST Leveraging Textual Alignments0
Show:102550
← PrevPage 19 of 64Next →

No leaderboard results yet.