SOTAVerified

Automatic Speech Recognition

Papers

Showing 1101-1150 of 3174 papers

| Title | Status | Hype |
|---|---|---|
| Perceptual and Task-Oriented Assessment of a Semantic Metric for ASR Evaluation | Code | 0 |
| Multiple Representation Transfer from Large Language Models to End-to-End ASR Systems | | 0 |
| TODM: Train Once Deploy Many Efficient Supernet-Based RNN-T Compression For On-device ASR Models | | 0 |
| Bring the Noise: Introducing Noise Robustness to Pretrained Automatic Speech Recognition | | 0 |
| AVATAR: Robust Voice Search Engine Leveraging Autoregressive Document Retrieval and Contrastive Learning | | 0 |
| Text-Only Domain Adaptation for End-to-End Speech Recognition through Down-Sampling Acoustic Representation | | 0 |
| Learning Speech Representation From Contrastive Token-Acoustic Pretraining | | 0 |
| Contextual Biasing of Named-Entities with Large Language Models | | 0 |
| Knowledge Distillation from Non-streaming to Streaming ASR Encoder using Auxiliary Non-streaming Layer | | 0 |
| ASTER: Automatic Speech Recognition System Accessibility Testing for Stutterers | | 0 |
| Adapting Text-based Dialogue State Tracker for Spoken Dialogues | | 0 |
| Neural approaches to spoken content embedding | | 0 |
| Unsupervised Active Learning: Optimizing Labeling Cost-Effectiveness for Automatic Speech Recognition | | 0 |
| Decoupled Structure for Improved Adaptability of End-to-End Models | | 0 |
| A Small and Fast BERT for Chinese Medical Punctuation Restoration | Code | 0 |
| Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model | | 0 |
| Convoifilter: A case study of doing cocktail party speech recognition | | 0 |
| TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition | | 0 |
| Indonesian Automatic Speech Recognition with XLSR-53 | | 0 |
| Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | | 0 |
| Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals | | 0 |
| Accurate synthesis of Dysarthric Speech for ASR data augmentation | | 0 |
| Improving CTC-AED model with integrated-CTC and auxiliary loss regularization | | 0 |
| End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations | Code | 0 |
| Using Text Injection to Improve Recognition of Personal Identifiers in Speech | | 0 |
| Text Injection for Capitalization and Turn-Taking Prediction in Speech Models | | 0 |
| Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Code | 0 |
| Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | | 0 |
| Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | | 0 |
| A Novel Self-training Approach for Low-resource Speech Recognition | | 0 |
| Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Code | 0 |
| Comparative Analysis of the wav2vec 2.0 Feature Extractor | | 0 |
| Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | | 0 |
| ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging | | 0 |
| Federated Representation Learning for Automatic Speech Recognition | | 0 |
| Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification | | 0 |
| Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | | 0 |
| Cascaded Cross-Modal Transformer for Request and Complaint Detection | | 0 |
| CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition | | 0 |
| On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer | | 0 |
| Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition | | 0 |
| A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization | Code | 0 |
| Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training | | 0 |
| Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN | Code | 0 |
| Boosting Punctuation Restoration with Data Generation and Reinforcement Learning | | 0 |
| Exploring the Integration of Speech Separation and Recognition with Self-Supervised Learning Representation | | 0 |
| A meta learning scheme for fast accent domain expansion in Mandarin speech recognition | | 0 |
| Topic Identification For Spontaneous Speech: Enriching Audio Features With Embedded Linguistic Information | Code | 0 |
| Prompting Large Language Models with Speech Recognition Abilities | | 0 |
| A Change of Heart: Improving Speech Emotion Recognition through Speech-to-Text Modality Conversion | Code | 0 |
Page 23 of 64

No leaderboard results yet.