Title | Date | Tasks | Status
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification | Aug 22, 2023 | Self-Supervised Learning, Speaker Identification | Code Available
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model | Aug 22, 2023 | Automatic Speech Recognition, speech-recognition | Unverified
Convoifilter: A case study of doing cocktail party speech recognition | Aug 22, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Improving Continuous Sign Language Recognition with Cross-Lingual Signs | Aug 21, 2023 | Sign Language Recognition, speech-recognition | Unverified
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition | Aug 21, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Indonesian Automatic Speech Recognition with XLSR-53 | Aug 20, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Another Point of View on Visual Speech Recognition | Aug 20, 2023 | Landmark-based Lipreading, speech-recognition | Unverified
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | Aug 19, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals | Aug 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Accurate synthesis of Dysarthric Speech for ASR data augmentation | Aug 16, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Improving CTC-AED model with integrated-CTC and auxiliary loss regularization | Aug 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model | Aug 15, 2023 | Quantization, speech-recognition | Unverified
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations | Aug 15, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available
Using Text Injection to Improve Recognition of Personal Identifiers in Speech | Aug 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
O-1: Self-training with Oracle and 1-best Hypothesis | Aug 14, 2023 | speech-recognition, Speech Recognition | Unverified
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models | Aug 14, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Cross-Attribute Matrix Factorization Model with Shared User Embedding | Aug 14, 2023 | Attribute, Collaborative Filtering | Unverified
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action Detection, Activity Detection | Code Available
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | Aug 12, 2023 | Automatic Speech Recognition, speech-recognition | Unverified
Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping | Aug 11, 2023 | Lip Reading, speech-recognition | Unverified
Improving Joint Speech-Text Representations Without Alignment | Aug 11, 2023 | Speech Recognition | Unverified
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | Aug 11, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
A Novel Self-training Approach for Low-resource Speech Recognition | Aug 10, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Aug 9, 2023 | Automatic Speech Recognition, speech-recognition | Code Available
TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks | Aug 9, 2023 | speech-recognition, Speech Recognition | Unverified
Unsupervised Out-of-Distribution Dialect Detection with Mahalanobis Distance | Aug 9, 2023 | Classification, Dialect Identification | Unverified
A Novel Method for improving accuracy in neural network by reinstating traditional back propagation technique | Aug 9, 2023 | speech-recognition, Speech Recognition | Unverified
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks | Aug 9, 2023 | Classification, image-classification | Unverified
On Monotonic Aggregation for Open-domain QA | Aug 8, 2023 | Language Modeling, Language Modelling | Code Available
Comparative Analysis of the wav2vec 2.0 Feature Extractor | Aug 8, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Dialogue Systems Can Generate Appropriate Responses without the Use of Question Marks? -- Investigation of the Effects of Question Marks on Dialogue Systems | Aug 7, 2023 | Sentence, speech-recognition | Unverified
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Aug 7, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems | Aug 6, 2023 | Decision Making, Management | Unverified
ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging | Aug 5, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Speaker Diarization of Scripted Audiovisual Content | Aug 4, 2023 | speaker-diarization, Speaker Diarization | Unverified
Federated Representation Learning for Automatic Speech Recognition | Aug 3, 2023 | Automatic Speech Recognition, Federated Learning | Unverified
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification | Aug 2, 2023 | Automatic Speech Recognition, Decoder | Unverified
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings | Jul 31, 2023 | Grapheme-to-Phoneme Conversion, speech-recognition | Unverified
Mispronunciation detection using self-supervised speech representations | Jul 30, 2023 | Self-Supervised Learning, speech-recognition | Code Available
Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | Jul 30, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified
UniBriVL: Robust Universal Representation and Generation of Audio Driven Diffusion Models | Jul 29, 2023 | Representation Learning, speech-recognition | Unverified
The timing bottleneck: Why timing and overlap are mission-critical for conversational user interfaces, speech recognition and dialogue systems | Jul 28, 2023 | Intent Recognition, speech-recognition | Unverified
Cascaded Cross-Modal Transformer for Request and Complaint Detection | Jul 27, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
CIF-T: A Novel CIF-based Transducer Architecture for Automatic Speech Recognition | Jul 26, 2023 | Automatic Speech Recognition, speech-recognition | Unverified
On-Device Speaker Anonymization of Acoustic Embeddings for ASR based on Flexible Location Gradient Reversal Layer | Jul 25, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
Robust Automatic Speech Recognition via WavAugment Guided Phoneme Adversarial Training | Jul 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified
A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization | Jul 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available
Integration of Frame- and Label-synchronous Beam Search for Streaming Encoder-decoder Speech Recognition | Jul 24, 2023 | Automatic Speech Recognition, Decoder | Unverified
Code-Switched Urdu ASR for Noisy Telephonic Environment using Data Centric Approach with Hybrid HMM and CNN-TDNN | Jul 24, 2023 | Automatic Speech Recognition, Sentiment Analysis | Code Available
Boosting Punctuation Restoration with Data Generation and Reinforcement Learning | Jul 24, 2023 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified