Neural approaches to spoken content embedding | Aug 28, 2023 | Automatic Speech Recognition, Dynamic Time Warping | Unverified | 0
The USTC-NERCSLIP Systems for the CHiME-7 DASR Challenge | Aug 28, 2023 | Speaker Diarization | Unverified | 0
Effect of Attention and Self-Supervised Speech Embeddings on Non-Semantic Speech Tasks | Aug 28, 2023 | Speech Recognition | Code Available | 0
Decoupled Structure for Improved Adaptability of End-to-End Models | Aug 25, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
A Small and Fast BERT for Chinese Medical Punctuation Restoration | Aug 24, 2023 | Automatic Speech Recognition (ASR) | Code Available | 0
Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion | Aug 24, 2023 | Audio Classification, Binary Classification | Unverified | 0
AdVerb: Visually Guided Audio Dereverberation | Aug 23, 2023 | Speaker Verification, Speech Enhancement | Unverified | 0
KinSPEAK: Improving speech recognition for Kinyarwanda via semi-supervised learning methods | Aug 23, 2023 | Robust Speech Recognition, Speech Recognition | Unverified | 0
Convoifilter: A case study of doing cocktail party speech recognition | Aug 22, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Identifying depression-related topics in smartphone-collected free-response speech recordings using an automatic speech recognition system and a deep learning topic model | Aug 22, 2023 | Automatic Speech Recognition, Speech Recognition | Unverified | 0
An Effective Transformer-based Contextual Model and Temporal Gate Pooling for Speaker Identification | Aug 22, 2023 | Self-Supervised Learning, Speaker Identification | Code Available | 0
TokenSplit: Using Discrete Speech Representations for Direct, Refined, and Transcript-Conditioned Speech Separation and Recognition | Aug 21, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Improving Continuous Sign Language Recognition with Cross-Lingual Signs | Aug 21, 2023 | Sign Language Recognition, Speech Recognition | Unverified | 0
Another Point of View on Visual Speech Recognition | Aug 20, 2023 | Landmark-based Lipreading, Speech Recognition | Unverified | 0
Indonesian Automatic Speech Recognition with XLSR-53 | Aug 20, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Bayes Risk Transducer: Transducer with Controllable Alignment Prediction | Aug 19, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Radio2Text: Streaming Speech Recognition Using mmWave Radio Signals | Aug 16, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Accurate synthesis of Dysarthric Speech for ASR data augmentation | Aug 16, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
End-to-End Open Vocabulary Keyword Search With Multilingual Neural Representations | Aug 15, 2023 | Automatic Speech Recognition (ASR) | Code Available | 0
Improving CTC-AED model with integrated-CTC and auxiliary loss regularization | Aug 15, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model | Aug 15, 2023 | Quantization, Speech Recognition | Unverified | 0
O-1: Self-training with Oracle and 1-best Hypothesis | Aug 14, 2023 | Speech Recognition | Unverified | 0
Using Text Injection to Improve Recognition of Personal Identifiers in Speech | Aug 14, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder | Aug 14, 2023 | Audio-Visual Speech Recognition, Automatic Speech Recognition | Code Available | 1
Text Injection for Capitalization and Turn-Taking Prediction in Speech Models | Aug 14, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Integrating Emotion Recognition with Speech Recognition and Speaker Diarisation for Conversations | Aug 14, 2023 | Action Detection, Activity Detection | Code Available | 0
Cross-Attribute Matrix Factorization Model with Shared User Embedding | Aug 14, 2023 | Attribute, Collaborative Filtering | Unverified | 0
Alternative Pseudo-Labeling for Semi-Supervised Automatic Speech Recognition | Aug 12, 2023 | Automatic Speech Recognition, Speech Recognition | Unverified | 0
Bilingual Streaming ASR with Grapheme units and Auxiliary Monolingual Loss | Aug 11, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping | Aug 11, 2023 | Lip Reading, Speech Recognition | Unverified | 0
Improving Joint Speech-Text Representations Without Alignment | Aug 11, 2023 | Speech Recognition | Unverified | 0
A Novel Self-training Approach for Low-resource Speech Recognition | Aug 10, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Conformer-based Target-Speaker Automatic Speech Recognition for Single-Channel Audio | Aug 9, 2023 | Automatic Speech Recognition, Speech Recognition | Code Available | 0
FPGA Resource-aware Structured Pruning for Real-Time Neural Networks | Aug 9, 2023 | Classification, Image Classification | Unverified | 0
TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks | Aug 9, 2023 | Speech Recognition | Unverified | 0
Unsupervised Out-of-Distribution Dialect Detection with Mahalanobis Distance | Aug 9, 2023 | Classification, Dialect Identification | Unverified | 0
A Novel Method for improving accuracy in neural network by reinstating traditional back propagation technique | Aug 9, 2023 | Speech Recognition | Unverified | 0
OmniDataComposer: A Unified Data Structure for Multimodal Data Fusion and Infinite Data Generation | Aug 8, 2023 | Automatic Speech Recognition (ASR) | Code Available | 1
Comparative Analysis of the wav2vec 2.0 Feature Extractor | Aug 8, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
On Monotonic Aggregation for Open-domain QA | Aug 8, 2023 | Language Modeling | Code Available | 0
Dialogue Systems Can Generate Appropriate Responses without the Use of Question Marks? -- Investigation of the Effects of Question Marks on Dialogue Systems | Aug 7, 2023 | Sentence, Speech Recognition | Unverified | 0
Boosting Chinese ASR Error Correction with Dynamic Error Scaling Mechanism | Aug 7, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
A Critical Review of Physics-Informed Machine Learning Applications in Subsurface Energy Systems | Aug 6, 2023 | Decision Making, Management | Unverified | 0
ApproBiVT: Lead ASR Models to Generalize Better Using Approximated Bias-Variance Tradeoff Guided Early Stopping and Checkpoint Averaging | Aug 5, 2023 | Automatic Speech Recognition (ASR) | Unverified | 0
Speaker Diarization of Scripted Audiovisual Content | Aug 4, 2023 | Speaker Diarization | Unverified | 0
Federated Representation Learning for Automatic Speech Recognition | Aug 3, 2023 | Automatic Speech Recognition, Federated Learning | Unverified | 0
Careful Whisper -- leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification | Aug 2, 2023 | Automatic Speech Recognition, Decoder | Unverified | 0
Improving grapheme-to-phoneme conversion by learning pronunciations from speech recordings | Jul 31, 2023 | Grapheme-to-Phoneme Conversion, Speech Recognition | Unverified | 0
Mispronunciation detection using self-supervised speech representations | Jul 30, 2023 | Self-Supervised Learning, Speech Recognition | Code Available | 0
Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text | Jul 30, 2023 | Automatic Speech Recognition, Data Augmentation | Unverified | 0