CB-Conformer: Contextual biasing Conformer for biased word recognition Apr 19, 2023 Automatic Speech Recognition Language Modeling
Code Code Available 1Security and Privacy Problems in Voice Assistant Applications: A Survey Apr 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dynamic Chunk Convolution for Unified Streaming and Non-Streaming Conformer ASR Apr 18, 2023 speech-recognition Speech Recognition
— Unverified 0Approximate Nearest Neighbour Phrase Mining for Contextual Speech Recognition Apr 18, 2023 speech-recognition Speech Recognition
— Unverified 0Towards the Transferable Audio Adversarial Attack via Ensemble Methods Apr 18, 2023 Adversarial Attack Autonomous Driving
— Unverified 0Political corpus creation through automatic speech recognition on EU debates Apr 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Multimodal Short Video Rumor Detection System Based on Contrastive Learning Apr 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers Apr 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Evaluation of Speaker Anonymization on Emotional Speech Apr 15, 2023 Automatic Speech Recognition Emotion Recognition
— Unverified 0A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition Apr 15, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10 Apr 14, 2023 Automatic Speech Recognition Data Augmentation
— Unverified 0Solving Tensor Low Cycle Rank Approximation Apr 13, 2023 speech-recognition Speech Recognition
— Unverified 0Efficient Sequence Transduction by Jointly Predicting Tokens and Durations Apr 13, 2023 Intent Classification Intent Classification and Slot Filling
Code Code Available 1Regularizing Contrastive Predictive Coding for Speech Applications Apr 12, 2023 Acoustic Unit Discovery Automatic Speech Recognition
— Unverified 0Acoustic absement in detail: Quantifying acoustic differences across time-series representations of speech data Apr 12, 2023 Dynamic Time Warping speech-recognition
Code Code Available 0Speech Reconstruction from Silent Tongue and Lip Articulation By Pseudo Target Generation and Domain Adversarial Training Apr 12, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition Apr 11, 2023 Decoder speech-recognition
— Unverified 0Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR Apr 11, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence Apr 10, 2023 Benchmarking speech-recognition
Code Code Available 0Adaptive Feature Fusion: Enhancing Generalization in Deep Learning Models Apr 4, 2023 Deep Learning speech-recognition
— Unverified 0Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data Apr 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Self-Supervised Learning-Based Source Separation for Meeting Data Apr 3, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition Apr 3, 2023 speech-recognition Speech Recognition
— Unverified 0Multilingual Word Error Rate Estimation: e-WER3 Apr 2, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R Mar 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Lego-Features: Exporting modular encoder features for streaming and deliberation ASR Mar 31, 2023 Decoder speech-recognition
— Unverified 0The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR Mar 31, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Dialog act guided contextual adapter for personalized speech recognition Mar 31, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers Mar 30, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision Mar 30, 2023 Lip Reading speech-recognition
— Unverified 0AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR Mar 29, 2023 Automatic Speech Recognition Domain Adaptation
— Unverified 0Joint unsupervised and supervised learning for context-aware language identification Mar 29, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0When Good and Reproducible Results are a Giant with Feet of Clay: The Importance of Software Quality in NLP Mar 28, 2023 Automatic Speech Recognition speech-recognition
Code Code Available 1Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis Mar 27, 2023 All Automatic Speech Recognition
— Unverified 0Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels Mar 25, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
Code Code Available 2A Deliberation-based Joint Acoustic and Text Decoder Mar 23, 2023 Decoder speech-recognition
— Unverified 0Enhancing Unsupervised Speech Recognition with Diffusion GANs Mar 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition Mar 23, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks Mar 23, 2023 Sentiment Analysis Sentiment Classification
— Unverified 0Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition Mar 23, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Exploring Turkish Speech Recognition via Hybrid CTC/Attention Architecture and Multi-feature Fusion Network Mar 22, 2023 Model Compression speech-recognition
— Unverified 0Self-supervised Learning with Speech Modulation Dropout Mar 22, 2023 Automatic Speech Recognition Self-Supervised Learning
— Unverified 0CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning Mar 22, 2023 Contrastive Learning Retrieval
— Unverified 0End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations Mar 21, 2023 Action Detection Activity Detection
— Unverified 0Transformers in Speech Processing: A Survey Mar 21, 2023 Automatic Speech Recognition Speech Enhancement
— Unverified 0Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition Mar 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Code-Switching Text Generation and Injection in Mandarin-English ASR Mar 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0On-the-fly Text Retrieval for End-to-End ASR Adaptation Mar 20, 2023 Language Modeling Language Modelling
— Unverified 0A Deep Learning System for Domain-specific Speech Recognition Mar 18, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Visual Information Matters for ASR Error Correction Mar 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0