Towards the Transferable Audio Adversarial Attack via Ensemble Methods | Apr 18, 2023 | Tags: Adversarial Attack, Autonomous Driving | Status: Unverified
Multimodal Short Video Rumor Detection System Based on Contrastive Learning | Apr 17, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Political corpus creation through automatic speech recognition on EU debates | Apr 17, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Code Available
A Virtual Simulation-Pilot Agent for Training of Air Traffic Controllers | Apr 16, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Evaluation of Speaker Anonymization on Emotional Speech | Apr 15, 2023 | Tags: Automatic Speech Recognition, Emotion Recognition | Status: Unverified
A CTC Alignment-based Non-autoregressive Transformer for End-to-end Automatic Speech Recognition | Apr 15, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Task-oriented Document-Grounded Dialog Systems by HLTPR@RWTH for DSTC9 and DSTC10 | Apr 14, 2023 | Tags: Automatic Speech Recognition, Data Augmentation | Status: Unverified
Solving Tensor Low Cycle Rank Approximation | Apr 13, 2023 | Tags: speech-recognition, Speech Recognition | Status: Unverified
Speech Reconstruction from Silent Tongue and Lip Articulation By Pseudo Target Generation and Domain Adversarial Training | Apr 12, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Regularizing Contrastive Predictive Coding for Speech Applications | Apr 12, 2023 | Tags: Acoustic Unit Discovery, Automatic Speech Recognition | Status: Unverified
Acoustic absement in detail: Quantifying acoustic differences across time-series representations of speech data | Apr 12, 2023 | Tags: Dynamic Time Warping, speech-recognition | Status: Code Available
Wav2code: Restore Clean Speech Representations via Codebook Lookup for Noise-Robust ASR | Apr 11, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Sim-T: Simplify the Transformer Network by Multiplexing Technique for Speech Recognition | Apr 11, 2023 | Tags: Decoder, speech-recognition | Status: Unverified
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable Confidence | Apr 10, 2023 | Tags: Benchmarking, speech-recognition | Status: Code Available
Adaptive Feature Fusion: Enhancing Generalization in Deep Learning Models | Apr 4, 2023 | Tags: Deep Learning, speech-recognition | Status: Unverified
Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data | Apr 4, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Dual-Attention Neural Transducers for Efficient Wake Word Spotting in Speech Recognition | Apr 3, 2023 | Tags: speech-recognition, Speech Recognition | Status: Unverified
Self-Supervised Learning-Based Source Separation for Meeting Data | Apr 3, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Multilingual Word Error Rate Estimation: e-WER3 | Apr 2, 2023 | Tags: Automatic Speech Recognition, speech-recognition | Status: Unverified
The Edinburgh International Accents of English Corpus: Towards the Democratization of English ASR | Mar 31, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Lego-Features: Exporting modular encoder features for streaming and deliberation ASR | Mar 31, 2023 | Tags: Decoder, speech-recognition | Status: Unverified
Dialog act guided contextual adapter for personalized speech recognition | Mar 31, 2023 | Tags: Automatic Speech Recognition, speech-recognition | Status: Unverified
Improving the previous state-of-the-art Frisian ASR by fine-tuning XLS-R | Mar 31, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision | Mar 30, 2023 | Tags: Lip Reading, speech-recognition | Status: Unverified
PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers | Mar 30, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR | Mar 29, 2023 | Tags: Automatic Speech Recognition, Domain Adaptation | Status: Unverified
Joint unsupervised and supervised learning for context-aware language identification | Mar 29, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Text is All You Need: Personalizing ASR Models using Controllable Speech Synthesis | Mar 27, 2023 | Tags: All, Automatic Speech Recognition | Status: Unverified
MSAT: Biologically Inspired Multi-Stage Adaptive Threshold for Conversion of Spiking Neural Networks | Mar 23, 2023 | Tags: Sentiment Analysis, Sentiment Classification | Status: Unverified
Enhancing Unsupervised Speech Recognition with Diffusion GANs | Mar 23, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Pyramid Multi-branch Fusion DCNN with Multi-Head Self-Attention for Mandarin Speech Recognition | Mar 23, 2023 | Tags: Automatic Speech Recognition, speech-recognition | Status: Unverified
Beyond Universal Transformer: block reusing with adaptor in Transformer for automatic speech recognition | Mar 23, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
A Deliberation-based Joint Acoustic and Text Decoder | Mar 23, 2023 | Tags: Decoder, speech-recognition | Status: Unverified
Self-supervised Learning with Speech Modulation Dropout | Mar 22, 2023 | Tags: Automatic Speech Recognition, Self-Supervised Learning | Status: Unverified
CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning | Mar 22, 2023 | Tags: Contrastive Learning, Retrieval | Status: Unverified
Exploring Turkish Speech Recognition via Hybrid CTC/Attention Architecture and Multi-feature Fusion Network | Mar 22, 2023 | Tags: Model Compression, speech-recognition | Status: Unverified
Transformers in Speech Processing: A Survey | Mar 21, 2023 | Tags: Automatic Speech Recognition, Speech Enhancement | Status: Unverified
End-to-End Integration of Speech Separation and Voice Activity Detection for Low-Latency Diarization of Telephone Conversations | Mar 21, 2023 | Tags: Action Detection, Activity Detection | Status: Unverified
On-the-fly Text Retrieval for End-to-End ASR Adaptation | Mar 20, 2023 | Tags: Language Modeling, Language Modelling | Status: Unverified
Knowledge Distillation from Multiple Foundation Models for End-to-End Speech Recognition | Mar 20, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Code-Switching Text Generation and Injection in Mandarin-English ASR | Mar 20, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
A Deep Learning System for Domain-specific Speech Recognition | Mar 18, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
DistillW2V2: A Small and Streaming Wav2vec 2.0 Based ASR Model | Mar 16, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Visual Information Matters for ASR Error Correction | Mar 16, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Unverified
Trustera: A Live Conversation Redaction System | Mar 16, 2023 | Tags: Automatic Speech Recognition, Natural Language Understanding | Status: Unverified
Improving Perceptual Quality, Intelligibility, and Acoustics on VoIP Platforms | Mar 16, 2023 | Tags: Multi-Task Learning, Speech Enhancement | Status: Unverified
Sharing Low Rank Conformer Weights for Tiny Always-On Ambient Speech Recognition Models | Mar 15, 2023 | Tags: speech-recognition, Speech Recognition | Status: Unverified
HYBRIDFORMER: improving SqueezeFormer with hybrid attention and NSR mechanism | Mar 15, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Code Available
A large-scale multimodal dataset of human speech recognition | Mar 15, 2023 | Tags: Lip Reading, Motion Detection | Status: Unverified
Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences | Mar 15, 2023 | Tags: Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Status: Code Available