A Multi-Purpose Audio-Visual Corpus for Multi-Modal Persian Speech Recognition: the Arman-AV Dataset Jan 21, 2023 Audio-Visual Speech Recognition Automatic Speech Recognition
— Unverified 0Regeneration Learning: A Learning Paradigm for Data Generation Jan 21, 2023 Image Generation Representation Learning
— Unverified 0Neural Architecture Search: Insights from 1000 Papers Jan 20, 2023 Natural Language Understanding Neural Architecture Search
Code Code Available 0Language Agnostic Data-Driven Inverse Text Normalization Jan 20, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0From English to More Languages: Parameter-Efficient Model Reprogramming for Cross-Lingual Speech Recognition Jan 19, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Adapting Multilingual Speech Representation Model for a New, Underresourced Language through Multilingual Fine-tuning and Continued Pretraining Jan 18, 2023 speech-recognition Speech Recognition
— Unverified 0Syllable Subword Tokens for Open Vocabulary Speech Recognition in Malayalam Jan 17, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0BayesSpeech: A Bayesian Transformer Network for Automatic Speech Recognition Jan 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset Jan 16, 2023 Audio-Visual Speech Recognition Lip Reading
Code Code Available 1Multi-resolution location-based training for multi-channel continuous speech separation Jan 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Using Kaldi for Automatic Speech Recognition of Conversational Austrian German Jan 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Rationalizing Predictions by Adversarial Information Calibration Jan 15, 2023 Language Modelling Prediction
— Unverified 0Streaming Punctuation: A Novel Punctuation Technique Leveraging Bidirectional Context for Continuous Speech Recognition Jan 10, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0FullStop:Punctuation and Segmentation Prediction for Dutch with Transformers Jan 9, 2023 Language Modelling Machine Translation
— Unverified 0Equivariant and Steerable Neural Networks: A review with special emphasis on the symmetric group Jan 8, 2023 speech-recognition Speech Recognition
— Unverified 0Using External Off-Policy Speech-To-Text Mappings in Contextual End-To-End Automated Speech Recognition Jan 6, 2023 Domain Adaptation GPU
— Unverified 0Audio-Visual Efficient Conformer for Robust Speech Recognition Jan 4, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Supervised Acoustic Embeddings And Their Transferability Across Languages Jan 3, 2023 speech-recognition Speech Recognition
Code Code Available 0Towards Voice Reconstruction from EEG during Imagined Speech Jan 2, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Unsupervised Pre-Training for Vietnamese Automatic Speech Recognition in the HYKIST Project Jan 1, 2023 Automatic Speech Recognition speech-recognition
— Unverified 0ReVISE: Self-Supervised Speech Resynthesis With Visual Input for Universal and Generalized Speech Regeneration Jan 1, 2023 Audio-Visual Speech Recognition Resynthesis
— Unverified 0Sample-Efficient Unsupervised Domain Adaptation of Speech Recognition Systems A case study for Modern Greek Dec 31, 2022 Diversity Domain Adaptation
— Unverified 0Memory Augmented Lookup Dictionary based Language Modeling for Automatic Speech Recognition Dec 30, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Macro-block dropout for improved regularization in training end-to-end speech recognition models Dec 29, 2022 Decoder speech-recognition
— Unverified 0Learning to Detect Noisy Labels Using Model-Based Features Dec 28, 2022 Meta-Learning speech-recognition
Code Code Available 1Don't Be So Sure! Boosting ASR Decoding via Confidence Relaxation Dec 27, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Skit-S2I: An Indian Accented Speech to Intent dataset Dec 26, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Alignment Entropy Regularization Dec 22, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 04D ASR: Joint modeling of CTC, Attention, Transducer, and Mask-Predict decoders Dec 21, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0ReVISE: Self-Supervised Speech Resynthesis with Visual Input for Universal and Generalized Speech Enhancement Dec 21, 2022 Audio-Visual Speech Recognition Resynthesis
— Unverified 0End-to-End Automatic Speech Recognition model for the Sudanese Dialect Dec 21, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks Dec 20, 2022 Dialog Act Classification Question Answering
— Unverified 0Mu^2SLAM: Multitask, Multilingual Speech and Language Models Dec 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0NusaCrowd: Open Source Initiative for Indonesian NLP Resources Dec 19, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2AdaTranS: Adapting with Boundary-based Shrinking for End-to-End Speech Translation Dec 17, 2022 Machine Translation speech-recognition
— Unverified 0Fast Entropy-Based Methods of Word-Level Confidence Estimation for End-To-End Automatic Speech Recognition Dec 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech Aware Dialog System Technology Challenge (DSTC11) Dec 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Context-aware Fine-tuning of Self-supervised Speech Models Dec 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0BLASER: A Text-Free Speech-to-Speech Translation Evaluation Metric Dec 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 2Effectiveness of Text, Acoustic, and Lattice-based representations in Spoken Language Understanding tasks Dec 16, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Improving Fast-slow Encoder based Transducer with Streaming Deliberation Dec 15, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Tackling the Cocktail Fork Problem for Separation and Transcription of Real-World Soundtracks Dec 14, 2022 Action Detection Activity Detection
— Unverified 0Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language Dec 14, 2022 Decoder image-classification
Code Code Available 1Disentangling Prosody Representations with Unsupervised Speech Reconstruction Dec 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Speech and Natural Language Processing Technologies for Pseudo-Pilot Simulator Dec 14, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Jointly Learning Visual and Auditory Speech Representations from Raw Data Dec 12, 2022 Audio-Visual Speech Recognition Lipreading
Code Code Available 1BASPRO: a balanced script producer for speech corpus collection based on the genetic algorithm Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1End-to-End Speech Translation of Arabic to English Broadcast News Dec 11, 2022 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Leveraging Modality-specific Representations for Audio-visual Speech Recognition via Reinforcement Learning Dec 10, 2022 Audio-Visual Speech Recognition reinforcement-learning
— Unverified 0GPU-accelerated Guided Source Separation for Meeting Transcription Dec 10, 2022 blind source separation CPU
Code Code Available 1