DENOASR: Debiasing ASRs through Selective Denoising | Oct 22, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Enhancing Low-Resource ASR through Versatile TTS: Bridging the Data Gap | Oct 22, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
AlignVSR: Audio-Visual Cross-Modal Alignment for Visual Speech Recognition | Oct 21, 2024 | cross-modal alignment, speech-recognition | Code Available | 1
Moonshine: Speech Recognition for Live Transcription and Voice Commands | Oct 21, 2024 | Decoder, Position | Code Available | 9
Acoustic Model Optimization over Multiple Data Sources: Merging and Valuation | Oct 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding | Oct 21, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Ichigo: Mixed-Modal Early-Fusion Realtime Voice Assistant | Oct 20, 2024 | Question Answering, speech-recognition | Code Available | 7
End-to-End Transformer-based Automatic Speech Recognition for Northern Kurdish: A Pioneering Approach | Oct 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention | Oct 19, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1
AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup | Oct 18, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Roadmap towards Superhuman Speech Understanding using Large Language Models | Oct 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR | Oct 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation | Oct 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Computational Approaches to Arabic-English Code-Switching | Oct 17, 2024 | Data Augmentation, Language Identification | Unverified | 0
EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning | Oct 17, 2024 | Representation Learning, Self-Supervised Learning | Code Available | 1
Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action Detection, Activity Detection | Unverified | 0
A Framework for Adapting Human-Robot Interaction to Diverse User Groups | Oct 15, 2024 | Action Detection, Activity Detection | Code Available | 0
In-Materia Speech Recognition | Oct 14, 2024 | Autonomous Driving, speech-recognition | Unverified | 0
Character-aware audio-visual subtitling in context | Oct 14, 2024 | Language Modelling, Large Language Model | Unverified | 0
State of NLP in Kenya: A Survey | Oct 13, 2024 | Information Retrieval, Machine Translation | Unverified | 0
SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Oct 12, 2024 | AudioCaps, Audio captioning | Unverified | 0
Automatic Speech Recognition with BERT and CTC Transformers: A Review | Oct 12, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
UniGlyph: A Seven-Segment Script for Universal Language Representation | Oct 11, 2024 | Diversity, speech-recognition | Unverified | 0
Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities | Oct 11, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Full-Rank No More: Low-Rank Weight Training for Modern Speech Recognition Models | Oct 10, 2024 | speech-recognition, Speech Recognition | Unverified | 0
A two-stage transliteration approach to improve performance of a multilingual ASR | Oct 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Advocating Character Error Rate for Multilingual ASR Evaluation | Oct 9, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
The USTC-NERCSLIP Systems for the CHiME-8 MMCSG Challenge | Oct 8, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges | Oct 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Incorporating Talker Identity Aids With Improving Speech Recognition in Adversarial Environments | Oct 7, 2024 | Speaker Identification, speech-recognition | Unverified | 0
CR-CTC: Consistency regularization on CTC for improved speech recognition | Oct 7, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Punctuation Prediction for Polish Texts using Transformers | Oct 6, 2024 | Prediction, Reading Comprehension | Unverified | 0
Casablanca: Data and Models for Multidialectal Arabic Speech Recognition | Oct 6, 2024 | Arabic Speech Recognition, speech-recognition | Unverified | 0
The OCON model: an old but gold solution for distributable supervised classification | Oct 5, 2024 | Automatic Speech Recognition, Classification | Code Available | 0
Enhancement of Dysarthric Speech Reconstruction by Contrastive Learning | Oct 5, 2024 | Contrastive Learning, speech-recognition | Unverified | 0
The OCON model: an old but green solution for distributable supervised classification for acoustic monitoring in smart cities | Oct 5, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Reverb: Open-Source ASR and Diarization from Rev | Oct 4, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Self-Powered LLM Modality Expansion for Large Speech-Text Models | Oct 4, 2024 | Automatic Speech Recognition, Instruction Following | Code Available | 0
Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques | Oct 4, 2024 | Automatic Speech Recognition, speech-recognition | Unverified | 0
Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges | Oct 4, 2024 | Dialect Identification, Diversity | Code Available | 0
Convolutional Variational Autoencoders for Spectrogram Compression in Automatic Speech Recognition | Oct 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
HAINAN: Fast and Accurate Transducer for Hybrid-Autoregressive ASR | Oct 3, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems | Oct 3, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
A Pilot Study of Applying Sequence-to-Sequence Voice Conversion to Evaluate the Intelligibility of L2 Speech Using a Native Speaker's Shadowings | Oct 3, 2024 | speech-recognition, Speech Recognition | Unverified | 0
Efficient Streaming LLM for Speech Recognition | Oct 2, 2024 | Decoder, speech-recognition | Unverified | 0
Spoken Grammar Assessment Using LLM | Oct 2, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
End-to-End Speech Recognition with Pre-trained Masked Language Model | Oct 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
Recent Advances in Speech Language Models: A Survey | Oct 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 2
Automatic Speech Recognition for the Ika Language | Oct 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0
VHASR: A Multimodal Speech Recognition System With Vision Hotwords | Oct 1, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1