| Enhancing Multimodal Sentiment Analysis for Missing Modality through Self-Distillation and Unified Modality Cross-Attention | Oct 19, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| AC-Mix: Self-Supervised Adaptation for Low-Resource Automatic Speech Recognition using Agnostic Contrastive Mixup | Oct 18, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Failing Forward: Improving Generative Error Correction for ASR with Synthetic Data and Retrieval Augmentation | Oct 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Parameter-efficient Adaptation of Multilingual Multimodal Models for Low-resource ASR | Oct 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Roadmap towards Superhuman Speech Understanding using Large Language Models | Oct 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Investigation of Speaker Representation for Target-Speaker Speech Processing | Oct 15, 2024 | Action DetectionActivity Detection | —Unverified | 0 |
| Automatic Speech Recognition with BERT and CTC Transformers: A Review | Oct 12, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Enhancing Indonesian Automatic Speech Recognition: Evaluating Multilingual Models with Diverse Speech Variabilities | Oct 11, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| A two-stage transliteration approach to improve performance of a multilingual ASR | Oct 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Advocating Character Error Rate for Multilingual ASR Evaluation | Oct 9, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Automatic Screening for Children with Speech Disorder using Automatic Speech Recognition: Opportunities and Challenges | Oct 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| CR-CTC: Consistency regularization on CTC for improved speech recognition | Oct 7, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| The OCON model: an old but gold solution for distributable supervised classification | Oct 5, 2024 | Automatic Speech RecognitionClassification | CodeCode Available | 0 |
| The OCON model: an old but green solution for distributable supervised classification for acoustic monitoring in smart cities | Oct 5, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Self-Powered LLM Modality Expansion for Large Speech-Text Models | Oct 4, 2024 | Automatic Speech RecognitionInstruction Following | CodeCode Available | 0 |
| Team MTS @ AutoMin 2021: An Overview of Existing Summarization Approaches and Comparison to Unsupervised Summarization Techniques | Oct 4, 2024 | Automatic Speech Recognitionspeech-recognition | —Unverified | 0 |
| Convolutional Variational Autoencoders for Spectrogram Compression in Automatic Speech Recognition | Oct 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Algorithms For Automatic Accentuation And Transcription Of Russian Texts In Speech Recognition Systems | Oct 3, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Spoken Grammar Assessment Using LLM | Oct 2, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| End-to-End Speech Recognition with Pre-trained Masked Language Model | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Recent Advances in Speech Language Models: A Survey | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 2 |
| VHASR: A Multimodal Speech Recognition System With Vision Hotwords | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| Automatic Speech Recognition for the Ika Language | Oct 1, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| MOSEL: 950,000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages | Oct 1, 2024 | Automatic Speech Recognitionspeech-recognition | CodeCode Available | 2 |
| Mamba for Streaming ASR Combined with Unimodal Aggregation | Sep 30, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |