CosyVoice 3: Towards In-the-wild Speech Generation via Scaling-up and Post-training | May 23, 2025 | Automatic Speech Recognition, Emotion Recognition | Code Available 115
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia | Jan 23, 2025 | Emotion Recognition, Event Detection | Code Available 35
Attention Is All You Need | Jun 12, 2017 | Abstractive Text Summarization, All | Code Available 35
EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark | Jun 11, 2024 | Cross-corpus, Emotion Recognition | Code Available 35
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation | Dec 23, 2023 | Emotion Recognition, Self-Supervised Learning | Code Available 35
EmoSphere-SER: Enhancing Speech Emotion Recognition Through Spherical Representation with Auxiliary Classification | May 26, 2025 | Emotion Recognition, regression | Code Available 25
AST: Audio Spectrogram Transformer | Apr 5, 2021 | Audio Classification, Audio Tagging | Code Available 25
EMO-SUPERB: An In-depth Look at Speech Emotion Recognition | Feb 20, 2024 | Emotion Recognition, Self-Supervised Learning | Code Available 25
BLSP-Emo: Towards Empathetic Large Speech-Language Models | Jun 6, 2024 | Emotion Recognition, Instruction Following | Code Available 25
Dawn of the transformer era in speech emotion recognition: closing the valence gap | Mar 14, 2022 | Cross-corpus, Emotion Recognition | Code Available 25
LauraGPT: Listen, Attend, Understand, and Regenerate Audio with GPT | Oct 7, 2023 | Audio captioning, Automatic Speech Recognition | Code Available 25
SERAB: A multi-lingual benchmark for speech emotion recognition | Oct 7, 2021 | Benchmarking, Emotion Recognition | Code Available 15
SigWavNet: Learning Multiresolution Signal Wavelet Network for Speech Emotion Recognition | Feb 1, 2025 | Denoising, Emotion Recognition | Code Available 15
SER Evals: In-domain and Out-of-domain Benchmarking for Speech Emotion Recognition | Aug 14, 2024 | Automatic Speech Recognition, Benchmarking | Code Available 15
Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset | Oct 28, 2020 | Decoder, Emotion Recognition | Code Available 15
LSSED: a large-scale dataset and benchmark for speech emotion recognition | Jan 30, 2021 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Semi-FedSER: Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling | Mar 15, 2022 | Emotion Recognition, Federated Learning | Code Available 15
SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers | Nov 4, 2022 | Cross-corpus, Emotion Recognition | Code Available 15
Enhancing Speech Emotion Recognition Through Differentiable Architecture Search | May 23, 2023 | Emotion Recognition, Neural Architecture Search | Code Available 15
Large Raw Emotional Dataset with Aggregation Mechanism | Dec 23, 2022 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Cross-Lingual Cross-Age Group Adaptation for Low-Resource Elderly Speech Emotion Recognition | Jun 26, 2023 | Data Augmentation, Emotion Recognition | Code Available 15
Jointly Fine-Tuning “BERT-like” Self Supervised Models to Improve Multimodal Speech Emotion Recognition | Aug 15, 2020 | Emotion Recognition, Multimodal Deep Learning | Code Available 15
A Persian ASR-based SER: Modification of Sharif Emotional Speech Database and Investigation of Persian Text Corpora | Nov 18, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available 15
Odyssey 2024 - Speech Emotion Recognition Challenge: Dataset, Baseline Framework, and Results | Jun 20, 2024 | Attribute, Emotion Recognition | Code Available 15
Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition | Mar 2, 2021 | Emotion Recognition, Sentence | Code Available 15
Privacy-preserving Speech Emotion Recognition through Semi-Supervised Federated Learning | Feb 5, 2022 | Emotion Recognition, Federated Learning | Code Available 15
SepTr: Separable Transformer for Audio Spectrogram Processing | Mar 17, 2022 | Audio Classification, Speech Emotion Recognition | Code Available 15
Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition | Apr 6, 2020 | Deep Learning, Emotion Recognition | Code Available 15
DWFormer: Dynamic Window transFormer for Speech Emotion Recognition | Mar 3, 2023 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection | Aug 7, 2023 | Continual Learning, Emotion Recognition | Code Available 15
Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer | Mar 26, 2024 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Frame-level emotional state alignment method for speech emotion recognition | Dec 27, 2023 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings | Apr 8, 2021 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
A vector quantized masked autoencoder for speech emotion recognition | Apr 21, 2023 | Emotion Recognition, Self-Supervised Learning | Code Available 15
Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition | Mar 24, 2020 | Emotion Recognition, regression | Code Available 15
Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion Recognition | Aug 4, 2023 | Cross-corpus, Domain Adaptation | Code Available 15
EmoGator: A New Open Source Vocal Burst Dataset with Baseline Machine Learning Classification Methodologies | Jan 2, 2023 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Attribute Inference Attack of Speech Emotion Recognition in Federated Learning Settings | Dec 26, 2021 | Attribute, Emotion Recognition | Code Available 15
Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention | Jun 8, 2021 | Emotion Classification, Emotion Recognition | Code Available 15
A proposal for Multimodal Emotion Recognition using aural transformers and Action Units on RAVDESS dataset | Dec 30, 2021 | Autonomous Driving, Emotion Recognition | Code Available 15
Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset | Oct 9, 2021 | Deep Learning, Emotion Recognition | Code Available 15
emoDARTS: Joint Optimisation of CNN & Sequential Neural Network Architectures for Superior Speech Emotion Recognition | Mar 21, 2024 | Emotion Recognition, Neural Architecture Search | Code Available 15
EmoNet: A Transfer Learning Framework for Multi-Corpus Speech Emotion Recognition | Mar 10, 2021 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition | Oct 28, 2022 | Emotion Recognition, Representation Learning | Code Available 15
Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detection, continuous-control | Code Available 15
Compact Graph Architecture for Speech Emotion Recognition | Aug 5, 2020 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition | Oct 12, 2021 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition | Oct 7, 2021 | Emotion Recognition, Speech Emotion Recognition | Code Available 15
MMER: Multimodal Multi-task Learning for Speech Emotion Recognition | Mar 31, 2022 | Emotion Recognition, Multimodal Emotion Recognition | Code Available 15
Speech Emotion Diarization: Which Emotion Appears When? | Jun 22, 2023 | Emotion Recognition, speaker-diarization | Code Available 15